My replacement regex for the <div... stuff added a </p> to the end of all those paragraphs. So all I did was delete all the <br...> tags and leave the </p> tags to handle the paragraph spacing. I had to manually adjust some of the paragraphs on a couple of pages (the copyright page, for instance). But, in general, that part was easy.
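For anyone who wants to do the same <br...> cleanup outside the editor, here is a rough Python sketch of the idea. It is not the exact search I ran: the pattern, the function name strip_br_tags, and the sample markup are all made up for illustration, and it assumes the <br...> tags carry nothing you want to keep.

```python
import re

# Assumed pattern: matches <br>, <br/>, <br /> and <br class="..."> in any
# letter case. The \b keeps it from touching other tags that start with "br".
BR_TAG = re.compile(r"<br\b[^>]*>", re.IGNORECASE)


def strip_br_tags(html: str) -> str:
    """Delete every <br...> tag and leave all other markup, including </p>, alone."""
    return BR_TAG.sub("", html)


# Hypothetical sample markup, not taken from the actual book.
sample = '<p>First paragraph.</p><br class="x"/>\n<p>Second paragraph.</p><BR>'
print(strip_br_tags(sample))
```

Run it on a copy of the file first; a regex like this has no idea whether a particular <br...> was doing real work, such as the line breaks on a copyright page.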
I've got a reasonable-looking ebook now and am reading through it to spot issues and to find places where I can put my own heading/chapter marks and scene breaks. For some reason, Zelazny and/or the publishers didn't bother with things like that.
I've been scratching my head over why the publisher would have put that ridiculous html and css stuff in there. My guess is that they started with either a scanned copy of the paper book (or a PDF of one) and stuck those styles in there to match how the scan came out rather than how the page should have looked. Why else would they have lines/words in a single paragraph changing their height? I'd have thought that someone might have actually looked at the finished product and realized they were trying to reproduce scanning issues in CSS.
And, BTW, the original issue I started this thread with (liga 0) is now OBE: I deleted almost all of the stuff in those areas. Sorry I so quickly caused this thread to stray from an Editor issue to a Conversion one.