MobileRead Forums - View Single Post

dauwhe · 05-04-2009, 04:23 PM

Quote:

Originally Posted by tirsales

Yes, but it should be possible to create XHTML and ePub from the original source rather than from the PDF, shouldn't it?
Or at least to extract the text (or have the complete text in advance) and reformat that...

Things are slightly better with "application" files (InDesign, etc.). If the book was set by a decent typesetter, the split paragraph problem shouldn't happen, for example. But I do remember a book where most of the text appeared twice when it was first extracted from Quark. The (bad) typesetter had left almost another complete copy of the book "hidden" in a text box, and the extraction program dutifully found all the text, whether hidden or not.
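
For what it's worth, a duplicate like that can be caught before conversion by checking the extracted text for repeated paragraphs. Here is a minimal Python sketch, assuming the extraction is plain text with blank lines between paragraphs. The function names and the 40-character threshold are made up for illustration and are not part of any extraction tool:

```python
from __future__ import annotations

import re
from collections import defaultdict


def normalize(paragraph: str) -> str:
    """Collapse whitespace and case so trivially different copies compare equal."""
    return " ".join(paragraph.split()).lower()


def find_repeated_paragraphs(text: str, min_length: int = 40) -> dict[str, list[int]]:
    """Map each repeated paragraph (normalized) to the paragraph indexes where it occurs.

    Assumption: paragraphs are separated by blank lines. Short paragraphs
    (chapter titles, running heads) are skipped via min_length, since they
    repeat legitimately.
    """
    paragraphs = [p for p in re.split(r"\n\s*\n", text) if p.strip()]
    positions = defaultdict(list)
    for index, paragraph in enumerate(paragraphs):
        key = normalize(paragraph)
        if len(key) >= min_length:
            positions[key].append(index)
    # Keep only paragraphs that were seen more than once.
    return {key: where for key, where in positions.items() if len(where) > 1}
```

If most paragraphs come back with two positions, the file probably contains a second copy of the text somewhere, and it is worth opening the layout file to look for a stray text box before reformatting anything.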

Dave