Quote:
Originally Posted by tirsales
Yes - but it should be possible to create XHTML and ePub not from the PDF but from the original source, shouldn't it?
Or at least it should be possible to extract the text (or have the complete text in advance) and reformat that...
Things are slightly better with "application" files (InDesign, etc.). If done by a decent typesetter, the split paragraph problem shouldn't happen, for example. But I do remember a book where most of the text appeared twice when first extracted from Quark. The (bad) typesetter had left almost another complete copy of the book "hidden" in a text box. The extraction program dutifully found all the text, whether hidden or not.
Dave