MobileRead Forums - View Single Post - pdf to epub

itimpi · 03-29-2011, 09:32 AM

Quote:

Originally Posted by cybmole

so would ALL pdf convert programs fall at this hurdle ???

On this particular file I would say that the answer is yes.

It sounds like the file in question was created by scanning in image on the book and then applying OCR technology to create the underlying text. In this case some characters were not recognised correctly at the OCR stage. Unless you had some way of re-applying the OCR step (and doing a better job than the original program) then all conversion programs are going to fail with this PDF file.

Many (possibly the majority) PDF file are created from the original word processed document. In such a case the PDF file does not have the overlaying image and the underlying text is complete so a conversion program has a chance. However PDF conversion is still a little fraught ever with files created this way because of tricks that PDF does (ligatures, absolute placement of text, special symbols, etc) that a conversion program can struggle to understand and convert sensibly.