View Single Post
Old 03-29-2011, 09:32 AM   #10
itimpi
Wizard
itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.
 
Posts: 4,552
Karma: 950151
Join Date: Nov 2008
Device: Sony PRS-950, iphone/ipad (Marvin/iBooks/QuickReader)
Quote:
Originally Posted by cybmole View Post
so would ALL pdf convert programs fall at this hurdle ???
On this particular file I would say that the answer is yes.

It sounds like the file in question was created by scanning in image on the book and then applying OCR technology to create the underlying text. In this case some characters were not recognised correctly at the OCR stage. Unless you had some way of re-applying the OCR step (and doing a better job than the original program) then all conversion programs are going to fail with this PDF file.

Many (possibly the majority) PDF file are created from the original word processed document. In such a case the PDF file does not have the overlaying image and the underlying text is complete so a conversion program has a chance. However PDF conversion is still a little fraught ever with files created this way because of tricks that PDF does (ligatures, absolute placement of text, special symbols, etc) that a conversion program can struggle to understand and convert sensibly.
itimpi is offline   Reply With Quote