View Single Post
Old 02-08-2011, 04:53 PM   #7
tjung
Junior Member
tjung began at the beginning.
 
Posts: 8
Karma: 10
Join Date: Feb 2011
Device: iPhone
Quote:
Originally Posted by ldolse View Post
This is covered in the PDF faq... This is a common occurrence with OCR pdfs that use images. It's true that if there is good OCR text that this is normally what's extracted, but there are many ways to define a pdf, and clearly whatever way yours is defined is not compatible with Calibre.

If the underlying OCR is good and you want that text, then the only option I can think of is for you is to use Acrobat Professional to try and change the way the pdf is formatted. There are a bunch of pdf 'optimization' options in Acrobat - for different pdf version compatibility/better compression - sometimes optimizing/re-compressing the pdf using Acrobat will make it compatible with Calibre's libraries (but not always).

Other freeware/open source pdf tools may do the same thing, but I don't have any experience with those.

I tried doing what you suggested and told it to reformat/save in it's "mobile" format which made it Acrobat 7.X compatible. Acrobat Pro said the original file was Acrobat 2.3 compatible. Anyway I didn't have any luck with that suggestion. So I guess I am stuck copy and pasting the text in to Sigil and then editing the whole thing by hand. Not what I wanted but at least it will get me the book in EPUB format or any other format I want in the future.

Thanks everyone for the suggestions and help.
tjung is offline   Reply With Quote