MobileRead Forums - View Single Post - Created A Document with Adobe Acrobat, DRMed?

Jellby · 04-12-2009, 02:10 PM

The PDF probably has its images encoded in some low-quality format. The quality is good enough for a human reading it, but not for an OCR program, which may need other settings (maybe a two-level TIFF instead of a true-colour JPG, for instance). Other than that, yes, a scanned PDF is nothing more than a set of images, and you should be able to OCR them just as well.
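
To illustrate what "two-level" means here, this is a minimal sketch in plain Python, not what any particular OCR program does internally. It assumes the page has already been extracted from the PDF as a grid of greyscale values (0 = black, 255 = white); the `binarize` function, the fixed threshold of 128 and the tiny sample page are all made up for the example.

```python
def binarize(gray, threshold=128):
    """Return a two-level copy of a greyscale page.

    gray is a list of rows, each a list of ints in 0..255.
    Pixels darker than the threshold become 0 (ink), the rest 255 (paper).
    The threshold of 128 is an arbitrary assumption, not a recommended value.
    """
    return [[0 if px < threshold else 255 for px in row] for row in gray]


# Hypothetical 3x4 scrap of a scanned page: light paper with a few dark strokes.
page = [
    [250, 245, 90, 240],
    [248, 60, 40, 235],
    [251, 243, 85, 238],
]

bilevel = binarize(page)
for row in bilevel:
    print(row)
```

A real scan would usually need a threshold chosen per page rather than a fixed one, and the result would then be saved in a lossless two-level format such as TIFF before being handed to the OCR program.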
