The PDF probably stores its images in some low-quality format. That quality is good enough for a human reading it, but not necessarily for an OCR program, which may need other settings (maybe a two-level, i.e. pure black-and-white, TIFF instead of a true-colour JPEG, for instance). Other than that, yes, a scanned PDF is nothing more than a set of images, and you should be able to OCR them just as well once they are extracted.
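To illustrate what "two-level" means in practice, here is a minimal sketch in plain Python that turns a grayscale image into a black-and-white one. It assumes the page has already been extracted from the PDF and decoded into rows of 0-255 grayscale values. The function names `otsu_threshold` and `binarize` are made up for this example, and Otsu's method is just one common way to pick the cut-off, not necessarily what any particular OCR program does internally.

```python
def otsu_threshold(gray):
    """Pick a threshold (0-255) that best separates dark and light pixels.

    `gray` is a list of rows, each a list of ints in 0..255 (an assumption
    of this sketch; a real image would come from an imaging library).
    """
    hist = [0] * 256
    for row in gray:
        for v in row:
            hist[v] += 1

    total = sum(hist)
    sum_all = sum(i * h for i, h in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0      # number of pixels at or below the candidate threshold
    sum0 = 0    # sum of their grey values
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0:
            continue
        if w1 == 0:
            break
        m0 = sum0 / w0
        m1 = (sum_all - sum0) / w1
        # Between-class variance (up to a constant factor).
        var = w0 * w1 * (m0 - m1) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t


def binarize(gray, threshold):
    """Return a two-level image: 0 for dark (ink), 1 for light (paper)."""
    return [[1 if v > threshold else 0 for v in row] for row in gray]


# Tiny made-up "image": dark text pixels on a light background.
page = [[20, 30, 220],
        [25, 210, 230]]
t = otsu_threshold(page)
bilevel = binarize(page, t)
```

In a real workflow you would let an imaging library do the decoding and save the result as a bilevel TIFF before handing it to the OCR program. The sketch only shows the thresholding step itself.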