It seems to me that your OCR stage is not doing what you expect. I think it must be adding the ocred text to the file, not replacing the scans. The degredation in the image quality is just a result of over-compression of your scans.
If your scanned books were actually being turned into text instead of images, the size would drop to 1MB or less, not 25MB.
Paul
Quote:
Originally Posted by harryE123
Let me describe my set up a bit: I use an opticbook 3600 book scanner, I scan my books to jpg at max quality and then import to acrobat. I use highest quality setting while merging them (instead of default, or the even lower optimize for space), then run an ocr through the file and finally an "optimize file" with the highest setting and all the default settings. Sizes for a 325 p book go like these: about 900 mb, then about 90 mb, then about 25 mb. BUT the difference in quality is striking between the ocr level and the optimization level, the book fonts really degrade after optimization, with letters missing tiny fractions of them everywhere...has anyone run across this? I mean it's understandable why there might be a slight (almost unoticable) drop in quality from after ocr from the 900 mb to the 90 mb file (although why ocr would reduce the size and drop the quality slightly is something I can't figure out, maybe it has to do with the pages no longer being images but text, hence more compact?), but this drop after optimization? Is there maybe a setting in the optimize dialog box that is the culpitt and I need to change?
I will post some comparison photos as well as the exat optimization setting I use (which are the default ones anyway).
|