Quote:
Originally Posted by willus
You may wish to try the -ocrout option which just dumps all of the OCR text to an ASCII (UTF-8) file:
-ocrout outfile.txt
You'll probably have to go through and clean it up a bit, but the OCR layer appears to be very good, so hopefully your editing will be minimal. I've attached the output from pages 20-25.
|
That's great. Thank so so much for you help! It strips the formatting, but really, that should be too hard to patch back up. I really appreciate the help, and I'll try my best to help others in return.

