Quote:
Originally Posted by retiredbiker
Run OCR page by page, doing each column one at a time, avoiding ads and following "continued on page nnn" instructions: Tesseract OCR using OCRFeeder front end.
|
Just to add some context, there was OCR software with interactive correction back in 1990.
The same software could also use dictionaries to improve recognition, which Tesseract still cannot do.