Quote:
Originally Posted by shevirsy
Thanks for the links. I know the wikipedia article. It's depressing. Abbyy Finereader and that's about all. I was hoping for some missed gem.
|
There are a few proprietary programs that aren't on that list, and it does seem like that Wikipedia comparison COULD use some updating (for example, it says Finereader's latest version says 11, when 12 came out earlier this year).
If you want free, Tesseract is probably the best bet.
Quote:
Originally Posted by shevirsy
Well, Scan Tailor might help a lot more. But Jellby is right and I won't call names the users who just groom their post count. I just say "bye bye" and add them on ignore.
|
I used Scan Tailor when I first started to get into this, but now I lean in favor of the tools just built directly in Finereader. I find that Scan Tailor manipulated the original source images a little TOO much for my liking. (Another reason to lean towards the proprietary programs instead of free, a lot of the image manipulation tools are built-in, and allow easy tweaks/comparisons with the original source, while with something like Tesseract, you will get ONLY the OCR portion).
Also, keep in mind that Scan Tailor was really only built as a MIDDLEWARE program, to fit into a workflow like this:
Dirty/Speckled/Warped/Crappy scans/photos -> Scan Tailor -> OCR program.
It was made to try to clean up the images, so that OCR can (potentially) be more accurate.
Only thing I have found that Scan Tailor does better than Finereader is handling speckled documents, although with all of the negative baggage that comes with Scan Tailor, I have settled on cleaning speckles directly using Imagemagick.