Quote:
Originally Posted by Nergal
For the inital question: I recommend to have a look at tesseract ocr - it is an opensource command line tool - with an amazing recognition rate (95-99.9 %, mostly at 98-99% for me).
|
I am afraid Tesseract is not for me. I need some additional languages, I'd like a better accuracy, and the most important, I need a reasonable layout detection - the software must, at least, be able to detect paragraphs and store each on one line. That alone is worth the price difference for me. Thanks for the suggestion, though.