Thread: OCR to use
05-26-2008, 01:29 PM   #16
pepak
Guru
Posts: 610
Karma: 4150
Join Date: Mar 2008
Device: Sony Reader PRS-T3, Kobo Libra H2O
Quote:
Originally Posted by Nergal
For the initial question: I recommend having a look at Tesseract OCR. It is an open-source command-line tool with an amazing recognition rate (95-99.9%, mostly 98-99% for me).
I am afraid Tesseract is not for me. I need some additional languages, I'd like better accuracy, and, most importantly, I need reasonable layout detection: the software must, at least, be able to detect paragraphs and store each one on a single line. That alone is worth the price difference for me. Thanks for the suggestion, though.