View Single Post
Old 04-29-2011, 04:02 AM   #6
DSpider
Evangelist
DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.DSpider ought to be getting tired of karma fortunes by now.
 
DSpider's Avatar
 
Posts: 450
Karma: 343115
Join Date: Nov 2009
Location: Romania
Device: PW2 2014
There was an article on Lifehacker a few months ago on which OCR program is the best and ABBYY FineReader had the most votes. I agree, but no OCR program is perfect. Far from it. Especially if the printed material was of poor quality and inconsistent throughout the book, you could end up with entire pages of bold text...


From what I've experienced, FineReader has some issues with Romanian (like not detecting the capital "î", wrong quote marks, etc), but those can be fixed either while proof-reading the whole thing or by doing a batch replace.

For instance replacing:

". î" with ". Î"
"? î" with "? Î"
"! î" with "! Î"

But it's not always a good idea. Sometimes it will mess up the indentation. For instance if a comma (,) was mistook for a period (.) then the whole phrase would be split and could have a whole different meaning. If you find recurring inconsistencies with certain characters in your language you should do a manual search and selectively replace them instead of doing a batch replace.
DSpider is offline   Reply With Quote