View Single Post
Old 07-20-2015, 03:10 AM   #4
Toxaris
Wizard
Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.
 
Toxaris's Avatar
 
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
Oh, OCR software has gotten a whole lot smarter since you worked with it. The error rate is way down, but there always will be typical OCR errors. Also GIGO plays a big role here. The better the source, the better the results. The main OCR player nowadays is ABBYY Finereader.

Re-typing is not cost-effective. It will cause other errors yet again, which will also be spotted only by proof-reading.

It is not without reason that I made my Word add-in. It is designed to take the output from the OCR process and either fix errors automatically or give you the tools to fix them. It saves me an enormous amount of time in digitizing a text.

The PDF with OCR text overlay is useful. I use it as well. If I find some strange text where I think there is an error but I am not quite sure what it should be, I use that one. It enables me to search quickly to the correct point and then see the original.
Toxaris is offline   Reply With Quote