It might help to see what kinds of errors are common with the OCR software you use, then write a script to go through the OCR output, looking for those types of errors, so that you can manually correct them. You could have the script find and replace, but odds are that the characters you are searching for will be in there legitimately somewhere.
|