I also use ABBYY Finereader. I've found it quicker to save the OCRed file to MS Word (or the free OpenOffice equivalent) to do my editing. In Word you can use the spell checker (but it doesn't fine everything, e.g. ABBYY often sees I'll as 111 and the spell checker is OK with 111.)
You will find many common errors depending on the printed font, e.g. rn may be seen as m and vice versa). If/when you keep running into the same error, you can then do a global replace. Some of my books are westerns and ABBYY will not recognize the tilde in seņor. This can be globally changed, etc.
Another thing, ABBYY doesn't like the standard way of showing Em dashes (nor do I). The standard is "word—word". Both MS Word and ABBYY see this as a spelling error. I wrote a macro to change it to "word — word" which does not show as a spelling error and I prefer it this way. Sometimes ABBYY will see a space between words as a double space. These can be easily changed with a "replace all".
Last edited by slayda; 02-04-2009 at 11:21 AM.
|