Quote:
Originally Posted by bizzybody
Since comers is such a rarely used and archaic word in English, almost always with the word all before it, any time English OCR software thinks it sees "comers" it should be flagged, tagged and bagged as corners
|
<grin>
I found one instance of exactly the same error in #1 of Diane Duanes 'Young Wizard' series. That particular passed my eye completely. I found it after running a scanno-finding pass.
The guys at Project Gutenberg Distributed Proofing have accumulated a database of the commonly corrections they have to apply, and automated the process of scanning for them with a tool called GuiGuts. It was beautiful watching it find things like that.
It doesn't substitute for proofreading -- and it's still a manual process -- but it'd work great as a backstop, or a verification pass. (If it picks up a lot of errors, it's probably also missed a similar number of errors, but it serves as a good suggestion that the book hasn't been proofed as well as the Distributed Proofers would have managed).