Thread: Typos in ebooks
View Single Post
Old 04-15-2011, 04:46 PM   #162
sourcejedi
Groupie
sourcejedi ought to be getting tired of karma fortunes by now.sourcejedi ought to be getting tired of karma fortunes by now.sourcejedi ought to be getting tired of karma fortunes by now.sourcejedi ought to be getting tired of karma fortunes by now.sourcejedi ought to be getting tired of karma fortunes by now.sourcejedi ought to be getting tired of karma fortunes by now.sourcejedi ought to be getting tired of karma fortunes by now.sourcejedi ought to be getting tired of karma fortunes by now.sourcejedi ought to be getting tired of karma fortunes by now.sourcejedi ought to be getting tired of karma fortunes by now.sourcejedi ought to be getting tired of karma fortunes by now.
 
sourcejedi's Avatar
 
Posts: 155
Karma: 200000
Join Date: Dec 2009
Location: Britania
Device: Android
Quote:
Originally Posted by bizzybody View Post
Since comers is such a rarely used and archaic word in English, almost always with the word all before it, any time English OCR software thinks it sees "comers" it should be flagged, tagged and bagged as corners
<grin>

I found one instance of exactly the same error in #1 of Diane Duanes 'Young Wizard' series. That particular passed my eye completely. I found it after running a scanno-finding pass.

The guys at Project Gutenberg Distributed Proofing have accumulated a database of the commonly corrections they have to apply, and automated the process of scanning for them with a tool called GuiGuts. It was beautiful watching it find things like that.

It doesn't substitute for proofreading -- and it's still a manual process -- but it'd work great as a backstop, or a verification pass. (If it picks up a lot of errors, it's probably also missed a similar number of errors, but it serves as a good suggestion that the book hasn't been proofed as well as the Distributed Proofers would have managed).
sourcejedi is offline   Reply With Quote