Thread: Typos in ebooks
01-27-2011, 05:42 AM   #128
bizzybody
Addict
Posts: 300
Karma: 8317682
Join Date: Apr 2007
Location: Idaho, USA
Device: Various PalmOS PDAs, Android Phones, Sharper Image Literati
Since "comers" is such a rarely used, archaic word in English, and almost always has the word "all" right before it, any time English OCR software thinks it sees "comers" it should be flagged, tagged and bagged as "corners" unless "all" comes right before it. http://www.thefreedictionary.com/All+comers
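
For anyone who wants to try the "comers" rule on their own scans, here is a rough sketch in Python. The function name fix_comers is made up for illustration, and it assumes the book text is already a plain string. It fixes the word outright rather than just flagging it, which may be too aggressive for some books.

```python
import re

# Match "comers" as a whole word, optionally capturing a preceding "all".
# Assumption: "all" and "comers" are separated only by whitespace.
_COMERS = re.compile(r"\b(all\s+)?(comers)\b", re.IGNORECASE)


def fix_comers(text):
    """Replace "comers" with "corners" unless "all" comes right before it."""

    def repl(match):
        if match.group(1):
            # "all comers" is the one legitimate use, so leave it alone.
            return match.group(0)
        # Keep a leading capital if the misread word had one.
        return "Corners" if match.group(2)[0].isupper() else "corners"

    return _COMERS.sub(repl, text)
```

So fix_comers("the comers of the room") should give back "the corners of the room", while "open to all comers" passes through untouched.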

OCR software needs a lot more work on telling lowercase "m" apart from "rn". I've seen books where nearly every instance of each was recognized as the other. The same goes for some other troublesome letter pairs. A "Does this word really exist in English?" sanity check would cure tons of OCR errors. Of course, that would require OCR software to include spelling and grammar checking too.
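
The sanity check could look something like the Python sketch below. This is only an illustration, not how any real OCR package does it: the names CONFUSIONS, variants and check_word are invented here, the word list is whatever set of lowercase English words you supply, and only the m/rn pair is covered (other troublesome pairs would be added to the list the same way).

```python
# Letter sequences that OCR tends to mix up, in both directions.
# Only the m/rn pair is listed; extend this for other pairs.
CONFUSIONS = [("rn", "m"), ("m", "rn")]


def variants(word):
    """Yield every spelling of word with one confusable sequence swapped."""
    for seen, meant in CONFUSIONS:
        start = word.find(seen)
        while start != -1:
            yield word[:start] + meant + word[start + len(seen):]
            start = word.find(seen, start + 1)


def check_word(word, dictionary):
    """Return [word] if it is a known word, else the known variants.

    An empty list means nothing matched and a human should look at it.
    Words are compared in lowercase.
    """
    word = word.lower()
    if word in dictionary:
        return [word]
    return sorted({v for v in variants(word) if v in dictionary})
```

With a toy word list such as {"modern", "summer", "corner"}, check_word("surnmer", words) comes back with ["summer"]. Note that a check like this cannot catch "comers", because that one really does exist in English; it needs a look at the neighboring words.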