View Single Post
Old 03-22-2009, 08:18 PM   #249
Patricia
Reader
Patricia ought to be getting tired of karma fortunes by now.Patricia ought to be getting tired of karma fortunes by now.Patricia ought to be getting tired of karma fortunes by now.Patricia ought to be getting tired of karma fortunes by now.Patricia ought to be getting tired of karma fortunes by now.Patricia ought to be getting tired of karma fortunes by now.Patricia ought to be getting tired of karma fortunes by now.Patricia ought to be getting tired of karma fortunes by now.Patricia ought to be getting tired of karma fortunes by now.Patricia ought to be getting tired of karma fortunes by now.Patricia ought to be getting tired of karma fortunes by now.
 
Patricia's Avatar
 
Posts: 11,504
Karma: 8720163
Join Date: May 2007
Location: South Wales, UK
Device: Sony PRS-500, PRS-505, Asus EEEpc 4G
Quote:
Originally Posted by cerement View Post
I think it's because most of the converters here are starting with Gutenberg texts (or archive.org texts or Google Books), stuff that has been OCRed, then proofed a couple of times, but not to any major extent. We're just adding another layer of proofing.
The internet Archive text files at archive.org are not proofed. Nor are the text files derived from google books. The OCR from both leaves an awful lot to be desired. I have just spent all day cleaning one up.
Patricia is offline   Reply With Quote