View Single Post
Old 07-11-2007, 11:01 PM   #29
Xenophon
curmudgeon
Xenophon ought to be getting tired of karma fortunes by now.Xenophon ought to be getting tired of karma fortunes by now.Xenophon ought to be getting tired of karma fortunes by now.Xenophon ought to be getting tired of karma fortunes by now.Xenophon ought to be getting tired of karma fortunes by now.Xenophon ought to be getting tired of karma fortunes by now.Xenophon ought to be getting tired of karma fortunes by now.Xenophon ought to be getting tired of karma fortunes by now.Xenophon ought to be getting tired of karma fortunes by now.Xenophon ought to be getting tired of karma fortunes by now.Xenophon ought to be getting tired of karma fortunes by now.
 
Xenophon's Avatar
 
Posts: 1,487
Karma: 5748190
Join Date: Jun 2006
Location: Redwood City, CA USA
Device: Kobo Aura HD, (ex)nook, (ex)PRS-700, (ex)PRS-500
Quote:
Originally Posted by Steve Jordan View Post
I don't know what's going on with PG's proofers, but perhaps they need a better (or more standardized, perhaps) OCR process in the first place, to minimize errors.

We've discussed this elsewhere on this forum. My personal recommendation has always been to photocopy pages to a larger size, as close to paper size as possible, and OCR from that... the larger the text, the fewer the errors. Without knowing how PG does it, though, I can't make a definitive statement, only suggestions.
If you want to know 'how PG does it' -- now, that is -- go visit www.pgdp.net. They have a very organized and effective method for proofing books that results in the cleanest available text. But Distributed Proofreaders has only been going since 2000, while PG got going nearly 20 years ago. At this point DP has produced (that is to say, scanned and proofed) a bit more than 50% of the total content at PG, and they're producing something like 95+% of what's getting added. That said, there's still the ~50% that didn't go through DP and which has a much higher frequency of typos, scanos, and OCR goofs.
I'm sure that there is some way of knowing which books arrived which way, but I don't know what it is.
Xenophon is offline   Reply With Quote