Quote:
Originally Posted by Worldwalker
Google's lousy scans aren't a rights issue.
Whatever the reason is that they don't think they need to make the book, y'know, readable, it's not rights.
|
No OCR is perfect. The only way to correct OCR errors is by hand. Manpower is the reason Google can't correct it all. There are some techs intended to alleviate that, like the CAPTCHAs that use internet users to interpret bad scans. I don't know if Google uses that, but either way, it would take a long time to correct a library that large two or so words at a time.
Quote:
Their PD books are just as awful as, if not worse than, their non-PD books.
|
Up till now, they haven't had the rights to get even their non-PD books any way other than scanning. Once they have agreements with publishers, they won't need to scan those books. Some titles won't have digital versions available at all, and scans will still be there for those. But yes, the reason there is scanning at all for titles that are new enough is a rights issue.