Originally Posted by SensualPoet
Wouldn't all "recent" titles exist as some sort of text-based electronic file? Surely anything that's been in the catalogue consistently since the latter 1980s exists digitally -- and not as pdf images that have to be OCR'd to create an e-book.
You might have thought so, but you'd be surprised. Publishers are an ultra-conservative lot. Although manuscripts have been submitted electronically for some years, many publishers did not retain electronic versions of the books after typesetting them, and hence have to OCR even fairly recent books in order to create an eBook. Hence lots of OCR errors.
Newly-published books should really not have this problem.