It's OCR'd because it's the most reliable way of obtaining the final text, especially when it comes to last minute changes on proofs.
And either way, the output to the printer is likely a PDF file, so that's not as useful either.
The other problem is that they're backlist titles, so the publisher won't necessarily have updated files, nor do they do the conversions themselves. It's optimal for them to outsource the conversion and give the converter a copy of the print book as the source.
|