MobileRead Forums - View Single Post

mr ploppy · 12-01-2010, 10:07 AM

Quote:

Originally Posted by JSWolf

It's not just scans/ocr that are a problem. A real problem can be the source. A lot of eBooks are made from a PDF source and as we know you cannot convert a novel length PDF without errors. I have never seen a way to do so. PDF is a terrible source. I've seen some books from PDF where all the iatalcs run into the text. You get incorrect words and even broken paragraphs. It's sad that publishers don't actually care. Books today are electronically created. So, there is an electronic source that is NOT PDF. So use that and deal. I know most authors use Word. But even the mess that Word makes with HTML can be fixed if you bother to do so.

I would suspect they use PDF because those will be the files they use for print and they are likely to be the most up to date versions. They probably don't even realise that PDF introduces its own errors in the ebook versions.