View Single Post
Old 02-21-2011, 05:58 PM   #7
DiapDealer
Grand Sorcerer
DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.
 
DiapDealer's Avatar
 
Posts: 27,549
Karma: 193191846
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
Quote:
I was checking some books that had gone into Calibre via the K4MobiDeDRM plug in and noticed several text errors in one of them, which are not there in the original DRM version.
To demonstrate what is actually going on:

with the original ebook, search for the text that is being re-produced "incorrectly" after the conversion and see what the search returns.

Then actually type in the mispelled word(s) - still in the original ebook - and see what that search returns.

The text errors are in the original ebook... you just don't see them, because the glyph data that is generated from the horrible OCR has been proofread and corrected before you see it.

Unfortunately, that glyph data is not very handy for converting into a different, reflowable format. So the plugin uses the included original OCR text (Amazon includes it for full-text searching)... which can be quite atrocious, but that blame lies entirely with Amazon and not the plugin.
DiapDealer is offline