Thread: Breaking DRM
View Single Post
Old 01-08-2011, 07:33 AM   #12
pdurrant
The Grand Mouse 高貴的老鼠
pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.pdurrant ought to be getting tired of karma fortunes by now.
 
pdurrant's Avatar
 
Posts: 74,117
Karma: 315558334
Join Date: Jul 2007
Location: Norfolk, England
Device: Kindle Oasis
Quote:
Originally Posted by sbtx99 View Post
Most kindle books include both file size and number of pages under the product details. Someone pointed out on another thread that the ones that only display the number of pages (no file size) are usually topaz. When I checked I found it true for the few that I had downloaded that were topaz.
The latest tools also work on Topaz files. The Calibre plug-in produces a zip file of the HTML and images,. The stand-alone versions produce the HTML zip and also a book of SVG page images in another zip file.

The HTML zip can be converted by Calibre to other formats, but includes any OCR errors. The SVG images are essentially images of the original book pages, and can be used to correct the OCR (manually!) if you want to spend the time.
pdurrant is offline   Reply With Quote