Quote:
Originally Posted by eschwartz
HTMLZ? I regret to inform you that you have stumbled upon one of the rare Topaz format experiments, and discovered exactly why it is so horrible.
Topaz used, IIRC, some form of embedded image, backed by OCR. The DeDRM plugin and calibre as well can only extract the OCR layer.
Basically it is the PDF of ebook formats. 
|
Topaz is absolutely fine for the purpose it's intended for - it's just not a good format for conversion. Its intended use is the situation when the source material is not available in digital form, and you either can't afford to do a digital conversion, or the material isn't amenable to digital conversion.
I have a number of old Egyptology books in Topaz format that really can't practically be converted to Mob/ePub format because they contain a lot of non-standard glyphs (Egyptian hieroglyphs, etc). Topaz is an ideal format for such material, because it's a fraction the size that a page-scan would be, and it's reflowable.