08-17-2011, 07:32 PM | #1 |
Zealot
Posts: 100
Karma: 400000
Join Date: Jul 2010
Device: iPad 2 64GB
|
AZW de-DRM'd to HTMLZ
Hi,
I know that talking about HOW to do it is illegal, but talking about it is probably not (at least I hope so). I bought a book that was only available on Amazon and when I tried to strip the DRM, Calibre ended up with a book in HTMLZ format... what the heck do I do with it? Why didn't it convert to MOBI like it usually does? Thanks for the help! |
08-17-2011, 07:59 PM | #2 |
Resident Curmudgeon
Posts: 73,645
Karma: 127837858
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
When you remove the DRM from an AZW eBook, you end up with Mobipocket. AZW is just Mobipocket with different DRM. Try the stand along DRM stripper instead of the plug-in.
|
Advert | |
|
08-17-2011, 08:01 PM | #3 |
Zealot
Posts: 100
Karma: 400000
Join Date: Jul 2010
Device: iPad 2 64GB
|
Hmm.. after running the deDRM python script manually, I get this as part of the conversion log:
Code:
Processing Section: metadata . Successfully Extracted Topaz contents Updating to color images if available Creating cover.jpg Processing Dictionary Processing Meta Data and creating OPF Processing StyleSheet Using font size: 116 Using page height: 12960 Using page width: 8640 Processing Glyphs .... Processing Pages .... Processing Complete Book Successfully generated Creating NoDRM HTMLZ Archive Creating SVG HTMLZ Archive Creating XML ZIP Archive |
08-17-2011, 08:02 PM | #4 |
Evangelist
Posts: 448
Karma: 864744
Join Date: Mar 2011
Device: Kindle 3, LookBook, Nook Simple Touch
|
probably was a topaz book not azw, just convert the htmlz to mobi or epub or whatever the format of your choice is.
edit: you beat me with your post |
08-17-2011, 08:07 PM | #5 | |
Sigil & calibre developer
Posts: 2,488
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
|
The book is a Topaz book. This line gives it away:
Quote:
HTMLZ is a ZIP archive with a single HTML file inside of it. HTMLZ also has a few things like a metadata.opf file. As stated you can import the HTMLZ into calibre and convert to your preferred format. |
|
Advert | |
|
08-17-2011, 08:20 PM | #6 |
Zealot
Posts: 100
Karma: 400000
Join Date: Jul 2010
Device: iPad 2 64GB
|
Strangely enough, the file extension was AZW, and not AZW1 or TPZ.
The book contains mostly text, aside from a few diagrams, and the front matter seems to be mostly scanned (copyright page, for example). Chapter headings seem to be SVGs, and the first letter of a sentence in the beginning of a chapter is a fancy glyph. So far, my attempts at converting this to ePub/Mobi have failed. The inline table of contents ends up looking different than the original book: Original: EPUB: And that glyph that starts a sentence is gone: Original: EPUB: Last edited by Caleb666; 08-17-2011 at 08:23 PM. |
08-17-2011, 08:37 PM | #7 |
Evangelist
Posts: 448
Karma: 864744
Join Date: Mar 2011
Device: Kindle 3, LookBook, Nook Simple Touch
|
topaz never converts well, be happy if it's readable.
|
08-17-2011, 11:47 PM | #8 |
Wizard
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
|
Amazon uses AZW for all their files, doesn't matter what the actual binary file is. In the past the extension can vary based on how you downloaded the ebook.
|
08-18-2011, 02:32 AM | #9 |
Zealot
Posts: 100
Karma: 400000
Join Date: Jul 2010
Device: iPad 2 64GB
|
Edit: n/m
Last edited by Caleb666; 08-18-2011 at 04:05 AM. |
08-18-2011, 04:11 AM | #10 |
eBook Enthusiast
Posts: 85,544
Karma: 93383043
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
|
I notice that you have an iPad. Could you not simply read the book in the Kindle app on the iPad?
Topaz books really don't convert well to other formats; they tend to be used in situations where you have complex layouts that can't be easily be implemented in Mobi format. |
08-18-2011, 10:04 AM | #11 | |
Zealot
Posts: 100
Karma: 400000
Join Date: Jul 2010
Device: iPad 2 64GB
|
Quote:
|
|
08-18-2011, 10:13 AM | #12 |
Wizard
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
|
The Calibre plugin only creates a htmlz file - if you use the standalone plugin for Topaz there are actually two output options - one is htmlz, just like Calibre, but the other is SVG - the SVG option basically gives you a pdf-like file made up exclusively of SVG. It's a true 'backup' of the topaz file, but it's a one-way backup - you could use the SVG to convert to PDF (or even an ePub that iBooks could render), but you can't convert it back to Topaz.
What's nice about the SVG export option is it looks exactly like the printed book, as this was the whole point of Topaz - emulate the printed book exactly, but also allow reflow. That said, I don't think a lot of people use that option, so it's limited in a few ways, give it a try and see if you like it. |
08-18-2011, 11:02 AM | #13 |
eBook Enthusiast
Posts: 85,544
Karma: 93383043
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
|
It's not that you can't remove the DRM - you can. The issue is more that you can't convert it to any other format.
|
08-18-2011, 11:16 AM | #14 | |
Grand Sorcerer
Posts: 5,883
Karma: 464403178
Join Date: Feb 2010
Location: 33.9388° N, 117.2716° W
Device: Kindles K-2, K-KB, PW 1 & 2, Voyage, Fire 2, 5 & HD 8, Surface 3, iPad
|
just convert htmlz to mobi.
Quote:
|
|
08-18-2011, 11:41 AM | #15 | |
Wizard
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
|
Quote:
The only way to improve on what the plugin does is to use the option to create SVG, then convert to jpg/tiff images, and finally re-do the OCR and editing yourself from scratch with something like ABBY finereader. Generally not worth it unless you REALLY want a perfect ebook edition. |
|
Thread Tools | Search this Thread |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
HTMLZ Output | Mamaijee | Conversion | 1 | 06-23-2011 07:00 PM |
HTMLZ - Single HTML File Output | user_none | Calibre | 22 | 05-19-2011 02:33 AM |
HTMLZ | Ortep | Calibre | 21 | 05-09-2011 10:27 PM |
How does one tell if an ebook is DRM'd? | AlexBell | Kindle Formats | 10 | 04-29-2009 12:09 AM |
DRM'd books | latchkeyed | News | 21 | 04-04-2009 03:34 PM |