![]() |
#1 |
curly᷂͓̫̙᷊̥̮̾ͯͤͭͬͦͨ ʎʌɹnɔ
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,021
Karma: 50506927
Join Date: Dec 2010
Location: ♁ ᴺ₄₅°₃₀' ᵂ₇₃°₃₇' ±₆₀"
Device: K3₃.₄.₃ PW3&4₅.₁₃.₃
|
.azw1 or .tpz, antiquated Topaz sample file
Searched to no avail for a sample ebook in Topaz format.
Is there anyone who could make one available? The content is not important, it's just for testing. The sample could be scrambled but would need to be DRM free. (Alternately, a reference to an Amazon sample *in that format* that I could download and transfer to my Kindle Keyboard 3 would be adequate -- I couldn't locate one). |
![]() |
![]() |
![]() |
#2 |
Still reading
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 14,890
Karma: 110507267
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
|
Topaz is a special kind of OCR where instead of trying to convert to text, each glyph is stored and similar ones replaced by one in a table.
Unlike a scan encapuslated as a PDF, the table of glyphs is used to build the reflowable page. The advantage over full OCR is that if the matching is set so there are almost no false positives (which would be abysmal for regular OCR) there is no need for human proofing. It will result in a huge table compared to proofed full OCR and a subsetted Unicode characterset, but will work with no training and any font/alphabet. Quality is poor compared to human proofed OCR or a decent scan, but worked on the 167 dpi 6" viziplex 4 level screen on K1. So the only viable conversion is to an image /TIFF/PDF image (calibre won't do it). Then the image might be OCRed. The K 3 probably can read topaz. I've no idea how the DRM worked. It may have used mobipocket or more likely Amazon mobi drm. This (with loads of nasty 3rd party javascript) lists topaz titles. I suspect amazon has removed most now. https://www.kboards.com/threads/the-...az-format.171/ I doubt Calibre scramble plugin works with Topaz. Topaz has no text. EDIT see https://wiki.mobileread.com/wiki/Topaz |
![]() |
![]() |
![]() |
#3 | ||
curly᷂͓̫̙᷊̥̮̾ͯͤͭͬͦͨ ʎʌɹnɔ
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,021
Karma: 50506927
Join Date: Dec 2010
Location: ♁ ᴺ₄₅°₃₀' ᵂ₇₃°₃₇' ±₆₀"
Device: K3₃.₄.₃ PW3&4₅.₁₃.₃
|
^ @Quoth Thanks for the research.
Quote:
Correct. Quote:
"The Tool" can apparently assuredly remove topaz drm, but the Calibre plugin then converts the file to HTLMZ and does not preserve the AZW1 format. Correct, the scrable plugin only accept AZW3 EPUB and KEPUB. Last edited by PoP; 10-13-2025 at 02:05 PM. Reason: DRM removal precisions |
||
![]() |
![]() |
![]() |
#4 |
Still reading
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 14,890
Karma: 110507267
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
|
I think that Amazon has removed most or all of the Topaz titles. There used to be some that were also "free".
I think it was a brilliant idea and maybe it could be done better today to convert PDFs or other fixed layout that's not comics to reflowable. Except you'd need a custom app, you wouldn't actually create a Topaz file. Such a custom app is obviously simple for iOS & Android but possible on Kobo without a jailbreak. I've not seen Topaz, but from what I've read, it wasn't implemented well, or didn't work as good as I'd have expected. |
![]() |
![]() |
![]() |
#5 | |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 28,842
Karma: 207000000
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
|
Quote:
The plugin never even made an attempt to convert the topaz format itself. It wasn't at all feasible. It only used the underlying (and usually quite terrible and unformatted) original (sometimes OCRed) text that was only included to make it possible for the Kindle search function to still work. I had several of them at one time. Most of them rendered quite beautifully on Kindles. You just couldn't really do anything with them other than read them. Last edited by DiapDealer; 10-14-2025 at 03:26 PM. |
|
![]() |
![]() |
![]() |
#6 | |
Still reading
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 14,890
Karma: 110507267
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
|
Quote:
![]() I'm glad to hear that most worked well on the Kindle. The idea behind it also seems to be part of DjVu (1998), though those are fixed layout and can convert to image based PDF (1992 and open standard since 2008). It seems a shame it's gone as the idea of scanning pages and having reflowable content on a smaller screen with no human proofing seems useful. Last edited by Quoth; 10-14-2025 at 03:39 PM. Reason: Case |
|
![]() |
![]() |
![]() |
#7 | ||
Still reading
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 14,890
Karma: 110507267
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
|
This is the bit where Topaz and DjVu are similar:
Quote:
In contrast a scanned image in a PDF might simply be encapsulated TIFF. Quote:
DjVu beats PDF if the source is only scanned from paper, though like PDF it can have an OCR text layer to help search. PDF beats DjVu for output from maths typesetting, vector art, wordprocessing DTP etc. Both show a WYSIWG rendering of what might be printed on paper, and for DjVu the intention was a same size paper source. Normally the actual page size is pre-encoded into both. Topaz takes the compression/encoding idea of DjVu (and the OCR overlay for search that scanned to PDF + OCR also has), but instead of replicating the original layout it reflows and re-paginates for the actual screen. EDIT: Also of course Topaz and DjVu the work is done by the creator's tools (the readers rendering is simple), whereas non-scanned PDF (with postscript from laTex, vector art etc), azw3/KF8, KFX, epub2 and especially epub3 (with javascript, reflowable and fixed layout) require more work. The mobi /KF7 is HTML3 and has no CSS, so is pretty simple to render, especially as it only has three font faces (serif, sans and monospace) each in normal, bold, italic and bold italic. Last edited by Quoth; 10-14-2025 at 04:04 PM. |
||
![]() |
![]() |
![]() |
#8 | |
Still reading
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 14,890
Karma: 110507267
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
|
I find only one mention of Topaz on Wikipedia
Quote:
The K3 also supports azw3/KF8 if the firmware is updated. The international DXG (B009) actually was released after the B008 K3, but it's simply the gen 2 DX with the screen upgraded from Vizplex to Pearl, so it never got azw3/KF8 support. |
|
![]() |
![]() |
![]() |
#9 |
Still reading
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 14,890
Karma: 110507267
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
|
And of course the PDF spec later added the successor to DjVu's JBIG compression as an option, the JBIG2 compression. It can be x4 better than TIFF for 2 level images like text or fax.
However I think most scan to PDF software free on Windows doesn't use it, judging from the size. If we were re-inventing Topaz, then JBIG2 is freely available. |
![]() |
![]() |
![]() |
#10 | ||
curly᷂͓̫̙᷊̥̮̾ͯͤͭͬͦͨ ʎʌɹnɔ
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,021
Karma: 50506927
Join Date: Dec 2010
Location: ♁ ᴺ₄₅°₃₀' ᵂ₇₃°₃₇' ±₆₀"
Device: K3₃.₄.₃ PW3&4₅.₁₃.₃
|
@DiapDealer, history from the horse's mouth, thanks!
Quote:
Quote:
I was given one by a friend, along with the serial number of is now defunct K3, the converted htmlz reads perfectly. I would have liked to compare on a real device, maybe test annotations, bookmarks, notes ...etc. |
||
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
tpz file errors importing at command line but works in the GUI | hillcountryfare | Calibre | 5 | 09-22-2023 11:04 AM |
Fixing typos in Topaz file | mr ploppy | Amazon Kindle | 2 | 05-23-2011 06:14 PM |
Can Calibre convert .azw1 or .tpz to .mobi or .azw? | robcohen | Calibre | 3 | 08-30-2009 05:17 PM |
Identifying Topaz/AZW1 files prior to purchase | texasnightowl | Amazon Kindle | 1 | 09-05-2008 04:53 PM |
Kindle .azw1 file | bwit | Amazon Kindle | 4 | 08-05-2008 11:05 AM |