Quote:
Originally Posted by j.p.s
I had never heard of .han files, but it looks like they are already in a usable format, perhaps JSON.
|
Yes, JSON.
Quote:
|
If kindleunpack works on your azw3 files you can use it to get the raw XHTML file of the book. The startPositions and endPositions in your .han file are very likely byte (or maybe character) offsets in the raw XHTML file of the highlighted text.
|
Not right away, though, but maybe after stripping the markup.
Interestingly I can get a list of all annotations in Duokan, which I have installed for dual-boot on my device. It’s a 20 page document, and maybe I’ll just scan that with a flatbed scanner. Kind of dumb, but way quicker than trying to reverse engineer that format.
I don’t think I want to annotate with the Kindle again. This is all a pain. Fortunately, it seems to be easy to de-drm books, and then I can use another reader.