View Single Post
Old 12-18-2021, 02:40 PM   #7
feklee
Member
feklee began at the beginning.
 
Posts: 18
Karma: 10
Join Date: Feb 2013
Device: Kindle M2L4EK
Quote:
Originally Posted by j.p.s View Post
I had never heard of .han files, but it looks like they are already in a usable format, perhaps JSON.
Yes, JSON.

Quote:
If kindleunpack works on your azw3 files you can use it to get the raw XHTML file of the book. The startPositions and endPositions in your .han file are very likely byte (or maybe character) offsets in the raw XHTML file of the highlighted text.
Not right away, though, but maybe after stripping the markup.

Interestingly I can get a list of all annotations in Duokan, which I have installed for dual-boot on my device. It’s a 20 page document, and maybe I’ll just scan that with a flatbed scanner. Kind of dumb, but way quicker than trying to reverse engineer that format.

I don’t think I want to annotate with the Kindle again. This is all a pain. Fortunately, it seems to be easy to de-drm books, and then I can use another reader.
feklee is offline   Reply With Quote