Quote:
Originally Posted by j.p.s
Amazon startPositions and endPositions include the markup. Stripping the markup would render them useless.
|
I had a second look. First of all, the
.rawml file in the
mobi8 directory seems to be all ASCII. Non-ASCII characters are encoded using HTML entities. For verification:
Code:
$ wc -m book.rawml
793974 book.rawml
$ wc -c book.rawml
793974 book.rawml
So I picked a highlight from the
.han file:
Code:
{"endPosition":"316113","startPosition":"315821","lastModificationTime":"2021-07-22 20:43:27.262","type":"kindle.highlight"}
To get the text I needed to add an offet of 15 to the positions (spaces added by me, to prevent phpbb from expanding entities):
Code:
$ cut -z -c$((315821 + 15))-$((316113 + 15)) book.rawml
The solution wasn&# 8217;t just another round of stimulus. After all, &# 8220;with global growth weak, using stimulus to use up excess capacity is like preparing food for two when there&# 8217;s only one guest; they could eat as much as they could and it still wouldn&# 8217;t all be gone.&# 8221;
This matches what I could find in the list of annotations shown in Duokan:
Code:
The solution wasn’t just another round of stimulus. After all, “with global growth weak, using stimulus to use up excess capacity is like preparing food for two when there’s only one guest; they could eat as much as they could and it still wouldn’t all be gone.”
But all this is unnecessarily complicated. The easiest solution is to just download
documents/My Clippings.txt from the device. It has the annotations of all books in it.