Quote:
Originally Posted by jhowell
"kindleunpack -d" will produce a file named "assembled_text.dat" in the mobi8 folder containing a subset of the rawml corresponding to the actual book content (flow 0). I think you will find that the position number offsets are indexed into this data without any correction needed.
|
And so it does. But it seems a bit magic. Fairly early on, the rawml has the 14-byte string "</body></html>", which is not in assembled_text.dat. Beyond that, the two files have unaligned sets of opening and closing html and body tags, which somehow do not affect the byte offsets of the book text.
rzikaou's rawml file has extra header tags not in the assembled_text.dat file.
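For anyone who wants to check this sort of difference by hand, here is a rough Python sketch that lists the byte offsets where a marker string occurs in each file. The helper names and the file paths passed to `report` are my own placeholders, not anything produced by kindleunpack or azw3r.

```python
from pathlib import Path

MARKER = b"</body></html>"  # the 14-byte string discussed above


def marker_offsets(data, marker=MARKER):
    """Return every byte offset at which `marker` occurs in `data`."""
    offsets = []
    pos = data.find(marker)
    while pos != -1:
        offsets.append(pos)
        pos = data.find(marker, pos + 1)
    return offsets


def report(rawml_path, assembled_path):
    """Print marker offsets for both files (paths are supplied by the caller)."""
    for label, path in (("rawml", rawml_path), ("assembled", assembled_path)):
        data = Path(path).read_bytes()
        print(label, len(data), "bytes, marker at", marker_offsets(data))
```

Calling `report("path/to/rawml", "path/to/assembled_text.dat")` with your own paths should show whether the marker turns up in one file and not the other.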
So the C azw3r and the Perl azw3r.pl can be used as is with
-r assembled_text.dat -o 0
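To sanity-check that a position offset really lands on the expected book text, a minimal sketch like this reads a few bytes straight out of assembled_text.dat at a given offset. The function name `peek` and the default length are made up for illustration; this is independent of azw3r and kindleunpack.

```python
def peek(path, offset, length=80):
    """Return `length` raw bytes of the file at `path`, starting at byte `offset`."""
    with open(path, "rb") as fh:
        fh.seek(offset)          # offsets are plain byte positions, no correction applied
        return fh.read(length)


def peek_text(path, offset, length=80):
    """Same as peek(), decoded as UTF-8 with undecodable bytes replaced."""
    return peek(path, offset, length).decode("utf-8", errors="replace")
```

For example, `peek_text("assembled_text.dat", some_offset)` with an offset taken from your own data should print the text that the position points at.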
Quote:
Originally Posted by odamizu
Thank you jhowell! As always, you are a wonderful source of enlightenment 
|
Ditto.