Quote:
Originally Posted by RobbieClarken
For one book, each loc is 14 characters off the position in the rawml file. For another book, the discrepancy ranges from 22 to 2000 characters.
Have you come across this? (I get the same issue using the version of mobi_unpack you link to.)
|
Good question. I had the same issue, but only on DRM'd books. So if I had the DRM'd version on my Kindle, transfer it to my pc and remove the DRM to create the X-Ray, then use the X-Ray on the original DRM copy it would have an arbitrary offset like you're describing.
I have been copying to pc, removing DRM, then using the DRM-free version on my Kindle.
Edit: Also, I ran into issues with the locs being off because of the encoding being used. The books I've used are using UTF8, which causes things like quotes to take up more space than normal (I think 3 characters instead of 1). So if you're not loading the book and taking that into account, you will end up counting those characters as 1 loc instead of 3... hopefully that makes sense.