View Single Post
Old 02-25-2013, 07:19 PM   #152
Ephemerality
Addict
Ephemerality ought to be getting tired of karma fortunes by now.Ephemerality ought to be getting tired of karma fortunes by now.Ephemerality ought to be getting tired of karma fortunes by now.Ephemerality ought to be getting tired of karma fortunes by now.Ephemerality ought to be getting tired of karma fortunes by now.Ephemerality ought to be getting tired of karma fortunes by now.Ephemerality ought to be getting tired of karma fortunes by now.Ephemerality ought to be getting tired of karma fortunes by now.Ephemerality ought to be getting tired of karma fortunes by now.Ephemerality ought to be getting tired of karma fortunes by now.Ephemerality ought to be getting tired of karma fortunes by now.
 
Posts: 328
Karma: 800105
Join Date: Feb 2013
Device: PW1
Quote:
Originally Posted by RobbieClarken View Post
For one book, each loc is 14 characters off the position in the rawml file. For another book, the discrepancy ranges from 22 to 2000 characters.

Have you come across this? (I get the same issue using the version of mobi_unpack you link to.)
Good question. I had the same issue, but only on DRM'd books. So if I had the DRM'd version on my Kindle, transfer it to my pc and remove the DRM to create the X-Ray, then use the X-Ray on the original DRM copy it would have an arbitrary offset like you're describing.
I have been copying to pc, removing DRM, then using the DRM-free version on my Kindle.

Edit: Also, I ran into issues with the locs being off because of the encoding being used. The books I've used are using UTF8, which causes things like quotes to take up more space than normal (I think 3 characters instead of 1). So if you're not loading the book and taking that into account, you will end up counting those characters as 1 loc instead of 3... hopefully that makes sense.

Last edited by Ephemerality; 02-25-2013 at 08:35 PM.
Ephemerality is offline   Reply With Quote