MobileRead Forums - View Single Post - KindleUnpack (MobiUnpack): Extracts text, images and metadata from Kindle/Mobi files

KevinH · 06-14-2014, 02:50 PM

Also here is a quick and dirty python script to decode apnx files to return page names and offsets into the assembled text file (not the raw markup language file). It's command line parsing is not full unicode safe (yet). It is merely meant to demonstrate how to decode the apnx file when it is has different page numbering schemes.

Not very useful without the assembled_text.dat file but KindleUnpack's mobi_k8proc.py can be easily modified to generate that file.

NOTE: this code was thrown together very quickly to show proof of concept. It really really needs to be cleaned up before being used for any production purposes.

Hope this helps,

KevinH