11-07-2012, 01:55 PM   #441
hockpa2e
Junior Member
Posts: 2
Karma: 10
Join Date: Nov 2012
Location: New Jersey, USA
Device: Kindle Keyboard 3G
Hi, I unpacked a DeDRM'd mobi7 and the index entries in the resulting HTML are corrupted. The value attributes of the idx:orth tags are byte sausage, including illegal control characters:

<idx:orth value="^Ch^H\^H_">

(This is how Emacs displays control characters.) The same thing happens with both the latest and older versions of mobiunpack.

All the other data in the file seems fine. The charset is utf-8.
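
To check how widespread the damage is, something like the following could list every affected entry. This is only a rough sketch: it assumes the unpacked HTML is read as raw bytes, and the function name and regular expressions are made up for illustration; they are not part of mobiunpack.

```python
import re

# Captures the value attribute of each idx:orth tag in the raw bytes.
ORTH_VALUE = re.compile(rb'<idx:orth\b[^>]*?\bvalue="([^"]*)"')

# C0 control bytes other than tab, LF and CR, plus DEL.
CONTROL = re.compile(rb'[\x00-\x08\x0b\x0c\x0e-\x1f\x7f]')


def find_corrupt_orth_values(html: bytes):
    """Return (byte offset, raw value) for each idx:orth value with control bytes."""
    hits = []
    for match in ORTH_VALUE.finditer(html):
        value = match.group(1)
        if CONTROL.search(value):
            hits.append((match.start(), value))
    return hits
```

Calling it on the contents of the unpacked file, opened in binary mode, returns the byte offset of each suspect tag together with the raw value, so the bytes can be inspected directly instead of through the editor's control-character notation.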

Any idea what could be causing this? Thanks.