12-31-2014, 09:23 PM | #1111 |
Sigil Developer
Posts: 7,644
Karma: 5433388
Join Date: Nov 2009
Device: many
|
Hi,
Please post the full debug kindleunpack log someplace first so I can see what might be going on. It will be large so zip it up first before posting it as an attachment. If I can't see an obvious problem, I would be happy to examine the full dictionary and try to figure out how to decode it in kindleunpack if possible. KevinH |
01-01-2015, 04:57 AM | #1112 |
Zealot
Posts: 128
Karma: 500
Join Date: Aug 2011
Device: kindle, boox
|
|
01-01-2015, 06:41 PM | #1113 | |
Sigil Developer
Posts: 7,644
Karma: 5433388
Join Date: Nov 2009
Device: many
|
Hi elchamaco,
Please try again, but this time:
1. Use the very latest version, KindleUnpack v0.77, which is available at this link: https://www.mobileread.com/forums/sho...postcount=1074
2. Do NOT create a rar. Instead create a simple zip file as I asked.
3. Remember to use the -d option in KindleUnpack (debug) to dump more information.
4. Do NOT upload it to a horrible/dodgy website like zippyshare! Simply attach the zip file to your post here as an attachment.
The kindlegen build log seems to indicate that the dictionary is not correct when built. Error messages like that are not normal. If that is the case, I won't be able to help.
KevinH
|
|
01-04-2015, 05:02 PM | #1114 |
Zealot
Posts: 128
Karma: 500
Join Date: Aug 2011
Device: kindle, boox
|
Hi,
I generated the log with 0.77. The unpack gives the same result as 0.75. Regards. |
01-04-2015, 06:30 PM | #1115 |
Sigil Developer
Posts: 7,644
Karma: 5433388
Join Date: Nov 2009
Device: many
|
Hi elchamaco,
Yes, this is very different from all the other dictionaries I have seen. Examining the mobi7 header shows that there is no meta inflection index at all, but a new index called index_names does seem to exist and be used.
Here is a snippet from the Mobi7 header. Note that the metainflindex field has been set to the missing value 0xFFFFFFFF, but something called index_names has a non-0xFFFFFFFF value, and it points to a set of unknown indexes further along than the metaorthindex.
Code:
Dumping section 0, Mobipocket Header version: 7, total length 272
Mobipocket header from section 0
Offset        Value Hex    Dec         Description
0x000 (  0)   0x0002       2           compression_type
0x002 (  2)   0x0000       0           fill0
0x004 (  4)   0x054DEC84   88992900    text_length
0x008 (  8)   0x54DF       21727       text_records
0x00A ( 10)   0x1000       4096        max_section_size
0x00C ( 12)   0x0000       0           crypto_type
0x00E ( 14)   0x0000       0           fill1
0x010 ( 16)   MOBI                     magic
0x014 ( 20)   0x00000100   256         header_length (from MOBI)
0x018 ( 24)   0x00000002   2           type
0x01C ( 28)   0x0000FDE9   65001       codepage
0x020 ( 32)   0x434E1E13   1129192979  unique_id
0x024 ( 36)   0x00000007   7           version
0x028 ( 40)   0x000054E1   21729       metaorthindex
0x02C ( 44)   0xFFFFFFFF   4294967295  metainflindex
0x030 ( 48)   0x000055E7   21991       index_names
0x034 ( 52)   0xFFFFFFFF   4294967295  index_keys

Here are snippets from the map of sections in the mobi ebook:
Code:
The metaorthindex points here:
21729 54E1 0x1C9B030 0x01F04 43458 0 Unknown INDX section, extracting as Unknown21729_INDX.dat
21730 54E2 0x1C9CF34 0x0FBF0 43460 0 Unknown INDX section, extracting as Unknown21730_INDX.dat
21731 54E3 0x1CACB24 0x0FBEC 43462 0 Unknown INDX section, extracting as Unknown21731_INDX.dat
...
--snip--
The index_names point here.
21991 55E7 0x2C72C3C 0x000E0 43982 0 Unknown INDX section, extracting as Unknown21991_INDX.dat
21992 55E8 0x2C72D1C 0x000CC 43984 0 Unknown INDX section,

So the current dictionary code will not deal with this at all, as it never even looks at "index_names", nor does it know how to interpret its data. I think that field name was based on information in the Wiki here, and there may be no one who remembers why it was named "index_names", as I did not name it.
So supporting these strange dictionaries, which have errors in their own kindlegen logs and which use no inflections index, may require a big reverse engineering effort. This is not something I can take on soon.
But if you post the dictionary someplace that is not so swamped by unknown attacks (say www.datafilehost.com, where I at least know how to avoid the issues) and send a personal mail on this site to "KevinH" with the link, I will take a look at eventually supporting it, just not right now as I am tied up with Sigil projects.
Thanks, KevinH |
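For anyone who wants to check their own dictionary for the same pattern, here is a minimal Python sketch that reads the four index fields at the offsets shown in the header dump above and reports which ones are set to the missing value 0xFFFFFFFF. It is not KindleUnpack code: the function name `read_index_fields` and the synthetic `section0` buffer are made up for illustration, and it assumes the fields are stored as big-endian 32-bit values, which matches the hex values in the dump.

```python
import struct

NULL_INDEX = 0xFFFFFFFF  # value the dump shows for a missing index

# Offsets into section 0, copied from the header dump in the post above.
INDEX_FIELDS = {
    "metaorthindex": 0x28,
    "metainflindex": 0x2C,
    "index_names": 0x30,
    "index_keys": 0x34,
}


def read_index_fields(section0):
    """Hypothetical helper: map each field name to its section number,
    or to None when the field holds the missing value 0xFFFFFFFF."""
    fields = {}
    for name, offset in INDEX_FIELDS.items():
        # Assumption: each field is a big-endian unsigned 32-bit integer.
        (value,) = struct.unpack_from(">L", section0, offset)
        fields[name] = None if value == NULL_INDEX else value
    return fields


# Synthetic section 0 that carries only the four values from the dump.
section0 = bytearray(0x38)
struct.pack_into(">4L", section0, 0x28, 0x54E1, 0xFFFFFFFF, 0x55E7, 0xFFFFFFFF)

fields = read_index_fields(bytes(section0))
for name in ("metaorthindex", "metainflindex", "index_names", "index_keys"):
    print(name, fields[name])
```

With a real file you would pass the raw bytes of section 0 instead of the synthetic buffer. A dictionary of the kind discussed here shows a section number for metaorthindex and index_names and None for metainflindex.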
01-06-2015, 02:17 PM | #1116 |
Zealot
Posts: 128
Karma: 500
Join Date: Aug 2011
Device: kindle, boox
|
Done.
I've also uploaded a dictionary I've created in the same way they did. The dictionary works, but KindleUnpack fails to decompress the inflected forms. I included the source files and the test dictionary compiled with mobipocket and with kindlegen; in both cases the result is the same. Regards. |
01-10-2015, 11:36 AM | #1117 | |
Sigil Developer
Posts: 7,644
Karma: 5433388
Join Date: Nov 2009
Device: many
|
Hi,
Here is a status update. I looked at your dictionary and the index_names is meaningless. There is an index header but no real index data there. I am not sure why this empty index is even created.
So I went back and looked at the orthIndex data, and *all* of the words are there, but some are given different tagMap values that point to the entry they are subelements of. The current code only unpacks main elements, not subelements, since they should have been inflections.
The problem is that both main elements and subelements are in sorted alphabetical order, so some subelements actually come before the main element they are listed under. That means our current one-pass algorithm to build up a dictionary will no longer work. So this is going to take some effort.
These types of dictionaries may "work", but they are incorrect, and all of these subelements should simply be inflections, which would make the dictionary much smaller in size when compiled into a mobi. And I really hate to slow down unpacking of all dictionaries just to accommodate these incorrectly and inefficiently created dictionaries. So I am not sure I will add support for these.
If I do decide to modify KindleUnpack to support them, I will make the code convert all of these new subelements to inflections.
KevinH
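To illustrate why one pass is not enough, here is a rough Python sketch of a two-pass approach under simplified assumptions. It is not the KindleUnpack implementation: the `(word, parent)` tuples are a made-up stand-in for the decoded orthIndex entries and their tagMap values, and `group_subelements` is a hypothetical name.

```python
def group_subelements(entries):
    """Hypothetical helper. `entries` is an alphabetically sorted list of
    (word, parent) tuples; parent is None for a main element, otherwise it
    is the main element the word is listed under."""
    # Pass 1: register every main element, so that a subelement sorted
    # ahead of its main element still has somewhere to go.
    grouped = {word: [] for word, parent in entries if parent is None}
    # Pass 2: attach each subelement to its main element as an inflection.
    for word, parent in entries:
        if parent is not None:
            grouped[parent].append(word)
    return grouped


# "am" and "are" sort before their main element "be", and "ran" before
# "run", which is the case a single pass cannot handle.
entries = [
    ("am", "be"),
    ("are", "be"),
    ("be", None),
    ("is", "be"),
    ("ran", "run"),
    ("run", None),
    ("was", "be"),
]

grouped = group_subelements(entries)
print(grouped)
```

The extra pass over every entry is the cost mentioned above: it is paid by every dictionary, including the ones that already use a proper inflection index.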
|
|
01-11-2015, 04:28 PM | #1118 |
Zealot
Posts: 128
Karma: 500
Join Date: Aug 2011
Device: kindle, boox
|
Thanks for looking at it. Yes, for sure they use an incorrect way of doing it; I have not seen any document that explains coding the entries that way.
It's very strange, and it's a commercial dictionary. To avoid slowing things down, could you use one algorithm for standard dictionaries and another when a dictionary uses this kind of inflected form? Thanks anyway for taking the time to look at it. |
01-16-2015, 08:20 AM | #1119 |
Grand Sorcerer
Posts: 27,548
Karma: 193191846
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
|
@tkeo (and/or KevinH)
I have a bug report in the KindleUnpack plugin thread that looks to be related to your ePub3/Furigana-related code. Any chance you could take a look? |
01-16-2015, 11:56 AM | #1120 | |
Sigil Developer
Posts: 7,644
Karma: 5433388
Join Date: Nov 2009
Device: many
|
Hi DiapDealer,
We released v0.77 on Oct 17 or so, and this bug was fixed on Oct 30th or so. It is so rare that we have not made a new release for just that fix. So your best bet is to simply update to today's master or grab that Oct 30th commit (skip the part where an entire emacs backup file with ~ gets added to the tree!) ;-) Take care, KevinH
|
|
01-16-2015, 12:15 PM | #1121 |
Grand Sorcerer
Posts: 27,548
Karma: 193191846
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
|
OK, cool. Thanks!
|
05-11-2015, 03:54 PM | #1122 |
Sigil Developer
Posts: 7,644
Karma: 5433388
Join Date: Nov 2009
Device: many
|
KindleUnpack v0.80 Released
Hi All,
The very latest KindleUnpack version v0.80 has now been released. Please see: https://github.com/kevinhendricks/KindleUnpack/releases
This version now includes support for running under both Python 2.7 and 3.4 and it has all of the latest bug fixes including a fix for the missing TOC links issue (thank you Kovid and DiapDealer!) and a fix for Japanese specific metadata entries generated by Amazon.
@Developers
I ran into many many conflict problems trying to merge/sync the python2and3 branch directly into master and basically gave up. Instead, I have moved the original master branch to oldmaster (to keep its full history) and then copied the python2and3 branch to become a new master.
The branches currently labelled "python2and3" and "oldmaster" are now obsolete. The branch "master" should be used as the target for any changes/pull requests or patches. Also any proposed changes need to be tested under both python 2.7.X and python 3.4.X.
If you have previously checked out a tree from this repo, you will probably have to make a clean clone of the repository since a proper merge was never fully done.
Thanks, KevinH |
05-11-2015, 09:08 PM | #1123 |
just an egg
Posts: 1,586
Karma: 4300000
Join Date: Mar 2015
Device: Kindle, iOS
|
Thank you KevinH, DiapDealer and everyone else who has contributed time and effort to this AWESOME tool!
|
05-13-2015, 07:14 AM | #1124 |
Member
Posts: 16
Karma: 10
Join Date: Oct 2012
Device: Kindle 4
|
FXL EPUBs do not include the necessary viewport dimensions
When KindleUnpack unpacks a FXL KF8 and repackages it as a FXL EPUB 3, each XHTML document in the spine should have a `meta` element inside its `head` block describing the size of the viewport, in order for the FXL EPUB to be rendered correctly in compatible EPUB ereaders (iBooks, Google Play Books, Kobo…):
Code:
<meta name="viewport" content="width=575, height=749"/>

Should I add these as issues at KindleUnpack's GitHub repo? |
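As a stopgap until this is handled during unpacking, the missing element can be added to each XHTML file afterwards. Here is a minimal Python sketch of that idea. It is not part of KindleUnpack: `add_viewport` is a hypothetical helper, it uses a plain regular expression rather than a real XML parser, and it assumes you already know the page width and height, since nothing in this thread says where KindleUnpack would get them.

```python
import re


def add_viewport(xhtml, width, height):
    """Hypothetical helper: insert a viewport meta element just before
    </head>, unless the document already has one."""
    if re.search(r'<meta[^>]*name=["\']viewport["\']', xhtml, re.IGNORECASE):
        return xhtml  # already present, leave the document untouched
    meta = '<meta name="viewport" content="width=%d, height=%d"/>' % (width, height)
    # A function replacement avoids any backslash processing of the new text.
    return re.sub(r"</head>", lambda m: meta + "\n" + m.group(0), xhtml,
                  count=1, flags=re.IGNORECASE)


page = "<html><head><title>p1</title></head><body/></html>"
fixed = add_viewport(page, 575, 749)
print(fixed)
```

Running it a second time on the same text changes nothing, so it is safe to apply to a whole folder of spine documents more than once.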
05-13-2015, 02:12 PM | #1125 |
Sigil Developer
Posts: 7,644
Karma: 5433388
Join Date: Nov 2009
Device: many
|
Please post links to a completely valid fixed layout KF8 made via kindlegen (no drm) and the valid fixed layout epub3 that was used as input to kindlegen and I will work out how to properly unpack it and make the needed changes to KindleUnpack.
Thanks, KevinH |
|