Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Formats > Kindle Formats

Notices

Reply
 
Thread Tools Search this Thread
Old 12-31-2014, 10:23 PM   #1111
KevinH
Wizard
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 1,125
Karma: 444444
Join Date: Nov 2009
Device: many
Hi,
Please post the full debug kindleunpack log someplace first so I can see what might be going on. It will be large so zip it up first before posting it as an attachment. If I can't see an obvious problem, I would be happy to examine the full dictionary and try to figure out how to decode it in kindleunpack if possible.

KevinH
KevinH is offline   Reply With Quote
Old 01-01-2015, 05:57 AM   #1112
elchamaco
Connoisseur
elchamaco will become famous soon enoughelchamaco will become famous soon enoughelchamaco will become famous soon enoughelchamaco will become famous soon enoughelchamaco will become famous soon enoughelchamaco will become famous soon enough
 
Posts: 62
Karma: 500
Join Date: Aug 2011
Device: kindle, boox
Here it's the compressed log:

http://www66.zippyshare.com/v/63012785/file.html

Thanks
elchamaco is offline   Reply With Quote
Old 01-01-2015, 07:41 PM   #1113
KevinH
Wizard
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 1,125
Karma: 444444
Join Date: Nov 2009
Device: many
Hi elchamaco,

Please try again but this time:

1. use very latest version KindleUnpack v0.77 which is available at this link:

http://www.mobileread.com/forums/sho...postcount=1074

2. Do NOT create a rar. Instead create a simple zip file as I asked

3. Remember to use the -d option in KindleUnpack (debug) to dump more information.

4. Do NOT upload it to a horrible/dodgy website like zippyshare!

Simply attach the zip file to your post here as an attachment.

The kindlegen build log seems to indicate that the dictionary is not correct when built. Error messages like that are not normal. If that is the case, I won't be able to help.

KevinH



Quote:
Originally Posted by elchamaco View Post
Here it's the compressed log:

http://www66.zippyshare.com/v/63012785/file.html

Thanks
KevinH is offline   Reply With Quote
Old 01-04-2015, 06:02 PM   #1114
elchamaco
Connoisseur
elchamaco will become famous soon enoughelchamaco will become famous soon enoughelchamaco will become famous soon enoughelchamaco will become famous soon enoughelchamaco will become famous soon enoughelchamaco will become famous soon enough
 
Posts: 62
Karma: 500
Join Date: Aug 2011
Device: kindle, boox
Hi,

I generated the log with 0.77. The unpack gives the same result as 0.75.

Regards.
Attached Files
File Type: zip lognew.zip (368.5 KB, 15 views)
elchamaco is offline   Reply With Quote
Old 01-04-2015, 07:30 PM   #1115
KevinH
Wizard
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 1,125
Karma: 444444
Join Date: Nov 2009
Device: many
Hi elchamaco,

Yes this is very different from all other dictionaries I have seen.

Examining the mobi7 header shows that there is no meta inflexion index
at all but a new index called index_names does seem to exist and be used.

Here is a snippet from the Mobi7 header. See the metainfindex field has been set to missing 0xFFFFFFFF but something called index_names does have a non 0xFFFFFFFF value and it points to a set of unknown indexes further along than the metaorthindex.


Code:
Dumping section 0, Mobipocket Header version: 7, total length 272
Mobipocket header from section 0
     Offset  Value Hex Dec        Description
0x000 (  0)     0x0002          2 compression_type
0x002 (  2)     0x0000          0 fill0
0x004 (  4) 0x054DEC84   88992900 text_length
0x008 (  8)     0x54DF      21727 text_records
0x00A ( 10)     0x1000       4096 max_section_size
0x00C ( 12)     0x0000          0 crypto_type
0x00E ( 14)     0x0000          0 fill1
0x010 ( 16)       MOBI            magic
0x014 ( 20) 0x00000100        256 header_length (from MOBI)
0x018 ( 24) 0x00000002          2 type
0x01C ( 28) 0x0000FDE9      65001 codepage
0x020 ( 32) 0x434E1E13 1129192979 unique_id
0x024 ( 36) 0x00000007          7 version
0x028 ( 40) 0x000054E1      21729 metaorthindex
0x02C ( 44) 0xFFFFFFFF 4294967295 metainflindex
0x030 ( 48) 0x000055E7      21991 index_names
0x034 ( 52) 0xFFFFFFFF 4294967295 index_keys

Here are snippets from the map of sections in the mobi ebook:

Code:
The metaorthindex points here:

21729 54E1  0x1C9B030 0x01F04    43458       0 Unknown INDX section, extracting as Unknown21729_INDX.dat
21730 54E2  0x1C9CF34 0x0FBF0    43460       0 Unknown INDX section, extracting as Unknown21730_INDX.dat
21731 54E3  0x1CACB24 0x0FBEC    43462       0 Unknown INDX section, extracting as Unknown21731_INDX.dat
...

--snip--

The index_names point here.

21991 55E7  0x2C72C3C 0x000E0    43982       0 Unknown INDX section, extracting as Unknown21991_INDX.dat
21992 55E8  0x2C72D1C 0x000CC    43984       0 Unknown INDX section,

So the current dictionary code will not deal with this at all as it never even looks at the "index_names" nor does it know how to interpret its data.

I think that field name was based on information in the Wiki here and there may be no one who remembers why it was named "index_names" as I did not name it.

So supporting these strange dictionaries which have errors in their own kindlegen logs and who use who use no inflections index may require a big reverse engineering effort.

This is not something I can take on soon. But if you post the dictionary someplace that is not so swamped by unknown attacks (say www.datafilehost.com where I at least know how to avoid the issues) and send a personal mail on this site to "KevinH" with the link. I will take a look at eventually supporting it, just not right now as I am tied up with Sigil projects.

Thanks,

KevinH

Quote:
Originally Posted by elchamaco View Post
Hi,

I generated the log with 0.77. The unpack gives the same result as 0.75.

Regards.
KevinH is offline   Reply With Quote
Old 01-06-2015, 03:17 PM   #1116
elchamaco
Connoisseur
elchamaco will become famous soon enoughelchamaco will become famous soon enoughelchamaco will become famous soon enoughelchamaco will become famous soon enoughelchamaco will become famous soon enoughelchamaco will become famous soon enough
 
Posts: 62
Karma: 500
Join Date: Aug 2011
Device: kindle, boox
Done.

I've also uploaded a dictionary i've created in the same way the did. The dictionary works, kindleupack fails to decompress de inflected forms. I included the source files and the test dictionary compiled with mobipocket and with kindlegen, in both cases the result is the same.

Regards.
elchamaco is offline   Reply With Quote
Old 01-10-2015, 12:36 PM   #1117
KevinH
Wizard
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 1,125
Karma: 444444
Join Date: Nov 2009
Device: many
Hi,
Here is a status update.

I looked at your dictionary and the index_names is meaningless. There is an index header but no real index data there. I am not sure why this empty index is even created.

So I went back and looked at the orthIndex data and *all* of the words are there but some are given a different tagMap values that point to the entry they are subelements of. The current code only unpacks main elements not subelements since they should have been inflections.

The problem is that both main elements and subelements are in sorted alphabetical order so some subelements actually come before the main element they are listed under. That means our current one pass algorithm to build up a dictionary will no longer work. So this is going to take some effort.

These types of dictionaries may "work" but they are incorrect and all of these subelements should simply be inflections which would make the dictionary much smaller in size when compiled into a mobi.

And, I really hate to slow down unpacking of all dictionaries just to accommodate these incorrectly amd inefficiently created dictionaries. So I am not sure I will add support for these.

If I do decide to modify KindleUnpack to support them, I will make the code convert all of these new subelements to inflections.

KevinH




Quote:
Originally Posted by elchamaco View Post
Done.

I've also uploaded a dictionary i've created in the same way the did. The dictionary works, kindleupack fails to decompress de inflected forms. I included the source files and the test dictionary compiled with mobipocket and with kindlegen, in both cases the result is the same.

Regards.
KevinH is offline   Reply With Quote
Old 01-11-2015, 05:28 PM   #1118
elchamaco
Connoisseur
elchamaco will become famous soon enoughelchamaco will become famous soon enoughelchamaco will become famous soon enoughelchamaco will become famous soon enoughelchamaco will become famous soon enoughelchamaco will become famous soon enough
 
Posts: 62
Karma: 500
Join Date: Aug 2011
Device: kindle, boox
Thanks for looking it. Yes for sure they use an incorrect way of doing it, i've not seeing any document explaining to code the entries in that way.

It's very strange, and it's a commercial dictionary.

To not slowdown, using an algorithm to standard dictionaries and other when a dictionary uses this kind of inflected forms?

Thanks anyway taking time to view it.
elchamaco is offline   Reply With Quote
Old 01-16-2015, 09:20 AM   #1119
DiapDealer
Grand Sorcerer
DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.
 
DiapDealer's Avatar
 
Posts: 9,893
Karma: 45801130
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
@tkeo (and/or KevinH)

I have a bug report in the KindleUnpack plugin thread that looks to be related to your ePub3/Furigana-related code. Any chance you could take a look?
DiapDealer is online now   Reply With Quote
Old 01-16-2015, 12:56 PM   #1120
KevinH
Wizard
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 1,125
Karma: 444444
Join Date: Nov 2009
Device: many
Hi DiapDealer,

We released v077 on Oct 17 or so and this bug was fixed on Oct. 30th or so. It is so rare we have not made a new release for just that fix.

So your best bet is to simply update to today's master or grab that Oct 30th commit (skip the part where an entire emacs backup file with ~ gets added to the tree!) ;-)

Take care,

KevinH

Quote:
Originally Posted by DiapDealer View Post
@tkeo (and/or KevinH)

I have a bug report in the KindleUnpack plugin thread that looks to be related to your ePub3/Furigana-related code. Any chance you could take a look?
KevinH is offline   Reply With Quote
Old 01-16-2015, 01:15 PM   #1121
DiapDealer
Grand Sorcerer
DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.
 
DiapDealer's Avatar
 
Posts: 9,893
Karma: 45801130
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
OK, cool. Thanks!
DiapDealer is online now   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Can i rotate text and insert images in Mobi and EPUB? JanGLi Kindle Formats 5 02-02-2013 05:16 PM
PDF to Mobi with text and images pocketsprocket Kindle Formats 7 05-21-2012 08:06 AM
Mobi files - images DWC Introduce Yourself 5 07-06-2011 02:43 AM
pdf to mobi... creating images rather than text Dumhed Calibre 5 11-06-2010 01:08 PM
Transfer of images on text files anirudh215 PDF 2 06-22-2009 10:28 AM


All times are GMT -4. The time now is 06:39 PM.


MobileRead.com is a privately owned, operated and funded community.