|
|
#1 |
|
Zealot
![]() Posts: 108
Karma: 10
Join Date: Sep 2020
Device: kindle paperwhite3/Oasis2
|
How kindle matches a word is looked up in a dictionlary
Hello,
Today, I unpacked an embedded kindle dictionary via KindleUnpack, found there was no idx:entry for the word "carefully", and there was also no idx:iform in the idx:entry of "careful". I'm curious how the word “carefully" can be matched by the embedded kindle dictionary? Here is the idx:entry for the word "careful" in the embedded kindle dictionary. Code:
<idx:entry scriptable="yes">
<idx:orth value="careful">
<idx:infl>
<idx:iform name="" value="carefuller" />
<idx:iform name="" value="carefullest" />
</idx:infl>
</idx:orth>
<i>adjective</i>
<b>carefuler</b>; <b>carefullest</b>
<div align="left">
<b>1</b>
: using or taking care <br />
<b>2</b>
: marked by solicitude, caution, or prudence
</div>
<div align="left"> <b>carefully</b>
<i>adverb</i>
</div>
</idx:entry>
Could anyone share any experiences on this? |
|
|
|
|
|
#2 |
|
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,160
Karma: 92500001
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
|
When a dictionary is created much of the HTML markup is removed and converted into various indexes. Kindlegen attempts to undo that conversion but it does not handle every possible case.
I might be able to look into this in more detail if I can locate the specific dictionary that you are using. I took at look at the English language dictionaries on my Oasis 2. They are "Oxford Dictionary of English_B0053VMNYW" and "The New Oxford American Dictionary_B0053VMNY2". Neither has the same definition for "careful" as your dictionary. |
|
|
|
|
|
#3 | |
|
Zealot
![]() Posts: 108
Karma: 10
Join Date: Sep 2020
Device: kindle paperwhite3/Oasis2
|
Quote:
|
|
|
|
|
|
|
#4 | |
|
Zealot
![]() Posts: 108
Karma: 10
Join Date: Sep 2020
Device: kindle paperwhite3/Oasis2
|
Quote:
If I created my own dictionary with "register" defined but without "registered" defined" in idx:entry or idx:inform, "registered" could not be matched as "register". |
|
|
|
|
|
|
#5 | |
|
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,763
Karma: 24088559
Join Date: Dec 2010
Device: Kindle PW2
|
Quote:
Code:
C:\Users\<user>\AppData\Local\Amazon\Kindle Previewer 3\lib\fc\bin\kindlegen.exe
|
|
|
|
|
|
|
#6 | |
|
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,160
Karma: 92500001
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
|
Quote:
I used KindleUnpack to convert the dictionary to HTML and found the expected idx:orth element for "register" there along with the definition. There was no idx:iform for "registered" in that entry. In fact the HTML produced using KindleUnpack for that dictionary shows no idx:iform elements at all in the entire file, which is obviously incorrect. Looking at the original MOBI file I see that it does have an Inflection Index, but it has entries in a format that is not supported by KindleUnpack. From that index I determined that the nonsense word "registerthest" is another iform for "register" in that dictionary. I was able to confirm it by doing a dictionary lookup for "registerthest" on my Kindle which showed the definition for "register". |
|
|
|
|
|
|
#7 |
|
Zealot
![]() Posts: 108
Karma: 10
Join Date: Sep 2020
Device: kindle paperwhite3/Oasis2
|
|
|
|
|
|
|
#8 | |
|
Zealot
![]() Posts: 108
Karma: 10
Join Date: Sep 2020
Device: kindle paperwhite3/Oasis2
|
Quote:
So I should report a bug for KindleUnpack(https://github.com/kevinhendricks/KindleUnpack), right? BTW, how do you analyze MOBI file? Just open it with a editor in binary mode, then check the data according to the SPEC of MOBI format(https://wiki.mobileread.com/wiki/MOBI)? |
|
|
|
|
|
|
#9 | ||
|
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,160
Karma: 92500001
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
|
Quote:
Parsing metaInflIndexData Error: Dictionary uses obsolete inflection rule scheme which is not yet supported Quote:
|
||
|
|
|
|
|
#10 |
|
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,160
Karma: 92500001
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
|
I was curious about the Merriam-Webster's Dictionary & Thesaurus so I went ahead an purchased it. As you found KindleUnpack does not produce anything for "carefully".
Looking at the MOBI file I see that the Orthographic Index does actually contain an entry for "carefully" that points to the entry for "careful". Apparently that is another situation that KindleUnpack does not handle. |
|
|
|
|
|
#11 | |
|
Zealot
![]() Posts: 108
Karma: 10
Join Date: Sep 2020
Device: kindle paperwhite3/Oasis2
|
Quote:
BTW, do you know what kind of inflection rules a kindle dictionary has? I just know "idx:iform" so far. |
|
|
|
|
|
|
#12 | |
|
Zealot
![]() Posts: 108
Karma: 10
Join Date: Sep 2020
Device: kindle paperwhite3/Oasis2
|
Quote:
|
|
|
|
|
|
|
#13 |
|
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,160
Karma: 92500001
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
|
That is the only form that I have seen documented. There is at least one other undocumented method that applies simple rules like add “ed” and “ing” to every entry. I don’t know how that is activated.
|
|
|
|
|
|
#14 |
|
Evangelist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 448
Karma: 3000000
Join Date: Nov 2020
Device: none
|
Libmobi(https://github.com/bfabiszewski/libmobi) has better support of dictionaries.
|
|
|
|
|
|
#15 | |
|
Zealot
![]() Posts: 108
Karma: 10
Join Date: Sep 2020
Device: kindle paperwhite3/Oasis2
|
Quote:
|
|
|
|
|
![]() |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Troubleshooting How kindle matches when a word is looked up in a dictionlary | njpig | Amazon Kindle | 1 | 04-14-2022 09:24 AM |
| Though this looked familiar | Kboland | Kobo Tablets | 0 | 01-07-2012 10:38 AM |
| Kitchen Confidential - BN matches Kindle Daily Deal 10/12/11 | wandalynn | Deals and Resources (No Self-Promotion or Affiliate Links) | 0 | 10-12-2011 12:28 PM |