![]() |
#1 |
Enthusiast
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 48
Karma: 41612
Join Date: Jan 2015
Location: Poland, Warsaw
Device: Kindle 3G
|
Polish dictionary (problem with polish signs)
Hi!
I'm building polish dictionary for my kindle 3 (and will share it somewhere...). But i've found one quite annoying problem: there are for example two words: laskę and łaskę. first one is recognised well, but second one shows definition for laskę, but the meaning should be different. can i make this dictionary work with this kind of words? regards! my opf file: Code:
<?xml version="1.0"?><!DOCTYPE package SYSTEM "oeb1.ent"> <!-- the command line instruction 'prcgen dictionary.opf' will produce the dictionary.prc file in the same folder--> <!-- the command line instruction 'mobigen dictionary.opf' will produce the dictionary.mobi file in the same folder--> <package unique-identifier="uid" xmlns:dc="Dublin Core"> <metadata> <dc-metadata> <dc:Identifier id="uid">Słownik Języka Polskiego, sjp.pl</dc:Identifier> <!-- Title of the document --> <dc:Title><h2>[ZM] sjp.pl (2016-05-14b)</h2></dc:Title> <dc:Language>pl-PL</dc:Language> </dc-metadata> <x-metadata> <DictionaryInLanguage>pl-PL</DictionaryInLanguage> <DictionaryOutLanguage>pl-PL</DictionaryOutLanguage> </x-metadata> </metadata> <!-- list of all the files needed to produce the .prc file --> <manifest> <item id="dictionary0" href="slo0.html" media-type="text/x-oeb1-document"/> (...) <item id="dictionary25" href="slo25.html" media-type="text/x-oeb1-document"/> </manifest> <!-- list of the html files in the correct order --> <spine> <itemref idref="dictionary0"/> <itemref idref="dictionary1"/> (...) <itemref idref="dictionary25"/> </spine> <guide> <reference type="search" title="Dictionary Search" onclick= "index_search()"/> </guide> </package> Code:
<?xml version="1.0" encoding="utf-8"?> <html xmlns:idx="www.mobipocket.com" xmlns:mbp="www.mobipocket.com" xmlns:xlink="http://www.w3.org/1999/xlink"> <body> <mbp:pagebreak/> <mbp:frameset> <mbp:slave-frame display="bottom" device="all" breadth="auto" leftmargin="0" rightmargin="0" bottommargin="0" topmargin="0"> <div align="center" bgcolor="yellow"/> <a onclick="index_search()">Dictionary Search</a> </div> </mbp:slave-frame> <mbp:pagebreak/> (...) <idx:entry name="word" scriptable="yes"> <h2> <idx:orth> łaska <idx:infl inflgrp="odmiany"> <idx:iform name="odmiana" value="łasce" /> <idx:iform name="odmiana" value="łask" /> <idx:iform name="odmiana" value="łaskach" /> <idx:iform name="odmiana" value="łaskami" /> <idx:iform name="odmiana" value="łaską" /> <idx:iform name="odmiana" value="łaskę" /> <idx:iform name="odmiana" value="łaski" /> <idx:iform name="odmiana" value="łasko" /> <idx:iform name="odmiana" value="łaskom" /> </idx:infl> </idx:orth> <idx:key key="łaska"> </h2> <br/> łaskawość, miłość, przychylność<br/> ssak, mały drapieżnik z rodziny łasicowatych; łasica<br/> przymiotnik od: Łask, rzadziej od: Łazy<br/> miasto w Polsce<br/> nazwisko </idx:entry> <hr /> (...) <idx:entry name="word" scriptable="yes"> <h2> <idx:orth> laska <idx:infl inflgrp="odmiany"> <idx:iform name="odmiana" value="lasce" /> <idx:iform name="odmiana" value="lasek" /> <idx:iform name="odmiana" value="laskach" /> <idx:iform name="odmiana" value="laskami" /> <idx:iform name="odmiana" value="laską" /> <idx:iform name="odmiana" value="laskę" /> <idx:iform name="odmiana" value="laski" /> <idx:iform name="odmiana" value="lasko" /> <idx:iform name="odmiana" value="laskom" /> </idx:infl> </idx:orth> <idx:key key="laska"> </h2> <br/> rodzaj kijka używanego przez osoby starsze oraz niepełnosprawne do podpierania się w czasie chodzenia$ książkowo: symbol niektórych godności i dostojeństw;<br/> długi i cienki pręt czego;<br/> potocznie: członek męski;<br/> środowiskowo: ocena niedostateczna;<br/> młodzieżowo o bardzo zgrabnej, atrakcyjnej dziewczynie; dziunia, lacha, lachon<br/> gwary laskie - mieszane gwary polsko-czeskie (w rejonie Raciborza)<br/> nazwisko<br/> nazwisko </idx:entry> <hr /> </mbp:frameset> </body> </html> adding <output encoding="utf-8" flatten-dynamic-dir="yes"/> won't help. Last edited by San Zamoyski; 05-16-2016 at 06:17 AM. |
![]() |
![]() |
![]() |
#2 |
Enthusiast
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 48
Karma: 41612
Join Date: Jan 2015
Location: Poland, Warsaw
Device: Kindle 3G
|
bump?
|
![]() |
![]() |
Advert | |
|
![]() |
#3 |
Going Viral
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 17,212
Karma: 18210809
Join Date: Feb 2012
Location: Central Texas
Device: No K1, PW2, KV, KOA
|
Maybe nobody knows the answer.
|
![]() |
![]() |
![]() |
#4 |
Enthusiast
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 48
Karma: 41612
Join Date: Jan 2015
Location: Poland, Warsaw
Device: Kindle 3G
|
but it might be that someone knows that it is not possible at all...
![]() |
![]() |
![]() |
![]() |
#5 |
Zealot
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 122
Karma: 43580
Join Date: Apr 2016
Device: KPW3, Kobo Clara HD, Onyx Boox Nova 2
|
I'd suggest checking this: http://1manfactory.com/create-your-o...uage-for-free/
Although I'm not sure, whether it'll help you. Also check the file encoding. <output encoding="utf-8" flatten-dynamic-dir="yes"/> Should stay. |
![]() |
![]() |
Advert | |
|
![]() |
#6 | |
Evangelist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 405
Karma: 2330752
Join Date: May 2012
Device: kt
|
It will not work in search box, as Kindle uses fuzzy search. However it should work in a pop-up box, looking up highlighted word. At least it works on my Kindle Touch.
You could also experiment with "exact" keyword, but I don't see any difference when using it. See Kindle Publishing Guidelance: Quote:
Code:
<idx:key>..</idx:key> Last edited by baf; 05-19-2016 at 11:29 AM. |
|
![]() |
![]() |
![]() |
#7 |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,723
Karma: 24031401
Join Date: Dec 2010
Device: Kindle PW2
|
You'll need to add the exact="yes" attribute:
For example: Code:
<idx:iform value="łasce" exact="yes" />
Spoiler:
|
![]() |
![]() |
![]() |
#8 |
Evangelist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 405
Karma: 2330752
Join Date: May 2012
Device: kt
|
|
![]() |
![]() |
![]() |
#9 | |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,723
Karma: 24031401
Join Date: Dec 2010
Device: Kindle PW2
|
This information comes directly from the Kindle Publishing Guidelines.
Quote:
Did you also test the two variations on an actual Kindle and did you get the same results for both versions? (I only tested my version and it worked.) |
|
![]() |
![]() |
![]() |
#10 | ||
Evangelist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 405
Karma: 2330752
Join Date: May 2012
Device: kt
|
Quote:
Quote:
Both dictionaries work the same on my KT. When searching from the search box (index items) it doesn't recognise different forms of the words. Whatever I type in the search box it brings first entry: "łaska". It is probably limitation of the reader's fuzzy search. At the same time both dictionaries work correctly when highlighting word in a document. Highlighting "łaska" brings pop-up definition for "łaska", highlighting "laska" shows "laska". Maybe some other conditions must be fulfilled in order to make the keyword be used. |
||
![]() |
![]() |
![]() |
#11 |
Enthusiast
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 48
Karma: 41612
Join Date: Jan 2015
Location: Poland, Warsaw
Device: Kindle 3G
|
Hi!
Sorry for delay with answer. I've generated new version of my dictionary, but still, no luck (both definitions from laska). exact="yes" (to evary one) seems to make no difference. I've tested Doitsu dictionary - still no luck. Both words Laskę and Łaskę recognised as łaska. It seems that in-file order makes difference. Thanks, all of You, for helping solving this problem! Best regards! |
![]() |
![]() |
![]() |
#12 | |
Going Viral
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 17,212
Karma: 18210809
Join Date: Feb 2012
Location: Central Texas
Device: No K1, PW2, KV, KOA
|
Quote:
|
|
![]() |
![]() |
![]() |
#13 | |
Enthusiast
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 48
Karma: 41612
Join Date: Jan 2015
Location: Poland, Warsaw
Device: Kindle 3G
|
Quote:
|
|
![]() |
![]() |
![]() |
#14 |
Groupie
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 195
Karma: 42216
Join Date: Oct 2013
Location: Poland
Device: Kindles: KOA1, KV
|
I also can confirm that exact="yes" does not work as described in Kindle Publishing Guidelines.
|
![]() |
![]() |
![]() |
#15 | ||
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,723
Karma: 24031401
Join Date: Dec 2010
Device: Kindle PW2
|
Quote:
Since the Kindle Publishing Guidelines specifically mention the Spanish words una and uña, can you create a short Spanish book with both words, download one of the free Spanish dictionaries from Amazon, e.g. the DRAE, and look up both words? If you get the definition of the unaccented word (una) both times it's definitely a firmware issue. Quote:
Since you own both a Kindle 4 and a PW1, can you test my dictionary on both machines? |
||
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
GUI [Polish] vs. CLI ebook-polish | phossler | Calibre | 6 | 10-01-2015 08:27 AM |
E-reader with Dutch/English or Dutch/Polish dictionary | tttx | Which one should I buy? | 17 | 08-20-2015 05:42 AM |
Dictionary question: changing word delimiters for french dictionary | oecherprinte | Amazon Kindle | 1 | 05-09-2011 04:45 AM |
Converting PWN Oxford Polish-English dictionary to Mobipocket | owl123 | Workshop | 1 | 05-24-2010 07:07 AM |
Oxford built-in dictionary disappears after changing default dictionary | YYZscientist | Amazon Kindle | 4 | 01-24-2010 08:42 PM |