Quote:
Originally Posted by tshering
I do not know what kind of encoding ÒüôÒüå is. Inside the html.gz-file the filename is again encoded in utf-8.
|
I have finally resolved this by using MAC OS X
It compreses (zip) the files to format which can be read by Kobo. I have created a test jap-eng dictionary and it works fine as Japanese dictionary
Quote:
Originally Posted by tshering
It is nice that the dictionary zip file contains now the source file for "words" too!
|
Yes, it is nice but I cannot create a proper
words file for my dictionary
Could you help me creating a good one?
My source list looks like the
words.original file. I am using Marisa to create
words file and it works ok using marisa-lookup or marisa-predictive-search tool. But when the file is added to my test dictionary (to zip file) it does not work in Kobo
Using original
words file with my dictionary works ok, however. It means my
words file is wrong.
What I have observed using Marisa tools is a different output using original Kobo
words file comparing to mine. Below is the output from marisa-predictive-search.
Original file:
Code:
いえ
18 found
71 いえ いえ
3502 いえ【家】 いえ
10968 いえい【遺影】 いえ
24088 いえがまえ【家構え】 いえ
24089 いえがら【家柄】 いえ
10969 いえき【胃液】 いえ
10970 いえじ【家路】 いえ
10971 いえで【家出】 いえ
10972 いえども【雖も】 いえ
10973 いえなみ【家並み】 いえ
My file:
Code:
いえ
43 found
58634 いえい【遺影】
58635 いえい【遺詠】
37152 いえい【家居】
37153 いえいえ【否否・否々】
37154 いえいえ【家々・家家】
37155 いえつき【家付き】
37156 いえつきのむすめ【家付きの娘】
37157 いえつきむすめ【家付き娘・家付娘】
18160 いえつづき【家続き】
58636 いえにかえる【家に帰る】