View Single Post
Old 11-17-2012, 05:59 PM   #70
tshering
Wizard
tshering ought to be getting tired of karma fortunes by now.tshering ought to be getting tired of karma fortunes by now.tshering ought to be getting tired of karma fortunes by now.tshering ought to be getting tired of karma fortunes by now.tshering ought to be getting tired of karma fortunes by now.tshering ought to be getting tired of karma fortunes by now.tshering ought to be getting tired of karma fortunes by now.tshering ought to be getting tired of karma fortunes by now.tshering ought to be getting tired of karma fortunes by now.tshering ought to be getting tired of karma fortunes by now.tshering ought to be getting tired of karma fortunes by now.
 
Posts: 3,489
Karma: 2914715
Join Date: Jun 2012
Device: kobo touch
Quote:
Originally Posted by shutramp View Post
Would be possible to make one for chinese as well?
Generally speaking, it is possible. But I am not sure whether it is practicable. The dictionary entries are distributed among several html files according to the following rules:
1) If a word/expression consists of one letter "X" it goes into the html file with the name "Xa.html".
2) If a word/expression consist of more than one letter "XYZ" the name of the html file is made from the first 2 letters "XY.html".
3) If the first or second char is not a "letter", but a hyphen, a number and so on it goes to "11.html".

Example for case 1: "漢" goes into "漢a.html"
Example for case 2: "漢字" goes into "漢字.html"
I tried this with a very small sample of a Japanese dictionary (disguised as German dictionary), and it worked.

The problem that I see is the huge number of html files that are needed. I calculated it for a Japanese dictionary with 200000 entries. They would need approximately 60000 html files. This is already close to the maximum number of files in the 32-bit zip format. I do not know whether the Kobo can handle 64-bit zip format. I also do not know how this huge number of files would influence the performance.

As for the Japanese dictionary, the Kobo takes a different approach, which however does no work very well (cf. this post).
tshering is offline   Reply With Quote