MobileRead Forums - View Single Post

tshering · 11-17-2012, 05:59 PM

Quote:

Originally Posted by shutramp

Would be possible to make one for chinese as well?

Generally speaking, it is possible. But I am not sure whether it is practicable. The dictionary entries are distributed among several html files according to the following rules:
1) If a word/expression consists of one letter "X" it goes into the html file with the name "Xa.html".
2) If a word/expression consist of more than one letter "XYZ" the name of the html file is made from the first 2 letters "XY.html".
3) If the first or second char is not a "letter", but a hyphen, a number and so on it goes to "11.html".

Example for case 1: "漢" goes into "漢a.html"
Example for case 2: "漢字" goes into "漢字.html"
I tried this with a very small sample of a Japanese dictionary (disguised as German dictionary), and it worked.

The problem that I see is the huge number of html files that are needed. I calculated it for a Japanese dictionary with 200000 entries. They would need approximately 60000 html files. This is already close to the maximum number of files in the 32-bit zip format. I do not know whether the Kobo can handle 64-bit zip format. I also do not know how this huge number of files would influence the performance.

As for the Japanese dictionary, the Kobo takes a different approach, which however does no work very well (cf. this post).