Thread: Dictionary woes
Old 11-04-2011, 09:48 AM   #12
Doitsu
Quote:
Originally Posted by omk3
I had come across this link but I'm not sure I understand what it says. Should I add all possible inflections to all words?
Yes, that's what dictionary publishers do.

Quote:
Originally Posted by omk3
That is surely too much work.
Not if you use a script. I created a very simple VB script that adds inflections to an existing .html file from a tab-delimited inflection list file.
It reads each headword between <idx:orth>...</idx:orth> and then tries to find a matching entry in the inflection list. The tough part is finding a free and comprehensive inflection list. (I found a nice Spanish one as part of an Open Source part-of-speech tagger.) If you find one and cannot figure out how to add inflections, I might be able to help you.
Ideally, both dictionary data and inflections should be Open Source/GPL2 so that the resulting dictionary can be posted on MR.
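
To give a rough idea of what such a script does, here is a minimal sketch in Python rather than VB script. It is not the script mentioned above. It assumes each line of the inflection list looks like headword<TAB>form1<TAB>form2..., and that inflections go into nested <idx:infl>/<idx:iform value="..."/> tags inside <idx:orth>, which is how I understand the Kindle dictionary markup. The function names are made up for illustration.

```python
import html
import re

# Matches a plain headword element with no attributes, as in the post.
ORTH_RE = re.compile(r"<idx:orth>(.*?)</idx:orth>", re.DOTALL)


def load_inflections(lines):
    """Build {headword: [inflected forms]} from tab-delimited lines.

    Assumed layout (hypothetical): headword<TAB>form1<TAB>form2...
    Lines without at least one tab-separated form are skipped.
    """
    table = {}
    for line in lines:
        fields = [f.strip() for f in line.rstrip("\n").split("\t")]
        if len(fields) < 2 or not fields[0]:
            continue
        head = fields[0]
        forms = table.setdefault(head, [])
        for form in fields[1:]:
            # Skip empty fields, the headword itself, and duplicates.
            if form and form != head and form not in forms:
                forms.append(form)
    return table


def add_inflections(markup, table):
    """Insert an <idx:infl> block into every <idx:orth> with a known headword."""

    def expand(match):
        headword = match.group(1).strip()
        forms = table.get(headword)
        if not forms:
            # No matching entry in the inflection list: leave it untouched.
            return match.group(0)
        iforms = "".join(
            '<idx:iform value="{}"/>'.format(html.escape(form, quote=True))
            for form in forms
        )
        return "<idx:orth>{}<idx:infl>{}</idx:infl></idx:orth>".format(
            match.group(1), iforms
        )

    return ORTH_RE.sub(expand, markup)
```

To use it, you would read the inflection list and the dictionary .html as UTF-8 text, pass the list's lines to load_inflections, pass the .html text and the resulting table to add_inflections, and write the result to a new file. Entries whose <idx:orth> already carries attributes or inflections are not matched by this simple pattern and stay as they are.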

BTW, Duokan Lite has built-in StarDict support.