Old 08-19-2013, 10:26 AM   #6
tuxor
Addict
Posts: 320
Karma: 99999
Join Date: Oct 2011
Location: Germany
Device: Onyx Boox M92, Icarus Illumina E653
I think you are mixing up dictionary formats somehow. Here is the StarDict format specification: http://code.google.com/p/babiloo/wiki/StarDict_format - it doesn't include anything like morphology or inflections. And of course there is no "linguistic compression technique" in the StarDict format - the contents are stored either as plain gzip data or completely uncompressed.

With the StarDict format, it is indeed possible to store the dictionary entries in an arbitrary markup language - but only HTML, MediaWiki, Pango and XDXF are officially supported. Of these, only XDXF can handle morphology and inflections. Looking up inflected word forms is definitely not part of the StarDict format specification, and there is no canonical implementation of such a feature based on the StarDict format.
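To illustrate why an inflected form simply misses: as far as I read the specification, each record in the .idx file is just a NUL-terminated UTF-8 headword followed by a big-endian offset and size pointing into the .dict data. Here is a minimal Python sketch under that assumption. The function name parse_idx and the sample bytes are made up for this post and are not taken from any real dictionary or tool.

```python
import struct


def parse_idx(data: bytes, offset_bits: int = 32) -> dict:
    """Parse raw .idx bytes into {headword: (offset, size)}.

    Assumed record layout: NUL-terminated UTF-8 headword, then a
    big-endian offset (32 or 64 bit), then a big-endian 32-bit size.
    """
    offset_fmt = ">Q" if offset_bits == 64 else ">I"
    offset_len = struct.calcsize(offset_fmt)
    entries = {}
    pos = 0
    while pos < len(data):
        end = data.index(b"\x00", pos)  # end of the headword string
        word = data[pos:end].decode("utf-8")
        pos = end + 1
        (offset,) = struct.unpack_from(offset_fmt, data, pos)
        pos += offset_len
        (size,) = struct.unpack_from(">I", data, pos)
        pos += 4
        entries[word] = (offset, size)
    return entries


# Hand-built sample index with two headwords (illustrative data only).
sample = (
    b"go\x00" + struct.pack(">II", 0, 10)
    + b"house\x00" + struct.pack(">II", 10, 25)
)
index = parse_idx(sample)
print(index.get("house"))   # (10, 25)
print(index.get("houses"))  # None: only the exact headword is indexed
```

The lookup is a plain string match on the headword, so anything smarter than that has to come from the software that reads the index.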

You are definitely mixing up StarDict with some competing dictionary format like ABBYY Lingvo, Mobipocket or Babylon BGL, all of which have certain morphology features and might include "linguistic compression techniques" - since those formats are not officially documented, I can't tell.


// EDIT

By the way: maybe it's not even desirable to have the morphology information as part of the dictionary. It seems slightly more feasible to build the morphology support into the dictionary software - the software would then use a tool like hunspell and existing hunspell dictionaries to recognize inflected forms.
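A rough Python sketch of that idea, just to show the control flow: try the exact headword first and, if it misses, ask a stemmer for candidate base forms. Everything here is hypothetical. lookup and toy_stem are names I invented, and toy_stem is a hard-coded stand-in for a real hunspell call, which this sketch does not make.

```python
from typing import Callable, Dict, Iterable, Optional


def lookup(
    word: str,
    entries: Dict[str, str],
    stem: Callable[[str], Iterable[str]],
) -> Optional[str]:
    """Return the entry for word, falling back to its stems on a miss."""
    if word in entries:
        return entries[word]
    # The dictionary itself knows nothing about inflection; the
    # reader software asks the stemmer for possible base forms.
    for candidate in stem(word):
        if candidate in entries:
            return entries[candidate]
    return None


def toy_stem(word: str) -> list:
    # Stand-in for a morphological analyser such as hunspell.
    # A real implementation would consult hunspell dictionaries.
    table = {"houses": ["house"], "went": ["go"]}
    return table.get(word, [])


entries = {"house": "a building for living in", "go": "to move or travel"}
print(lookup("houses", entries, toy_stem))  # a building for living in
print(lookup("xyzzy", entries, toy_stem))   # None
```

The nice part of this split is that the dictionary files stay untouched and the same stemmer works for every dictionary of that language.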

A real world example of "proper" morphology support: http://www.perseus.tufts.edu/hopper/...eek&prior=a%29

Last edited by tuxor; 08-19-2013 at 11:20 AM.