Old 06-04-2009, 10:04 AM   #24
Peter Sorotokin
speaking for myself
Posts: 139
Karma: 2166
Join Date: Feb 2008
Location: San Francisco Bay Area
Device: PRS-505
Quote:
Originally Posted by Nate the great
I disagree about the entry length. I looked at the WordNet Mobi dictionary. The average length was at least twice as long as the link.
Oh, but 2 is approximately 1 ;-). I bet a full index for a 100M Russian-English dictionary is going to be at least 10M, and my gut feeling tells me that's about 10 times more than is practical.

Quote:
Also, while the entries of a dictionary are alphabetical, having a list of just headwords without the entries means you can look at and discard more entries at a time. This will make finding a word (with uncertain spelling) faster.
You can think of my proposal as a search tree (although a very shallow one). I think it is better for searches than a flat array in almost all cases.
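
To illustrate the general idea only (this is a rough sketch, not the actual format I proposed): assume the headwords are sorted and split into fixed-size chunks, with a small top level that holds just the first headword of each chunk. The names build_shallow_index, lookup and chunk_size are made up for this example.

```python
from bisect import bisect_left, bisect_right


def build_shallow_index(headwords, chunk_size):
    """Split sorted headwords into chunks of chunk_size.

    Returns (top, chunks): top holds the first headword of each chunk,
    so it is roughly chunk_size times smaller than the full list.
    """
    chunks = [headwords[i:i + chunk_size]
              for i in range(0, len(headwords), chunk_size)]
    top = [chunk[0] for chunk in chunks]
    return top, chunks


def lookup(top, chunks, word):
    """Return (chunk_number, position_in_chunk) for word, or None if absent."""
    # Level 1: find the last chunk whose first headword is <= word.
    c = bisect_right(top, word) - 1
    if c < 0:
        return None  # word sorts before every headword
    # Level 2: binary search inside that single chunk.
    chunk = chunks[c]
    p = bisect_left(chunk, word)
    if p < len(chunk) and chunk[p] == word:
        return c, p
    return None


words = sorted(["apple", "banana", "cherry", "date", "fig", "grape", "kiwi"])
top, chunks = build_shallow_index(words, 3)
print(top)
print(lookup(top, chunks, "fig"))
```

Only the small top level has to be kept around all the time; a lookup then touches exactly one chunk, which in a real reader could be loaded on demand instead of holding a flat list of every headword.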

Quote:
Question: would it be possible to build the headword index into the toc.ncx file? If so, could it behave like an index?
Per spec, I do not see how, but I'd rather someone else confirm it.