MobileRead Forums - View Single Post

Peter Sorotokin · 06-04-2009, 11:04 AM

Quote:

Originally Posted by Nate the great

I disagree about the entry length. I looked at the WordNet Mobi dictionary. The average length was at least twice as long as the link.

Oh, but 2 is approximately 1 ;-). I bet a full index for 100M Russian-English dictionary is going to be at least 10M and my gut feeling tells me that's about 10 times more than practical.

Quote:

Also, while the entries of a dictionary are alphabetical, having a list of just headwords without the entries means you can look at and discard more entries at a time. This will make finding a word (with uncertain spelling) faster.

You can think of my proposal as search tree (althouh very shallow). I think it is better for searches than flat array in almost all cases.

Quote:

Question: would it be possible to build the headword index into the toc.ncx file? If so, could it behave like an index?

Per spec, I do not see how, but I'd rather someone else confirm it.