You don't really need to modify the EPUB standard; the following should do the trick:
Split the HTML containing the definitions into sub-files, each containing only the definitions for words that start with a specific pair of letters. There will be at most 26*26 = 676 such files. In the NCX, add a navPoint for each file, with its text being the two letters that the file covers. Then add an entry to the OPF file indicating that the EPUB is a dictionary. When asked for the definition of a word, the reader software then has to do the following:
Scan the (at most 676) entries in the NCX file to find the correct HTML file, then parse that HTML file to find the word.
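The lookup could be sketched roughly as follows. This is only an illustration under a few assumptions that are not part of the proposal above: each definition file is well-formed XHTML holding `<dt>word</dt><dd>definition</dd>` pairs, and the names `build_index`, `find_definition`, `lookup` and the sample data are all made up for the example.

```python
import xml.etree.ElementTree as ET

NCX_NS = {"ncx": "http://www.daisy.org/z3986/2005/ncx/"}
XHTML_NS = {"x": "http://www.w3.org/1999/xhtml"}

def build_index(ncx_text):
    """Map each navPoint label (e.g. "ap") to the file in its <content src>."""
    root = ET.fromstring(ncx_text)
    index = {}
    for point in root.iterfind(".//ncx:navPoint", NCX_NS):
        label = point.findtext("ncx:navLabel/ncx:text", default="", namespaces=NCX_NS)
        content = point.find("ncx:content", NCX_NS)
        if label and content is not None:
            index[label.strip().lower()] = content.get("src")
    return index

def find_definition(xhtml_text, word):
    """Assumed layout: <dl> lists where each <dt> is followed by its <dd>."""
    root = ET.fromstring(xhtml_text)
    wanted = word.lower()
    for dl in root.iterfind(".//x:dl", XHTML_NS):
        children = list(dl)
        # Walk adjacent pairs and keep only dt/dd neighbours.
        for term, definition in zip(children, children[1:]):
            if (term.tag.endswith("}dt") and definition.tag.endswith("}dd")
                    and (term.text or "").strip().lower() == wanted):
                return "".join(definition.itertext()).strip()
    return None

def lookup(word, ncx_text, read_file):
    """read_file is a hypothetical callable returning a file's text by name."""
    src = build_index(ncx_text).get(word[:2].lower())
    if src is None:
        return None  # no file exists for this two-letter prefix
    return find_definition(read_file(src), word)

# Made-up sample data standing in for the contents of a real EPUB.
SAMPLE_NCX = """<ncx xmlns="http://www.daisy.org/z3986/2005/ncx/" version="2005-1">
  <navMap>
    <navPoint id="ap" playOrder="1">
      <navLabel><text>ap</text></navLabel>
      <content src="ap.html"/>
    </navPoint>
    <navPoint id="ba" playOrder="2">
      <navLabel><text>ba</text></navLabel>
      <content src="ba.html"/>
    </navPoint>
  </navMap>
</ncx>"""

SAMPLE_FILES = {
    "ap.html": '<html xmlns="http://www.w3.org/1999/xhtml"><body><dl>'
               "<dt>apple</dt><dd>A round fruit.</dd>"
               "<dt>apron</dt><dd>A garment worn over clothes.</dd>"
               "</dl></body></html>",
    "ba.html": '<html xmlns="http://www.w3.org/1999/xhtml"><body><dl>'
               "<dt>bat</dt><dd>A flying mammal.</dd>"
               "</dl></body></html>",
}

if __name__ == "__main__":
    print(lookup("apple", SAMPLE_NCX, SAMPLE_FILES.__getitem__))
```

A real reader would probably build the index once when the book is opened rather than on every lookup; it is rebuilt here only to keep the sketch short.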
If two letters result in HTML files that are too large, use three letters instead.
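Choosing the prefix length when generating the files might look something like this. It is a sketch only: `bucket_words`, `smallest_prefix_len` and the `max_per_file` limit are hypothetical names, and "too large" is measured here simply as the number of words per file.

```python
from collections import defaultdict

def bucket_words(words, prefix_len=2):
    """Group words by their first prefix_len letters (case-insensitive)."""
    buckets = defaultdict(list)
    for word in words:
        # Words shorter than prefix_len end up in a bucket named after the whole word.
        buckets[word[:prefix_len].lower()].append(word)
    return dict(buckets)

def smallest_prefix_len(words, max_per_file, max_len=4):
    """Return the shortest prefix length (starting at 2) whose largest
    bucket fits within max_per_file, or max_len if none does."""
    for n in range(2, max_len + 1):
        largest = max((len(b) for b in bucket_words(words, n).values()), default=0)
        if largest <= max_per_file:
            return n
    return max_len

if __name__ == "__main__":
    words = ["apple", "apron", "apt", "bat"]
    print(bucket_words(words))
    print(smallest_prefix_len(words, max_per_file=2))
```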
The HTML files should be designed with minimal in-file markup to speed up processing.