Quote:
Originally Posted by ahi
How would you go about building such a database, Ankh?
Just processing oodles and oodles of PG eTexts, and manually hyphenate the words therefrom?
|
Start with the source of nrapallo Webster 1913 dictionary.
Then yes, expect users to help with the growth of the database. The database-assisted hyphenation engine can ask for intervention whenever a word is not in the database. When job is done, process the database, extract the words that were added to basic text file, one line per hyphenated word, submit such file back to the maintainer. Review (use dictionaries and any other tools available), merge changes, new version of the database.
Open source.