Quote:
Originally Posted by JicMic
Does anyone have any .txt word lists. Without the definitions.
Thanks.
|
Here's a wordlist from the Webster's Dictionary 1913 I converted a while ago. It contains approx 110,000 words; with a few duplicates!
It's a txt version of the html wordlist posted
here.
EDIT: This seems to be a very popular attachment, so I think it would be beneficial to finally remove any duplicated words ("dups") therein. In fact, I've now trimmed it down to only about 95,000 unique words/phrases (15694 words were deleted) in the newer attachment i.e.
-no-dups.zip
EDIT: Another sorting of the same word list showing, in increasing order, "words" with 'n' character(s) and their frequency count i.e.
-increasing order of characters.zip. Below is a summary count of this ordering:
Code:
Wordlist for Webster's Dictionary 1913 ver. 2.1
With Summary count of "words" with 'n' character(S)
by nrapallo (Nick Rapallo) - November 2009
'n' Characters Count %
-----------------------------
_ 1 character 26 0.0%
_ 2 characters 96 0.1%
_ 3 characters 924 1.0%
_ 4 characters 3413 3.6%
_ 5 characters 6066 6.4%
_ 6 characters 9684 10.2%
_ 7 characters 11986 12.6%
_ 8 characters 13870 14.6%
_ 9 characters 13689 14.4%
_10 characters 11788 12.4%
_11 characters 8892 9.3%
_12 characters 6283 6.6%
_13 characters 3968 4.2%
_14 characters 2240 2.4%
_15 characters 1187 1.2%
_16 characters 568 0.6%
_17 characters 280 0.3%
_18 characters 117 0.1%
_19 characters 57 0.1%
_20 characters 23 0.0%
_21 characters 18 0.0%
_22 characters 10 0.0%
_23 characters 5 0.0%
_24 characters 5 0.0%
_25 characters 6 0.0%
_26 characters 2 0.0%
_27 characters 1 0.0%
_30 characters 1 0.0%
_31 characters 1 0.0%
_32 characters 1 0.0%
_33 characters 1 0.0%
_34 characters 1 0.0%
_35 characters 2 0.0%
-----------------------------
"Word" count 95211 100.0%
=============================
Enjoy!
-NR