View Single Post
Old 05-03-2009, 10:53 PM   #2
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Quote:
Originally Posted by JicMic View Post
Does anyone have any .txt word lists. Without the definitions.

Thanks.
Here's a wordlist from the Webster's Dictionary 1913 I converted a while ago. It contains approx 110,000 words; with a few duplicates!

It's a txt version of the html wordlist posted here.

EDIT: This seems to be a very popular attachment, so I think it would be beneficial to finally remove any duplicated words ("dups") therein. In fact, I've now trimmed it down to only about 95,000 unique words/phrases (15694 words were deleted) in the newer attachment i.e. -no-dups.zip

EDIT: Another sorting of the same word list showing, in increasing order, "words" with 'n' character(s) and their frequency count i.e. -increasing order of characters.zip. Below is a summary count of this ordering:
Code:
Wordlist for Webster's Dictionary 1913  ver. 2.1 
With Summary count of "words" with 'n' character(S) 
by nrapallo (Nick Rapallo) - November 2009

'n' Characters  Count       %
-----------------------------
_ 1 character   26       0.0%
_ 2 characters  96       0.1%
_ 3 characters  924      1.0%
_ 4 characters  3413     3.6%
_ 5 characters  6066     6.4%
_ 6 characters  9684    10.2%
_ 7 characters  11986   12.6%
_ 8 characters  13870   14.6%
_ 9 characters  13689   14.4%
_10 characters  11788   12.4%
_11 characters  8892     9.3%
_12 characters  6283     6.6%
_13 characters  3968     4.2%
_14 characters  2240     2.4%
_15 characters  1187     1.2%
_16 characters  568      0.6%
_17 characters  280      0.3%
_18 characters  117      0.1%
_19 characters  57       0.1%
_20 characters  23       0.0%
_21 characters  18       0.0%
_22 characters  10       0.0%
_23 characters  5        0.0%
_24 characters  5        0.0%
_25 characters  6        0.0%
_26 characters  2        0.0%
_27 characters  1        0.0%
_30 characters  1        0.0%
_31 characters  1        0.0%
_32 characters  1        0.0%
_33 characters  1        0.0%
_34 characters  1        0.0%
_35 characters  2        0.0%
-----------------------------
"Word" count    95211  100.0%
=============================
Enjoy!
-NR

Last edited by nrapallo; 11-24-2009 at 05:23 PM. Reason: added newer version without any duplicate words ("dups")
nrapallo is offline   Reply With Quote