Old 11-08-2024, 02:27 PM   #57
jackie_w
Grand Sorcerer
Posts: 6,255
Karma: 16544692
Join Date: Sep 2009
Location: UK
Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3
Re: Kobo English dictionary changes

For anyone interested in comparing the contents of the 2019 and 2024 versions of the Kobo English dictionary, attached is a zip containing 4 TXT files. Each file is a list of the words contained in one version of the dictionary.
Code:
dicthtml_2019_allwords.txt
dicthtml_2019_headwords.txt
dicthtml_2024_allwords.txt
dicthtml_2024_headwords.txt
I hope a simple list of words doesn't breach copyright ... but you never know.

Notes:
  • headwords: the main lookup words in the dictionary
  • allwords: all headwords plus any variant words included in a headword entry.
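
If you want to diff two of the lists yourself, something like the following Python sketch should do it. It assumes each TXT file is UTF-8 with one word per line, which is an assumption for illustration and should be checked against the actual files. The function names `load_words` and `compare_word_lists` are made up for this example.

```python
from pathlib import Path


def load_words(path):
    """Read a word list into a set.

    Assumption: the file is UTF-8 text with one word per line.
    Blank lines and surrounding whitespace are ignored.
    """
    text = Path(path).read_text(encoding="utf-8")
    return {line.strip() for line in text.splitlines() if line.strip()}


def compare_word_lists(old_words, new_words):
    """Return (removed, added) as sorted lists.

    removed: words present only in the old list
    added:   words present only in the new list
    """
    removed = sorted(old_words - new_words)
    added = sorted(new_words - old_words)
    return removed, added
```

Typical use would be to call `load_words()` on `dicthtml_2019_headwords.txt` and `dicthtml_2024_headwords.txt`, then pass the two sets to `compare_word_lists()` to see which headwords were dropped or added between versions. The same works for the two allwords files.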

A couple of examples of headword/variants as found in the 2024 dicthtml.zip dictionary:
  • headword: "beach"
    variant words: "beaches", "beached", "beaching"
  • headword: "be"
    variant words: "are", "was", "am", "been", "were", "is", "being"
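
To make the headwords/allwords relationship concrete, here is a small Python sketch using only the two example entries above. The `entries` mapping is purely illustrative and is not the internal format of dicthtml.zip; the helper names are hypothetical too.

```python
# Illustrative only: headword -> list of variant words,
# taken from the two examples in this post.
entries = {
    "beach": ["beaches", "beached", "beaching"],
    "be": ["are", "was", "am", "been", "were", "is", "being"],
}


def headwords(entries):
    """The main lookup words: just the keys of the mapping."""
    return set(entries)


def allwords(entries):
    """All headwords plus every variant listed under any headword."""
    words = set(entries)
    for variants in entries.values():
        words.update(variants)
    return words
```

With these two entries, `headwords(entries)` holds 2 words and `allwords(entries)` holds 12, so every headword also appears in the allwords list.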
Attached Files
File Type: zip dicthtml_word_lists.zip (2.18 MB, 270 views)