Old 09-29-2017, 11:49 AM   #12
Namenlos
Quote:
Originally Posted by Tex2002ans
[…]Along with the "nonhyphenated" + "non-hyphenated" words potentially being a mistake, accented words should also be compared with their non-accented versions:

"résumé" + "resume"
"coöperation" + "cooperation"
"rôle" + "role"

Usually the book sticks with one style, and when these are mixed and matched, the OCR has messed up (or even the original book mistakenly forgot to add accents in some places).
This leads to problems in German: hatten/hätten and mochte/möchte are pairs of distinct, very common words. Both forms of each pair are in my dictionary.txt, yet they still get marked as "Unnecessary diacritics".
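
To illustrate what I mean, here is a rough Python sketch of how the check could skip such pairs. This is not the tool's actual code (I have not checked how it implements the comparison); the function names and the idea of passing the dictionary in as a set are my own assumptions. The accented/unaccented pair is only reported when at least one of the two forms is missing from the dictionary.

```python
import unicodedata


def strip_diacritics(word):
    """Remove combining marks, e.g. 'hätten' -> 'hatten'."""
    decomposed = unicodedata.normalize("NFD", word)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return unicodedata.normalize("NFC", stripped)


def suspicious_diacritic_pairs(book_words, dictionary):
    """Hypothetical check: pair up accented and unaccented variants that both
    occur in the book, but skip pairs where both forms are dictionary words."""
    seen = set(book_words)
    pairs = []
    for word in sorted(seen):
        plain = strip_diacritics(word)
        if plain == word or plain not in seen:
            continue  # no accents, or the unaccented form never occurs
        if word in dictionary and plain in dictionary:
            continue  # both are legitimate words, e.g. hatten/hätten
        pairs.append((word, plain))
    return pairs


# Example input: the dictionary knows both German forms but only "resume"/"role".
book_words = ["hatten", "hätten", "résumé", "resume", "rôle", "role"]
dictionary = {"hatten", "hätten", "resume", "role"}
print(suspicious_diacritic_pairs(book_words, dictionary))
```

With that rule, hatten/hätten would no longer be reported, while résumé/resume and rôle/role from the quoted examples still would be, because only one form of each is in the example dictionary.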

Anyway, I found the project on GitHub: https://github.com/drake7707/epubspellchecker Thanks for releasing the source! Could you please add a license, so that people can file pull requests etc.?