I've created a new release that addresses some of those issues described above. I'm not entirely sure how to tackle the hyphenation issues with all the prefixes yet, so that's not included.
Quote:
- Added unnecssary diacritics test (if OCR introduced accents and umlauts etc)
- Handled numbers in hyphenation (like 98-99, 5-6 etc), those are almost always page numbers and never an error
- Handled subscript and superscript so it doesn't see it as 1 word
- Open epub files from command line (so you can do open with ...)
|