I am making it easier to determine which letters to retain. It will soon be possible to drag and drop text files (currently txt, log, (x)htm(l), xml and docx) onto the character box to load their contents into it. After that, the unique characters can be found with the press of a button.
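As a rough illustration of the unique-character step, here is a minimal Python sketch. It is not the tool's actual code: the function name `unique_characters` and the choice to skip line breaks are assumptions made for the example.

```python
def unique_characters(text: str, skip: str = "\r\n") -> str:
    """Return each distinct character of `text` once, in first-seen order.

    `skip` lists characters to ignore entirely (line breaks by default);
    this default is an assumption, not documented behavior of the tool.
    """
    # dict.fromkeys keeps insertion order (Python 3.7+), so duplicates
    # collapse while the order of first appearance is preserved.
    seen = dict.fromkeys(ch for ch in text if ch not in skip)
    return "".join(seen)


if __name__ == "__main__":
    print(unique_characters("hello world"))  # prints "helo wrd"
```

Keeping the first-seen order rather than sorting makes the result easy to compare against the dropped text, but sorting the output would work just as well for deciding which letters to retain.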
I am currently looking into importing ePUB files and into making the tool usable from the command line.