Originally Posted by meme
I'm not sure. Do you have an example epub (a link or just a small file is fine) that contains ligatures? The code literally just reports each unicode character that appears in the text (and if it has an entity name).
The ePub does not contain the ligatures. ADE 2.0 and Calibre (and maybe other reading software) converts to using ligatures. So for example, if your text have a word such as flight, the fl will be converted to the ligature and displayed that way. Your code would have to handle fl as separate fl and as the ligature for reading software that does and does not convert to ligatures.
Oh and would it be possible to display each character for a given font for embedded fonts?