Quote:
Originally Posted by naki
I noticed that hyphenating an azw3 file removes ligatures
For example, the character fi (unicode U+FB01), which normally appears with the f connected to the dot of the i (with recent firmware and Bookerly font), display as separate two letters fi after hyphenating the azw3
Same for the ligatures ff, ffi, etc.. as well as with the font Caecilia Condensed (albeit less obvious). They are displayed as separate letters ff, ffi, etc... after hyphenating.
Is this an intended behaviour of the plugin, or something else?
|
Hi Naki, as I see it is the same problem about non-standard hyphenation that I outlined in my post
#211.
You will also find the suggested patch in that post.
My little modification adds an additional check that after removing the suggested soft-hyphen, the original word would be returned. All suggestions will be discarded where the result would be different, that is, at the end, only standard hyphenation points will be accepted.
Obviously the current behaviour cannot be intended. Depending on language, the results can be much worse than yours.
Fortunately enough for most folks, this problem does not persist for most languages (not even for German any more, with the new language rules).