Good to know. I will leave it in place and move on to other things.
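That said, I do not have a testcase with lots of emojis in it. Most emojis lie outside the Basic Multilingual Plane, so each one needs two UTF-16 code units, i.e. a high surrogate followed by a low surrogate.
This affects searches where part of the work is done inside Qt (UTF-16) and part is done in Python (which indexes strings by code point, effectively UTF-32), because the same character can sit at a different offset on each side.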
So if anyone has a testcase epub that has a few emojis in it, or that uses other characters encoded as surrogate pairs, I would love to have a copy for testing and debugging the code that maps positions from inside Qt to inside Python.
None of my current testcases exercise any of that code.
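
For anyone wondering what that position mapping involves, here is a rough sketch in Python. It is not the actual code in question; the function names `utf16_to_py_index` and `py_index_to_utf16` are made up for illustration, and it assumes the Qt side reports offsets in UTF-16 code units while the Python side indexes by code point.

```python
def utf16_to_py_index(text: str, utf16_pos: int) -> int:
    """Convert an offset in UTF-16 code units (Qt side) to a Python str index.

    Hypothetical helper. An offset that lands in the middle of a surrogate
    pair is mapped to the character that follows the pair.
    """
    units = 0
    for i, ch in enumerate(text):
        if units >= utf16_pos:
            return i
        # Code points above U+FFFF take two UTF-16 code units.
        units += 2 if ord(ch) > 0xFFFF else 1
    return len(text)


def py_index_to_utf16(text: str, py_pos: int) -> int:
    """Convert a Python str index to an offset in UTF-16 code units."""
    return sum(2 if ord(ch) > 0xFFFF else 1 for ch in text[:py_pos])


# "a", an emoji (U+1F600), then "b"
sample = "a\U0001F600b"
qt_offset_of_b = py_index_to_utf16(sample, 2)             # 3 on the Qt side
py_index_of_b = utf16_to_py_index(sample, qt_offset_of_b)  # back to 2
```

With no characters above U+FFFF in the text the two offsets are always identical, which is why a testcase without emojis never exercises this path.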