But isn't the definition of \w an alphabetic character? Or is it ASCII alphabetic character?
Depending on the sorting order, ö is within the set [a-z] (a..oö..z) or or outside (a..o..z..ö). But I thought that is set by the LOCALE and I was delighted to be able to set Calibre (via a tweak) to other sort order than just ASCII. This was a major improvement for me.
I'd expect that Calibre/Python all read the LOCALE and interpret \w accordingly. Unless this is a Python issue, not a Calibre issue.
I understand Kovid's response, but strictly speaking
non-whitespace (\S) is not equal to alphabetic (\w).
Anyway, did not want to start a religious discussion, just wanted to point out that this is not the expected behavior outside of English and therefore an opportunity to improve Calibre even further.
Thanxx, Mixx
|