Quote:
Originally Posted by capidamonte
Regex is no more difficult for these than for characters; don't have to open Character Map or type ALT-NUMPAD codes, so it might be easier.
|
It's possible you misunderstand me when it comes to "making my regex more difficult." Mainly, I mean that I already have quite an extensive personal collection of specific regexps that I don't want to have to overhaul. And besides my fingers have the ALT-NUMPAD codes down cold. Second nature. I
want to use them.
I also use a lot of the unicode regex classes: \p{P} doesn't know what html entities are and won't match them. Neither will \p{Pd} or my favorite... \p{Po}. My custom tailored regexps are
polluted with unicode classes like that.
I guess I don't understand why this even has to be an issue. People should be able to make their own decision with regard to entity vs character. That's the way 0.5.3 works for me: if I enter the mdash entity it
stays an entity... if I enter the mdash character it
stays a character. Beautiful.