Quote:
Originally Posted by jackie_w
You mentioned 'unicode properties'. I'm open to suggestions for a better simple algorithm to include a wider variety of languages.
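If you're using Unicode characters and strings (as you should) in Python, take a look at unicodedata.category(). It takes a single character and returns the abbreviated General Category value that the Unicode standard assigns to it, for example Lu (uppercase letter), Ll (lowercase letter), Nd (decimal digit) or Po (other punctuation).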
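As a rough sketch of how that could be used (Python 3 assumed; the helper names is_letter and count_letters are made up for illustration and are not part of any existing code in this thread), you can test the first letter of the category instead of hard-coding ASCII ranges:

```python
import unicodedata

def is_letter(ch):
    # Hypothetical helper: every General Category that starts with "L"
    # (Lu, Ll, Lt, Lm, Lo) is some kind of letter, whatever the script.
    return unicodedata.category(ch).startswith("L")

def count_letters(text):
    # Hypothetical helper: count the characters Unicode classifies as letters.
    return sum(1 for ch in text if is_letter(ch))

if __name__ == "__main__":
    # Print the category of a few characters from different scripts.
    for ch in "aЖ中5!":
        print(repr(ch), unicodedata.category(ch))
```

The same idea extends to other groups: categories starting with "N" are numbers, "P" punctuation, "Z" separators, so a per-language lookup table may not be needed for a simple algorithm.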