Quote:
Looking a bit closer, it looks like there's been ~10,000 new CJK characters added to Unicode since then.
(~5,700 in Unicode 8.0 + ~5,000 in Unicode 13.0.)
(And ~4,200 more CJK characters are going to be added in Unicode 15.0, which will be coming out later this year.)
That \x{} numbers method would fail, if it doesn't cover all those new cases.
|
All the characters I am so far using were already in Unicode ages ago, so effectively I'm still a decade and a half ago. Wenlin gives the Unicode for each character I use and when I take an interest I see they're in the range already specified. But certainly it would be useful on a major new book project to look at what the current ranges are (but only for traditional, I don't need simple and I guess much of the new ones are simple). The fact that my copy of Wenlin has the character means it's in the old range. I'm not sure what these new characters are. In Chinese, believe it or not, you can actually get a dictionary several inches thick of ancient characters no-one knows the meaning of! What a great idea for a dictionary.
Yes, certainly I make a character style for Chinese. Then that will become the style name of the exported span class.