View Single Post
Old 07-20-2022, 11:23 PM   #242
bookman156
Addict
bookman156 ought to be getting tired of karma fortunes by now.bookman156 ought to be getting tired of karma fortunes by now.bookman156 ought to be getting tired of karma fortunes by now.bookman156 ought to be getting tired of karma fortunes by now.bookman156 ought to be getting tired of karma fortunes by now.bookman156 ought to be getting tired of karma fortunes by now.bookman156 ought to be getting tired of karma fortunes by now.bookman156 ought to be getting tired of karma fortunes by now.bookman156 ought to be getting tired of karma fortunes by now.bookman156 ought to be getting tired of karma fortunes by now.bookman156 ought to be getting tired of karma fortunes by now.
 
Posts: 368
Karma: 1000000
Join Date: Mar 2016
Device: none
Quote:
and that would tag all Chinese words in a single shot.
Wow, I was wondering whether there was a good way to do that. But presumably it would take each character of a phrase rather than the whole phrase.

Quote:
Do you speak or read Chinese?
I read traditional Chinese, with some help from a dictionary or the Wenlin program, which I use for typesetting Chinese.

Quote:
Did I tag the Simplified/Traditional Chinese characters properly?
Here, I did notice this:

Code:
Liu E, also known as Liu Tieyun <span class="japanese" lang="ja" xml:lang="ja">劉鐵雲</span>, was born in 1857 at Liuhe <span class="japanese" lang="ja" xml:lang="ja">六合</span> county in what is today Nanjing <span class="japanese" lang="ja" xml:lang="ja">南京</span>.
you're tagging traditional Chinese as Japanese.

As a general rule, an English paper about ancient China or history with Chinese characters will be be traditional Chinese.

Quote:
Does my sample EPUB characters match the PDF?
I looked at the EPUB, you're tagging it as Hans when it should be Hant. Sometimes simple and traditional characters are the same, but I didn't check them. It all looks like traditional Chinese to me. For instance take li 禮, which you have in there, that's traditional, the simple is 礼. So you're getting traditional Chinese even though you've tagged it as simple. That's because you're using the right Unicode character. Certainly the Chinese that I looked at was correct. Oddly enough the paper was on a subject I have written on myself, so that was kinda interesting.
bookman156 is offline   Reply With Quote