Quote:
Originally Posted by kovidgoyal
Replace word with "token"  Where token could even be "non space character"
For Japanese, that might not be enough. The same sequence of, say, three kanji might be one word or two, depending on the rest of the sentence. To count words, the program would have to understand Japanese. Otherwise it could only count individual kanji, which is more like counting letters.
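To illustrate the point, here is a rough Python sketch of the two naive counts being discussed. The function names and the sample strings are my own, made up for illustration; this is not how calibre or any other program actually counts.

```python
def count_whitespace_tokens(text: str) -> int:
    # "Token" = run of characters separated by whitespace.
    # str.split() with no arguments splits on any Unicode whitespace.
    return len(text.split())


def count_non_space_chars(text: str) -> int:
    # "Token" = every single non-space character.
    return sum(1 for ch in text if not ch.isspace())


english = "the quick brown fox"
japanese = "東京都に住んでいます"  # sample sentence, written without spaces

print(count_whitespace_tokens(english))   # 4
print(count_whitespace_tokens(japanese))  # 1
print(count_non_space_chars(english))     # 16
print(count_non_space_chars(japanese))    # 10
```

Neither number for the Japanese string is a word count: splitting on spaces sees the whole sentence as one token, and counting non-space characters counts kana and kanji one by one. Getting real words out would need a segmenter with a dictionary, which is the "understand Japanese" part.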