Quote:
Originally Posted by forceps
It works with Plucker Desktop 1.6.0.0 without replace CharsetMapping.py, Chinese character is correctly decoded.
|
There's a whole bunch of Chinese charsets and their names that are used in HTML documents. I swear that my PyPlucker didn't know about GBK. Probably, all pages you've tried are tagged as GB2312 or so.
The current (bad) behaviour of PyPlucker is to assume the record has default charset (i.e. "default for channel", if it was specified, or Latin-1, if it wasn't) if the charset of such record is not known (not in the PyPlucker's charset mapping table), so unpatched PyPlucker can produce an unreadable document for you when it faces a charset it doesn't know about--GBK, for example.
So you have been warned. ;-)
Quote:
Still have question -- Is it possible make it display chinese with plucker document coverted by Sunrise? I have tried but unfortunatly failed.
|
Read
this.