04-15-2006, 09:26 AM   #3
kostix
Vade-Mecum developer
Posts: 6
Karma: 10
Join Date: Apr 2006
Location: Saint-Petersburg, Russia
Device: LOOX 420, Palm m500
Quote:
Originally Posted by forceps
It works with Plucker Desktop 1.6.0.0 without replacing CharsetMapping.py; Chinese characters are decoded correctly.
There's a whole bunch of Chinese charsets, and of names for them, used in HTML documents. I swear that my PyPlucker didn't know about GBK. Probably all the pages you've tried are tagged as GB2312 or similar.

The current (bad) behaviour of PyPlucker is this: if a record's charset is not known (that is, not in PyPlucker's charset mapping table), it assumes the record uses the default charset, which is the "default for channel" if one was specified, or Latin-1 if it wasn't. So an unpatched PyPlucker can produce an unreadable document for you when it meets a charset it doesn't know about, GBK for example.
So you have been warned. ;-)
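
To make the failure mode concrete, here is a minimal sketch of that kind of fallback. It is not PyPlucker's actual code: `KNOWN_CHARSETS` and `pick_codec` are made-up names for illustration, and the table contents are an assumption standing in for whatever CharsetMapping.py really holds.

```python
# Hypothetical stand-in for PyPlucker's charset mapping table; the real
# table lives in CharsetMapping.py and is not reproduced here.
KNOWN_CHARSETS = {"gb2312": "gb2312", "utf-8": "utf-8", "iso-8859-1": "latin-1"}


def pick_codec(declared, channel_default=None):
    """Return the codec used to decode a record.

    Mimics the fallback described above: an unknown charset is silently
    replaced by the channel default, or by Latin-1 if there is none.
    """
    codec = KNOWN_CHARSETS.get(declared.lower())
    if codec is not None:
        return codec
    return channel_default or "latin-1"


text = "\u4e2d\u6587"      # the two characters of "Chinese (language)"
raw = text.encode("gbk")   # bytes as a GBK-tagged page would deliver them

# Tagged as GB2312: the table knows it, so the text round-trips.
ok = raw.decode(pick_codec("GB2312"))

# Tagged as GBK: not in the table, so the bytes are read as Latin-1.
garbled = raw.decode(pick_codec("GBK"))

print(ok == text)   # True
print(garbled)      # four accented Latin letters instead of two hanzi
```

Note that the Latin-1 decode never raises an error, because every byte value is a valid Latin-1 character. That is why the conversion appears to succeed and the damage only shows up when you read the document.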

Quote:
I still have a question: is it possible to display Chinese in a Plucker document converted by Sunrise? I have tried but unfortunately failed.
Read this.

Last edited by kostix; 04-16-2006 at 06:25 AM.