Quote:
Originally Posted by kovidgoyal
The problem with wired is that the files are encoded in UTF8 but they specify the encoding as iso8859-1. You can try either
1) Contact wired
2) write a preprocess regexp that changes the specified encoding
Code:
(r'<meta http-equiv="Content-Type" content="text/html; charset=(\S+)"',
lambda match : match.group().replace(match.group(1), 'UTF-8'))
|
I see, I tried changing the wired.py to specify a iso8859-1 encoding, but this didn't fix the problem, the apostrophes are still funny...will keep hacking at it. Also tried searching for the exact hex sequence that is causing trouble and replacing it with a normal apostrophe without success:
(r'\xE2\x80\x99', lambda match: "'"),
Any ideas?
Dave