Conversion of HTML to UTF-8
I'm trying to convert a few CHM files with iso-8859-1 format. I'm getting some characters being converted incorrectly: for example:
instead of '-' I get 'Â'
I've attached a text file with a few examples of the characters.
At first I thought it was something to do with the chm conversion, but I extracted the underlying HTML files and I got similar results.
I've tried explicitly setting the "input character encoding" to iso-8859-1 and it didn't help. I also tried setting it on the html to zip plug in, to no avail.
Any ideas?
|