View Single Post
Old 09-19-2010, 05:06 PM   #1
lippy
Junior Member
lippy began at the beginning.
 
Posts: 4
Karma: 10
Join Date: Jul 2010
Device: Kindle dx
Conversion of HTML to UTF-8

I'm trying to convert a few CHM files with iso-8859-1 format. I'm getting some characters being converted incorrectly: for example:

instead of '-' I get 'Â'

I've attached a text file with a few examples of the characters.

At first I thought it was something to do with the chm conversion, but I extracted the underlying HTML files and I got similar results.

I've tried explicitly setting the "input character encoding" to iso-8859-1 and it didn't help. I also tried setting it on the html to zip plug in, to no avail.

Any ideas?
Attached Files
File Type: txt Incorrectly converted chars.txt (56 Bytes, 255 views)
lippy is offline   Reply With Quote