![]() |
#1 |
Junior Member
![]() Posts: 4
Karma: 10
Join Date: Jul 2010
Device: Kindle dx
|
Conversion of HTML to UTF-8
I'm trying to convert a few CHM files with iso-8859-1 format. I'm getting some characters being converted incorrectly: for example:
instead of '-' I get 'Â' I've attached a text file with a few examples of the characters. At first I thought it was something to do with the chm conversion, but I extracted the underlying HTML files and I got similar results. I've tried explicitly setting the "input character encoding" to iso-8859-1 and it didn't help. I also tried setting it on the html to zip plug in, to no avail. Any ideas? |
![]() |
![]() |
![]() |
#2 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,268
Karma: 27111060
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
convert the html directly and specify the correct character encoding.
|
![]() |
![]() |
Advert | |
|
![]() |
#3 |
Junior Member
![]() Posts: 4
Karma: 10
Join Date: Jul 2010
Device: Kindle dx
|
Sorry if I wasn't clear in my original post, but that's what I tried - I extracted the html and tried to convert and specified the correct character encoding. I still get things like: Â littered through-out.
Thanks |
![]() |
![]() |
![]() |
#4 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,268
Karma: 27111060
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
In that case your html isnt utf-8
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
conversion TO html | in_the_fade | Calibre | 4 | 04-29-2010 10:51 AM |
HTML Conversion Problem | bigtymer | Calibre | 7 | 01-14-2010 08:15 PM |
HTML to TXT conversion | alkr | Calibre | 3 | 10-02-2009 09:54 AM |
amazon html conversion | pan2 | Amazon Kindle | 3 | 03-21-2009 06:44 PM |