|
|
#1 |
|
Junior Member
![]() Posts: 4
Karma: 10
Join Date: Jul 2010
Device: Kindle dx
|
Conversion of HTML to UTF-8
I'm trying to convert a few CHM files with iso-8859-1 format. I'm getting some characters being converted incorrectly: for example:
instead of '-' I get 'Â' I've attached a text file with a few examples of the characters. At first I thought it was something to do with the chm conversion, but I extracted the underlying HTML files and I got similar results. I've tried explicitly setting the "input character encoding" to iso-8859-1 and it didn't help. I also tried setting it on the html to zip plug in, to no avail. Any ideas? |
|
|
|
|
|
#2 |
|
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,617
Karma: 28549044
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
convert the html directly and specify the correct character encoding.
|
|
|
|
| Advert | |
|
|
|
|
#3 |
|
Junior Member
![]() Posts: 4
Karma: 10
Join Date: Jul 2010
Device: Kindle dx
|
Sorry if I wasn't clear in my original post, but that's what I tried - I extracted the html and tried to convert and specified the correct character encoding. I still get things like: Â littered through-out.
Thanks |
|
|
|
|
|
#4 |
|
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,617
Karma: 28549044
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
In that case your html isnt utf-8
|
|
|
|
![]() |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| conversion TO html | in_the_fade | Calibre | 4 | 04-29-2010 11:51 AM |
| HTML Conversion Problem | bigtymer | Calibre | 7 | 01-14-2010 09:15 PM |
| HTML to TXT conversion | alkr | Calibre | 3 | 10-02-2009 10:54 AM |
| amazon html conversion | pan2 | Amazon Kindle | 3 | 03-21-2009 07:44 PM |