11-28-2011, 10:43 AM | #1 |
Enthusiast
Posts: 28
Karma: 10
Join Date: May 2010
Location: Stockholm
Device: iPhone, iPad, Nook, Bookeen, Sony Reader
|
Convert Ascii to UTF char
Hi all,
I have html-files with a bunch of ASCII-signs inside. Like so: Code:
<?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN" "http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd"> <html xmlns="http://www.w3.org/1999/xhtml" xml:lang="sv"> <head> <title>De andra</title> <link rel="stylesheet" href="Styles.css" type="text/css" /> <link rel="stylesheet" type="application/vnd.adobe-page-template+xml" href="page-template.xpgt" /> </head> <body> <div class="booksection"> <h1 id="ch001"><a id="page_011"></a>Molly Beslutet</h1> <p class="noindent_j1">När Molly vaknade sträckte hon ut ena armen mot den andra kudden. Den var lika tom som den varit det senaste halvåret. Ingen kind att smeka, ingen kropp att krypa intill. Pelle fanns helt enkelt inte där.</p> <p class="indent_j">Hon satte sig upp och släppte ner fötterna i fårskinnsfällen. Den mjuka, lockiga känslan fick hennes kropp att långsamt vakna. Hon tog ett par steg fram till fönstret, öppnade det och drog försiktigt in den kalla luften i lungorna. Även om vintern höll på att släppa sitt grepp och det mesta av snön hade smält undan var morgnarna fortfarande svartmålade. Molly huttrade och drog igen fönstret.</p> <p class="indent_j">I köket slängde hon några vedklampar i spisen och kaminen. Det kändes som om hon inte hade gjort något annat den sista tiden än huggit ved och eldat upp den igen.</p> ål = å |
11-28-2011, 12:37 PM | #2 |
frumious Bandersnatch
Posts: 7,515
Karma: 18512745
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
|
In linux, there is a small program called "recode":
recode html..utf8 file.html (it will also change all &, < and > to &, < and >, though) I'm sure any decent HTML editor will have an option for that. By the way, that way of coding characters is not "ascii", but numeric character references. |
Advert | |
|
11-28-2011, 12:57 PM | #3 |
Wizard
Posts: 2,251
Karma: 3720310
Join Date: Jan 2009
Location: USA
Device: Kindle, iPad (not used much for reading)
|
Also called HTML Entities.
|
11-28-2011, 01:08 PM | #4 |
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
You could try Notepad++.
|
11-29-2011, 06:11 AM | #5 |
frumious Bandersnatch
Posts: 7,515
Karma: 18512745
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
|
|
Advert | |
|
11-29-2011, 04:32 PM | #6 |
Grand Sorcerer
Posts: 5,583
Karma: 22735033
Join Date: Dec 2010
Device: Kindle PW2
|
Sigil does this automatically, if you add an .html file to a project. However, it'll also run HTMLTidy and will consolidate style elements, if present.
Last edited by Doitsu; 11-30-2011 at 02:56 AM. |
12-02-2011, 03:36 AM | #7 |
Enthusiast
Posts: 28
Karma: 10
Join Date: May 2010
Location: Stockholm
Device: iPhone, iPad, Nook, Bookeen, Sony Reader
|
Thanks a lot for all the different answers!
I'm running Oxygen XML so just went Unescape Selection. But, I'll be sure to refer my colleges to this thread. |
Tags |
ascii, epub, utf-8 |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Convert Chinese UTF-8 TXT file into ePub?? | C.Jones81 | Calibre | 4 | 12-05-2010 06:32 AM |
Metadata Plugboard - First Char of each word in Series | MikeP1212 | Calibre | 2 | 10-14-2010 06:14 PM |
255 Char limit question | jerrywojo | Calibre | 3 | 07-10-2010 07:15 PM |
50 char limit? | BrianG | Calibre | 2 | 01-25-2010 10:15 AM |