11-14-2013, 11:34 AM | #1 |
Junior Member
Posts: 3
Karma: 10
Join Date: Nov 2013
Location: Bournemouth, UK
Device: Kindle, iPad
|
Bulk convert HTML characters for epub
Hi there,
I was wondering if anyone knows the best way to batch convert text from a doc file to HTML characters? E.g. for any instance of & to be converted to &? I’ve tried converting from a txt file through Calibre but I noticed that it didn’t take these into account. Any help would be greatly appreciated! Chris |
11-14-2013, 12:00 PM | #2 |
Grand Sorcerer
Posts: 5,582
Karma: 22735033
Join Date: Dec 2010
Device: Kindle PW2
|
|
11-14-2013, 12:10 PM | #3 |
Well trained by Cats
Posts: 29,772
Karma: 54401244
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
IIRC Writer2EPUB handle this when doing a DOC to EPUB save
|
11-14-2013, 02:11 PM | #4 |
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
You could also use my macro or add-in.
|
11-15-2013, 05:18 AM | #5 |
Junior Member
Posts: 3
Karma: 10
Join Date: Nov 2013
Location: Bournemouth, UK
Device: Kindle, iPad
|
Thank you very much for your responses!
I’ve used the Sigil option as I presume that Toxaris, your plugin won’t work on mac? It does convert characters such as & to & however it doesn’t seem to convert characters: “ ( “ ) ’ ( ’ ) – ( – ) Are these essential for text in pubs or do they not need to be converted? Thank you. |
11-15-2013, 05:34 AM | #6 |
Grand Sorcerer
Posts: 5,582
Karma: 22735033
Join Date: Dec 2010
Device: Kindle PW2
|
AFAIK, only the five pre-defined XML entities (&, <, >, " and ') need to be converted; all other named HTML entities are pre-defined in the xhtml standard.
|
11-15-2013, 06:27 AM | #7 |
Junior Member
Posts: 3
Karma: 10
Join Date: Nov 2013
Location: Bournemouth, UK
Device: Kindle, iPad
|
|
11-15-2013, 07:50 AM | #8 | |
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
Quote:
|
|
11-28-2013, 10:49 AM | #9 | |
Lector minore
Posts: 649
Karma: 1738720
Join Date: Jan 2008
Device: Aura One, Samsung Galaxy Tab S5e, Google Pixel Slate
|
Quote:
Thanks! |
|
11-28-2013, 11:13 AM | #10 | |
Grand Sorcerer
Posts: 5,582
Karma: 22735033
Join Date: Dec 2010
Device: Kindle PW2
|
Quote:
|
|
11-28-2013, 02:14 PM | #11 | |
Well trained by Cats
Posts: 29,772
Karma: 54401244
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
Use the Omega sign button tool ,to insert special characters, includes many not on the keyboard Last edited by theducks; 11-28-2013 at 02:15 PM. Reason: Sigil note |
|
11-29-2013, 11:00 AM | #12 |
Lector minore
Posts: 649
Karma: 1738720
Join Date: Jan 2008
Device: Aura One, Samsung Galaxy Tab S5e, Google Pixel Slate
|
Oh OK. It's obvious why angle brackets need to be escaped, but I have apostrophes, quotes and maybe even ampersands all over the place in HTML and never realized they might cause a problem. Thanks.
|
11-29-2013, 11:13 AM | #13 |
frumious Bandersnatch
Posts: 7,515
Karma: 18512745
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
|
Quotes and apostrophes I believe only have to escaped when they are used in some attribute value, as in <h1 title="How to be "smart"">, otherwise they are fine in HTML. Ampersands must be escaped always
|
12-02-2013, 12:23 PM | #14 |
Junior Member
Posts: 8
Karma: 10
Join Date: Dec 2013
Device: none
|
This online WYSIWYG html5 compliant editor also automatically converts special characters (like vowels with umlaut, etc) into name references (HTML entities).
http://htmleditor.in/index.html Paste your code in it whil it's in source mode, turn it to visual mode and back to source mode and they will be converted. |
Tags |
html characters |
Thread Tools | Search this Thread |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
¿Convert unicode decomposed characters to unique/normal characters? | JohnQwerty | Calibre | 3 | 04-05-2012 12:08 PM |
HTML to Epub conversion dosn`t work because special characters | eLit | Conversion | 2 | 08-29-2011 02:01 AM |
Convert epub to HTML | MShroff | ePub | 6 | 06-19-2011 05:52 PM |
html 2 epub will not convert | Amalthia | Calibre | 2 | 06-04-2010 12:39 PM |
Convert html to epub | colly | Calibre | 9 | 03-10-2010 10:30 AM |