![]() |
#1 |
Junior Member
![]() Posts: 3
Karma: 10
Join Date: Nov 2013
Location: Bournemouth, UK
Device: Kindle, iPad
|
Bulk convert HTML characters for epub
Hi there,
I was wondering if anyone knows the best way to batch convert text from a doc file to HTML characters? E.g. for any instance of & to be converted to &? I’ve tried converting from a txt file through Calibre but I noticed that it didn’t take these into account. Any help would be greatly appreciated! Chris |
![]() |
![]() |
![]() |
#2 |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,680
Karma: 23983815
Join Date: Dec 2010
Device: Kindle PW2
|
|
![]() |
![]() |
Advert | |
|
![]() |
#3 |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 30,891
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
IIRC Writer2EPUB handle this when doing a DOC to EPUB save
|
![]() |
![]() |
![]() |
#4 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
You could also use my macro or add-in.
|
![]() |
![]() |
![]() |
#5 |
Junior Member
![]() Posts: 3
Karma: 10
Join Date: Nov 2013
Location: Bournemouth, UK
Device: Kindle, iPad
|
Thank you very much for your responses!
I’ve used the Sigil option as I presume that Toxaris, your plugin won’t work on mac? It does convert characters such as & to & however it doesn’t seem to convert characters: “ ( “ ) ’ ( ’ ) – ( – ) Are these essential for text in pubs or do they not need to be converted? Thank you. |
![]() |
![]() |
Advert | |
|
![]() |
#6 |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,680
Karma: 23983815
Join Date: Dec 2010
Device: Kindle PW2
|
AFAIK, only the five pre-defined XML entities (&, <, >, " and ') need to be converted; all other named HTML entities are pre-defined in the xhtml standard.
|
![]() |
![]() |
![]() |
#7 |
Junior Member
![]() Posts: 3
Karma: 10
Join Date: Nov 2013
Location: Bournemouth, UK
Device: Kindle, iPad
|
|
![]() |
![]() |
![]() |
#8 | |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
Quote:
|
|
![]() |
![]() |
![]() |
#9 | |
Lector minore
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 660
Karma: 1738720
Join Date: Jan 2008
Device: Aura One, Paperwhite Signature
|
Quote:
Thanks! |
|
![]() |
![]() |
![]() |
#10 | |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,680
Karma: 23983815
Join Date: Dec 2010
Device: Kindle PW2
|
Quote:
|
|
![]() |
![]() |
![]() |
#11 | |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 30,891
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
![]() Use the Omega sign button tool ,to insert special characters, includes many not on the keyboard Last edited by theducks; 11-28-2013 at 02:15 PM. Reason: Sigil note |
|
![]() |
![]() |
![]() |
#12 |
Lector minore
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 660
Karma: 1738720
Join Date: Jan 2008
Device: Aura One, Paperwhite Signature
|
Oh OK. It's obvious why angle brackets need to be escaped, but I have apostrophes, quotes and maybe even ampersands all over the place in HTML and never realized they might cause a problem. Thanks.
|
![]() |
![]() |
![]() |
#13 |
frumious Bandersnatch
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,543
Karma: 19001583
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
|
Quotes and apostrophes I believe only have to escaped when they are used in some attribute value, as in <h1 title="How to be "smart"">, otherwise they are fine in HTML. Ampersands must be escaped always
|
![]() |
![]() |
![]() |
#14 |
Junior Member
![]() Posts: 8
Karma: 10
Join Date: Dec 2013
Device: none
|
This online WYSIWYG html5 compliant editor also automatically converts special characters (like vowels with umlaut, etc) into name references (HTML entities).
http://htmleditor.in/index.html Paste your code in it whil it's in source mode, turn it to visual mode and back to source mode and they will be converted. |
![]() |
![]() |
![]() |
Tags |
html characters |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
¿Convert unicode decomposed characters to unique/normal characters? | JohnQwerty | Calibre | 3 | 04-05-2012 12:08 PM |
HTML to Epub conversion dosn`t work because special characters | eLit | Conversion | 2 | 08-29-2011 02:01 AM |
Convert epub to HTML | MShroff | ePub | 6 | 06-19-2011 05:52 PM |
html 2 epub will not convert | Amalthia | Calibre | 2 | 06-04-2010 12:39 PM |
Convert html to epub | colly | Calibre | 9 | 03-10-2010 10:30 AM |