#1 |
|
Junior Member
Posts: 3
Karma: 10
Join Date: Nov 2013
Location: Bournemouth, UK
Device: Kindle, iPad
|
Bulk convert HTML characters for epub
Hi there,
I was wondering if anyone knows the best way to batch convert text from a doc file to HTML characters? E.g. for any instance of & to be converted to &amp;? I’ve tried converting from a txt file through Calibre but I noticed that it didn’t take these into account. Any help would be greatly appreciated!
Chris
#2 |
|
Grand Sorcerer
Posts: 5,762
Karma: 24088559
Join Date: Dec 2010
Device: Kindle PW2
#3 |
|
Well trained by Cats
Posts: 31,240
Karma: 61360164
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
IIRC Writer2EPUB handles this when doing a DOC to EPUB save.
#4 |
|
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
You could also use my macro or add-in.
#5 |
|
Junior Member
Posts: 3
Karma: 10
Join Date: Nov 2013
Location: Bournemouth, UK
Device: Kindle, iPad
|
Thank you very much for your responses!
I’ve used the Sigil option, as I presume, Toxaris, that your plugin won’t work on Mac? It does convert characters such as & to &amp;, however it doesn’t seem to convert these characters: “ ( &ldquo; ) ’ ( &rsquo; ) – ( &ndash; ). Are these essential for text in epubs or do they not need to be converted? Thank you.
#6 |
|
Grand Sorcerer
Posts: 5,762
Karma: 24088559
Join Date: Dec 2010
Device: Kindle PW2
|
AFAIK, only the characters behind the five pre-defined XML entities (&amp;, &lt;, &gt;, &quot; and &apos;, i.e. &, <, >, " and ') need to be converted; all other named HTML entities are pre-defined in the XHTML standard.
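For anyone who would rather script this than rely on an editor, here is a minimal Python sketch of the idea above. It assumes the input is plain text (not markup that already contains entities, which would get escaped twice); the helper name `escape_xml_text` is made up for illustration.

```python
from xml.sax.saxutils import escape

# escape() already handles &, < and >; these two extra mappings cover the quotes.
QUOTE_ENTITIES = {'"': "&quot;", "'": "&apos;"}

def escape_xml_text(text: str) -> str:
    """Replace the five XML-special characters with their pre-defined entities."""
    return escape(text, QUOTE_ENTITIES)

sample = 'Fish & chips <cheap> "today"'
print(escape_xml_text(sample))
# Fish &amp; chips &lt;cheap&gt; &quot;today&quot;
```

Curly quotes and dashes pass through unchanged, which matches the point that only these five characters need converting.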
#7 |
|
Junior Member
Posts: 3
Karma: 10
Join Date: Nov 2013
Location: Bournemouth, UK
Device: Kindle, iPad
#8 | |
|
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
Quote:
#9 | |
|
Lector minore
Posts: 660
Karma: 1738720
Join Date: Jan 2008
Device: Aura One, Paperwhite Signature
|
Quote:
Thanks!
#10 | |
|
Grand Sorcerer
Posts: 5,762
Karma: 24088559
Join Date: Dec 2010
Device: Kindle PW2
|
Quote:
#11 | |
|
Well trained by Cats
Posts: 31,240
Karma: 61360164
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
If using Sigil: use the Omega sign button tool to insert special characters; it includes many that are not on the keyboard.
Last edited by theducks; 11-28-2013 at 02:15 PM. Reason: Sigil note
#12 |
|
Lector minore
Posts: 660
Karma: 1738720
Join Date: Jan 2008
Device: Aura One, Paperwhite Signature
|
Oh OK. It's obvious why angle brackets need to be escaped, but I have apostrophes, quotes and maybe even ampersands all over the place in HTML and never realized they might cause a problem. Thanks.
#13 |
|
frumious Bandersnatch
Posts: 7,570
Karma: 20150435
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
|
Quotes and apostrophes, I believe, only have to be escaped when they are used in some attribute value, as in <h1 title="How to be &quot;smart&quot;">; otherwise they are fine in HTML. Ampersands must always be escaped.
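The attribute-versus-text distinction can be sketched in Python with the standard `html.escape` function, which takes a `quote` flag for exactly this purpose. The `heading` helper below is hypothetical and only there to show the two cases side by side.

```python
import html

def heading(title: str, text: str) -> str:
    """Build an <h1> whose attribute and body are escaped differently."""
    # Attribute value: quotes must be escaped as well as &, < and >.
    attr = html.escape(title, quote=True)
    # Element text: only &, < and > are escaped; quotes are left alone.
    body = html.escape(text, quote=False)
    return f'<h1 title="{attr}">{body}</h1>'

print(heading('How to be "smart"', 'Q&A: "smart" reading'))
# <h1 title="How to be &quot;smart&quot;">Q&amp;A: "smart" reading</h1>
```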
#14 |
|
Junior Member
Posts: 8
Karma: 10
Join Date: Dec 2013
Device: none
|
This online WYSIWYG HTML5-compliant editor also automatically converts special characters (like vowels with umlauts, etc.) into named references (HTML entities):
http://htmleditor.in/index.html
Paste your code into it while it's in source mode, switch to visual mode and back to source mode, and they will be converted.
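If an offline route is preferred, a similar conversion can be approximated in Python with the standard `html.entities.codepoint2name` table. This is only a sketch, not how the editor above works internally; the function name `to_named_entities` is made up, and it deliberately leaves ASCII characters (including &, < and >) untouched.

```python
from html.entities import codepoint2name

def to_named_entities(text: str) -> str:
    """Replace non-ASCII characters that have an HTML entity name with &name;."""
    out = []
    for ch in text:
        name = codepoint2name.get(ord(ch))
        if ord(ch) > 127 and name:
            out.append(f"&{name};")
        else:
            # ASCII characters, and characters without a named entity, stay as-is.
            out.append(ch)
    return "".join(out)

print(to_named_entities("\u00fcber \u201cquoted\u201d \u2013 done"))
# &uuml;ber &ldquo;quoted&rdquo; &ndash; done
```

As noted earlier in the thread, this step is optional as long as the file is saved in an encoding that can represent these characters.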