| 
			
			 | 
		#1 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 3 
				Karma: 10 
				Join Date: Nov 2013 
				Location: Bournemouth, UK 
				
				
				Device: Kindle, iPad 
				
				
				 | 
	
	
	
		
		
			
			 
				
				Bulk convert HTML characters for epub
			 
			
			
			Hi there, 
		
	
		
		
		
		
		
		
		
		
		
		
	
	I was wondering if anyone knows the best way to batch convert text from a doc file to HTML characters? E.g. for any instance of & to be converted to &? I’ve tried converting from a txt file through Calibre but I noticed that it didn’t take these into account. Any help would be greatly appreciated! Chris  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#2 | 
| 
			
			
			
			 Grand Sorcerer 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,763 
				Karma: 24088559 
				Join Date: Dec 2010 
				
				
				
				Device: Kindle PW2 
				
				
				 | 
	
	|
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#3 | 
| 
			
			
			
			 Well trained by Cats 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 31,267 
				Karma: 61916422 
				Join Date: Aug 2009 
				Location: The Central Coast of California 
				
				
				Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			IIRC Writer2EPUB handle this when doing a DOC to EPUB save
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#4 | 
| 
			
			
			
			 Wizard 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,520 
				Karma: 121692313 
				Join Date: Oct 2009 
				Location: Heemskerk, NL 
				
				
				Device: PRS-T1, Kobo Touch, Kobo Aura 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			You could also use my macro or add-in.
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#5 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 3 
				Karma: 10 
				Join Date: Nov 2013 
				Location: Bournemouth, UK 
				
				
				Device: Kindle, iPad 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Thank you very much for your responses! 
		
	
		
		
		
		
		
		
		
		
		
		
	
	I’ve used the Sigil option as I presume that Toxaris, your plugin won’t work on mac? It does convert characters such as & to & however it doesn’t seem to convert characters: “ ( “ ) ’ ( ’ ) – ( – ) Are these essential for text in pubs or do they not need to be converted? Thank you.  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#6 | 
| 
			
			
			
			 Grand Sorcerer 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,763 
				Karma: 24088559 
				Join Date: Dec 2010 
				
				
				
				Device: Kindle PW2 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			AFAIK, only the five pre-defined XML entities (&, <, >, " and ') need to be converted; all other named HTML entities are pre-defined in the xhtml standard.
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#7 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 3 
				Karma: 10 
				Join Date: Nov 2013 
				Location: Bournemouth, UK 
				
				
				Device: Kindle, iPad 
				
				
				 | 
	
	|
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#8 | |
| 
			
			
			
			 Wizard 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,520 
				Karma: 121692313 
				Join Date: Oct 2009 
				Location: Heemskerk, NL 
				
				
				Device: PRS-T1, Kobo Touch, Kobo Aura 
				
				
				 | 
	
	
	
		
		
		
		
		 Quote: 
	
  | 
|
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#9 | |
| 
			
			
			
			 Lector minore 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 661 
				Karma: 1738720 
				Join Date: Jan 2008 
				
				
				
				Device: Aura One, Paperwhite Signature 
				
				
				 | 
	
	
	
		
		
		
		
		 Quote: 
	
 Thanks!  | 
|
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#10 | |
| 
			
			
			
			 Grand Sorcerer 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,763 
				Karma: 24088559 
				Join Date: Dec 2010 
				
				
				
				Device: Kindle PW2 
				
				
				 | 
	
	
	
		
		
		
		
		 Quote: 
	
  | 
|
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#11 | |
| 
			
			
			
			 Well trained by Cats 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 31,267 
				Karma: 61916422 
				Join Date: Aug 2009 
				Location: The Central Coast of California 
				
				
				Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A 
				
				
				 | 
	
	
	
		
		
		
		
		 Quote: 
	
   If using SigilUse the Omega sign button tool ,to insert special characters, includes many not on the keyboard Last edited by theducks; 11-28-2013 at 03:15 PM. Reason: Sigil note  | 
|
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#12 | 
| 
			
			
			
			 Lector minore 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 661 
				Karma: 1738720 
				Join Date: Jan 2008 
				
				
				
				Device: Aura One, Paperwhite Signature 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Oh OK.  It's obvious why angle brackets need to be escaped, but I have apostrophes, quotes and maybe even ampersands all over the place in HTML and never realized they might cause a problem.  Thanks.
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#13 | 
| 
			
			
			
			 frumious Bandersnatch 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,570 
				Karma: 20150435 
				Join Date: Jan 2008 
				Location: Spaniard in Sweden 
				
				
				Device: Cybook Orizon, Kobo Aura 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Quotes and apostrophes I believe only have to escaped when they are used in some attribute value, as in <h1 title="How to be "smart"">, otherwise they are fine in HTML. Ampersands must be escaped always
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#14 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 8 
				Karma: 10 
				Join Date: Dec 2013 
				
				
				
				Device: none 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			This online WYSIWYG html5 compliant editor also automatically converts special characters (like vowels with umlaut, etc) into name references (HTML entities).  
		
	
		
		
		
		
		
		
		
		
		
		
	
	http://htmleditor.in/index.html Paste your code in it whil it's in source mode, turn it to visual mode and back to source mode and they will be converted.  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
![]()  | 
            
        
    
| Tags | 
| html characters | 
| Thread Tools | Search this Thread | 
            
  | 
    
			 
			Similar Threads
		 | 
	||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| ¿Convert unicode decomposed characters to unique/normal characters? | JohnQwerty | Calibre | 3 | 04-05-2012 01:08 PM | 
| HTML to Epub conversion dosn`t work because special characters | eLit | Conversion | 2 | 08-29-2011 03:01 AM | 
| Convert epub to HTML | MShroff | ePub | 6 | 06-19-2011 06:52 PM | 
| html 2 epub will not convert | Amalthia | Calibre | 2 | 06-04-2010 01:39 PM | 
| Convert html to epub | colly | Calibre | 9 | 03-10-2010 11:30 AM |