|  05-16-2011, 02:37 PM | #1 | 
| Member  Posts: 21 Karma: 10 Join Date: May 2011 Device: iPad | 
				
				ePub validation in epubcheck 1.1
			 
			
			I have saved a Word 2011 (Mac) document as HTML, then converted with calibre to ePub. Subsequent validation check with epubcheck 1.1, supposedly required for submission to Apple's iBookstore, results in a report  of a huge number of errors. What is the likely source? Is it a Word formatting issue? a word to html issue? I'm baffled. Help, please.
		 | 
|   |   | 
|  05-16-2011, 02:43 PM | #2 | 
| Resident Curmudgeon            Posts: 80,671 Karma: 150249619 Join Date: Nov 2006 Location: Roslindale, Massachusetts Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 | 
			
			It is because Word puts in all kinds of garbage when you save as HTML. Even filtered HTML is not nice. You'll have to open the ePub and have a go at removing the mess that is Word.
		 | 
|   |   | 
| Advert | |
|  | 
|  05-16-2011, 02:56 PM | #3 | 
| Member  Posts: 21 Karma: 10 Join Date: May 2011 Device: iPad | 
			
			Thanks. But with over 400 error messages, that seems an almost insurmountable task. Is there another word processing program or environment in which this kind of thing might not occur? I can use Pages on the Mac which might be cleaner, I suppose. I'd welcome any ideas. Thanks again.
		 | 
|   |   | 
|  05-16-2011, 03:28 PM | #4 | 
| Wizard            Posts: 3,130 Karma: 91256 Join Date: Feb 2008 Location: Germany Device: Cybook Gen3 | 
			
			Try saving as RTF in Word and converting that.
		 | 
|   |   | 
|  05-16-2011, 04:02 PM | #5 | |
| Resident Curmudgeon            Posts: 80,671 Karma: 150249619 Join Date: Nov 2006 Location: Roslindale, Massachusetts Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 | Quote: 
 | |
|   |   | 
| Advert | |
|  | 
|  05-16-2011, 05:09 PM | #6 | |
| Well trained by Cats            Posts: 31,240 Karma: 61360164 Join Date: Aug 2009 Location: The Central Coast of California Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A | Quote: 
 learn to fix globally  Use Sigil (and Flightcrew that is built in). Common faults are failures to conform to the rules. eg id=## in Codeview REGEX Search: id="(\d+)" replace: id="b_\1" <- any letters are good once you validat one fix meets you eyeball chech, switch to All HTML files and replace all. Always keep a prior version backup for those  moments. | |
|   |   | 
|  05-17-2011, 05:02 PM | #7 | 
| Resident Curmudgeon            Posts: 80,671 Karma: 150249619 Join Date: Nov 2006 Location: Roslindale, Massachusetts Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 | 
			
			For me personally, I prefer using notepad++ and FlightCrew. I do need to update my copy of Sigil at some point. One major advantage of FlightCrew vs. ePubCheck is that FC's error messages are more human readable.
		 | 
|   |   | 
|  05-20-2011, 07:34 PM | #8 | 
| Wizard            Posts: 1,798 Karma: 30548723 Join Date: Dec 2006 Location: Singapore Device: Boyue | 
			
			try using libre office for doc to html conversion the html created by that is a lot cleaner than Microsoft office
		 | 
|   |   | 
|  05-20-2011, 07:43 PM | #9 | 
| Well trained by Cats            Posts: 31,240 Karma: 61360164 Join Date: Aug 2009 Location: The Central Coast of California Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A | |
|   |   | 
|  05-23-2011, 01:30 PM | #10 | 
| Member  Posts: 21 Karma: 10 Join Date: May 2011 Device: iPad | 
			
			Thank you all. I tried moving my book to Pages first. It exported to ePub cleanly, but I lost all my links, fonts, and appearance factors. The OpenOffice worked but left all my errors in place from MS Word. Finally, working with Sigil, I was able to eliminate all the errors in about an hour of work. It seems that Word's HTML output includes many attributes not allowed in XHTML, which is used by ePub. Sigil with onboard Flightcrew made it easy to delete them, and the results are excellent. Thanks again.
		 | 
|   |   | 
|  08-02-2011, 11:29 AM | #11 | 
| Member  Posts: 21 Karma: 10 Join Date: May 2011 Device: iPad | 
			
			Much later. I have had good success using Sigil with its built-in FlightCew but with one problem. Writing in Word for Mac 2011, then saving as HTML then converting with Calibre, then testing and editing in Sigil/Flightcrew almost always works. But sometimes , in Sigil, when I go to validate with FlightCrew I get an error message saying "An exception occurred during validation: std::exception" I have been unable to get an answer from anyone as to what this means. Does anyone know?. The validation always stops at this point and will not continue. | 
|   |   | 
|  08-02-2011, 04:23 PM | #12 | 
| Wizard            Posts: 3,130 Karma: 91256 Join Date: Feb 2008 Location: Germany Device: Cybook Gen3 | 
			
			There's a Sigil forum over here. You might have better luck asking there.
		 | 
|   |   | 
|  08-02-2011, 09:13 PM | #13 | 
| US Navy, Retired            Posts: 9,897 Karma: 13806776 Join Date: Feb 2009 Location: North Carolina Device: Icarus Illumina XL HD, Kindle PaperWhite SE 11th Gen | |
|   |   | 
|  08-02-2011, 09:28 PM | #14 | |
| Well trained by Cats            Posts: 31,240 Karma: 61360164 Join Date: Aug 2009 Location: The Central Coast of California Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A | Quote: 
 | |
|   |   | 
|  | 
| 
 | 
|  Similar Threads | ||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| Indesign CS5 to ePub: epubcheck error | gdgibson | ePub | 3 | 04-20-2011 01:26 AM | 
| Adding page breaks in Calibre breaks ePubcheck validation | bookraft | Conversion | 16 | 03-01-2011 01:23 PM | 
| Epub issues with Epubcheck | ematte | ePub | 13 | 10-30-2010 07:48 AM | 
| epub date error fails epubcheck 1.05 | dkata | Calibre | 2 | 09-13-2010 04:21 AM | 
| Web-based epubcheck upgraded to epubcheck 1.0.5 | kjk | ePub | 4 | 02-09-2010 09:53 PM |