|  05-15-2012, 03:47 PM | #1 | 
| Zealot            Posts: 101 Karma: 202774 Join Date: Apr 2012 Device: none |  Getting rid of MsoNormal 
			
			Hello I am working on a book in Sigil. Converted it from Word to html (Save as Web page, filtered). Still I have tons of <p class="MsoNormal"> tags lying around. Should I leave them or remove them? And can I remove them next time even before importing to Sigil? | 
|   |   | 
|  05-15-2012, 03:52 PM | #2 | 
| Wizard            Posts: 4,520 Karma: 121692313 Join Date: Oct 2009 Location: Heemskerk, NL Device: PRS-T1, Kobo Touch, Kobo Aura | 
			
			You should definitively clean it up. Search the forum, there are several topics about this.
		 | 
|   |   | 
| Advert | |
|  | 
|  05-15-2012, 04:12 PM | #3 | 
| Resident Curmudgeon            Posts: 80,727 Karma: 150249619 Join Date: Nov 2006 Location: Roslindale, Massachusetts Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 | 
			
			Look in the CSS and see what MsoNormal is. Clean that up. Then you'll know if you can remove it.
		 | 
|   |   | 
|  05-15-2012, 05:08 PM | #4 | 
| Wanderer            Posts: 106 Karma: 472218 Join Date: Jan 2011 Device: Kindle 3, PaperWhite 2 | 
			
			MsoNormal is Word's normal or standard paragraph.  You can leave them there if you want, but I like to clean them up to work with my stylesheet.  You can do a simple search and replace. Search for <p class="MsoNormal"> and replace it with <p>. Bob | 
|   |   | 
|  05-15-2012, 05:48 PM | #5 | 
| Resident Curmudgeon            Posts: 80,727 Karma: 150249619 Join Date: Nov 2006 Location: Roslindale, Massachusetts Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 | 
			
			You also then have to set the p style in the CSS to be what you want.
		 | 
|   |   | 
| Advert | |
|  | 
|  05-16-2012, 04:08 AM | #6 | 
| Zealot            Posts: 101 Karma: 202774 Join Date: Apr 2012 Device: none | |
|   |   | 
|  05-16-2012, 07:35 AM | #7 | 
| Color me gone            Posts: 2,089 Karma: 1445295 Join Date: Apr 2008 Location: Central Oregon Coast Device: PRS-300 | 
			
			MsoNormal should be somewhere other than in the text, if it referred to in the text.  Do a search for it and see if it at the top of every chapter, instead of the stylesheet.
		 | 
|   |   | 
|  05-16-2012, 10:11 AM | #8 | 
| Wanderer            Posts: 106 Karma: 472218 Join Date: Jan 2011 Device: Kindle 3, PaperWhite 2 | 
			
			This is one issue with saving a Word doc as a filtered web page.  It styles everything, and includes all styles in the document, whether they are used or not. Word doesn't create a css file, it puts the css styles in the head of the document. In Word I make sure that the styles I want in the final book are set up and used. When I import the html file into Sigil, the first thing I do is delete everything within the style tags in the head. Then replace <p class="MsoNormal"> with <p>. The style sheet used in Sigil should match the styles in the Word doc. Now that stylesheet will control the styles and text in the file you imported. Bob | 
|   |   | 
|  05-16-2012, 10:36 AM | #9 | 
| Author, Pulblisher            Posts: 28 Karma: 29654 Join Date: Mar 2012 Location: Hertford, NC Device: Kindle | 
			
			Personally I'd either set the styles up in Word to be the way you want them and leave it, or not do HTML pages out of Word...plenty of WYSIWYG HTML editors out there that will do just fine and never add in the offending code to start with.
		 | 
|   |   | 
|  05-16-2012, 11:12 AM | #10 | |
| Zealot            Posts: 101 Karma: 202774 Join Date: Apr 2012 Device: none | Quote: 
 I guess this is the head of the ENTIRE document (?). | |
|   |   | 
|  05-16-2012, 11:14 AM | #11 | |
| Zealot            Posts: 101 Karma: 202774 Join Date: Apr 2012 Device: none | Quote: 
 But if you don´t do HTML from Word, what do you start out with then? In what form is your manuscript? Or you mean one can just copy a word file into a HTML editor? Is that possible? | |
|   |   | 
|  05-16-2012, 11:38 AM | #12 | 
| Guru            Posts: 698 Karma: 150000 Join Date: Feb 2010 Device: none | 
			
			When I want to go from a Word doc to an epub as directly as possible, I open the Word doc with LibreOffice, and export an epub using either Luke Calcinai's writer2epub, or the Writer2xhtml extensions. The results usually need some further tweaking in Sigil, however.
		 | 
|   |   | 
|  05-16-2012, 01:50 PM | #13 | 
| Wizard            Posts: 4,520 Karma: 121692313 Join Date: Oct 2009 Location: Heemskerk, NL Device: PRS-T1, Kobo Touch, Kobo Aura | 
			
			You could also try my 'clean' HTML macro for Word. That produces clean code and you can use Styles in Word. You still need to create the stylesheet yourself though. I thought about adding a stylesheet exporter, but since I don't need it and I created the macro for myself...
		 | 
|   |   | 
|  05-17-2012, 03:02 AM | #14 | 
| Bookmaker & Cat Slave            Posts: 11,503 Karma: 158448243 Join Date: Apr 2010 Location: Phoenix, AZ Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2 | 
			
			I can vouch for Toxie's "clean html" macro, works very well for those of you struggling with cleaning Word-->exported to filtered html. Hitch | 
|   |   | 
|  05-17-2012, 10:36 AM | #15 | |
| Wanderer            Posts: 106 Karma: 472218 Join Date: Jan 2011 Device: Kindle 3, PaperWhite 2 | Quote: 
 Code: <html> <head> <meta http-equiv=Content-Type content="text/html; charset=unicode"> <meta name=Generator content="Microsoft Word 12 (filtered)"> <style> <!-- /* Font Definitions */ . . . A few hundred lines of styles . . . --> </style> </head> <body lang=EN-US link=blue vlink=purple> Bob | |
|   |   | 
|  | 
| 
 | 
|  Similar Threads | ||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| Getting rid of a blank page | iroquois | Sigil | 4 | 04-01-2012 12:33 AM | 
| Is there a way to get rid of this annoyance? | TonytheBookworm | Amazon Kindle | 7 | 12-26-2010 08:59 PM | 
| get rid of all tags - how ? | cybmole | Calibre | 4 | 09-29-2010 08:50 AM | 
| hi, i am currently getting rid of the | russellmz00 | Introduce Yourself | 6 | 05-25-2010 01:42 PM | 
| Just to get rid of the message | pshrynk | Introduce Yourself | 10 | 04-17-2009 01:47 AM |