05-15-2012, 03:47 PM | #1 |
Zealot
Posts: 101
Karma: 202774
Join Date: Apr 2012
Device: none
|
Getting rid of MsoNormal
Hello
I am working on a book in Sigil. Converted it from Word to html (Save as Web page, filtered). Still I have tons of <p class="MsoNormal"> tags lying around. Should I leave them or remove them? And can I remove them next time even before importing to Sigil? |
05-15-2012, 03:52 PM | #2 |
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
You should definitively clean it up. Search the forum, there are several topics about this.
|
05-15-2012, 04:12 PM | #3 |
Resident Curmudgeon
Posts: 73,896
Karma: 128597114
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Look in the CSS and see what MsoNormal is. Clean that up. Then you'll know if you can remove it.
|
05-15-2012, 05:08 PM | #4 |
Wanderer
Posts: 106
Karma: 472218
Join Date: Jan 2011
Device: Kindle 3, PaperWhite 2
|
MsoNormal is Word's normal or standard paragraph. You can leave them there if you want, but I like to clean them up to work with my stylesheet. You can do a simple search and replace.
Search for <p class="MsoNormal"> and replace it with <p>. Bob |
05-15-2012, 05:48 PM | #5 |
Resident Curmudgeon
Posts: 73,896
Karma: 128597114
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
You also then have to set the p style in the CSS to be what you want.
|
05-16-2012, 04:08 AM | #6 |
Zealot
Posts: 101
Karma: 202774
Join Date: Apr 2012
Device: none
|
|
05-16-2012, 07:35 AM | #7 |
Color me gone
Posts: 2,089
Karma: 1445295
Join Date: Apr 2008
Location: Central Oregon Coast
Device: PRS-300
|
MsoNormal should be somewhere other than in the text, if it referred to in the text. Do a search for it and see if it at the top of every chapter, instead of the stylesheet.
|
05-16-2012, 10:11 AM | #8 |
Wanderer
Posts: 106
Karma: 472218
Join Date: Jan 2011
Device: Kindle 3, PaperWhite 2
|
This is one issue with saving a Word doc as a filtered web page. It styles everything, and includes all styles in the document, whether they are used or not.
Word doesn't create a css file, it puts the css styles in the head of the document. In Word I make sure that the styles I want in the final book are set up and used. When I import the html file into Sigil, the first thing I do is delete everything within the style tags in the head. Then replace <p class="MsoNormal"> with <p>. The style sheet used in Sigil should match the styles in the Word doc. Now that stylesheet will control the styles and text in the file you imported. Bob |
05-16-2012, 10:36 AM | #9 |
Author, Pulblisher
Posts: 28
Karma: 29654
Join Date: Mar 2012
Location: Hertford, NC
Device: Kindle
|
Personally I'd either set the styles up in Word to be the way you want them and leave it, or not do HTML pages out of Word...plenty of WYSIWYG HTML editors out there that will do just fine and never add in the offending code to start with.
|
05-16-2012, 11:12 AM | #10 | |
Zealot
Posts: 101
Karma: 202774
Join Date: Apr 2012
Device: none
|
Quote:
I guess this is the head of the ENTIRE document (?). |
|
05-16-2012, 11:14 AM | #11 | |
Zealot
Posts: 101
Karma: 202774
Join Date: Apr 2012
Device: none
|
Quote:
But if you don´t do HTML from Word, what do you start out with then? In what form is your manuscript? Or you mean one can just copy a word file into a HTML editor? Is that possible? |
|
05-16-2012, 11:38 AM | #12 |
Guru
Posts: 696
Karma: 150000
Join Date: Feb 2010
Device: none
|
When I want to go from a Word doc to an epub as directly as possible, I open the Word doc with LibreOffice, and export an epub using either Luke Calcinai's writer2epub, or the Writer2xhtml extensions. The results usually need some further tweaking in Sigil, however.
|
05-16-2012, 01:50 PM | #13 |
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
You could also try my 'clean' HTML macro for Word. That produces clean code and you can use Styles in Word. You still need to create the stylesheet yourself though. I thought about adding a stylesheet exporter, but since I don't need it and I created the macro for myself...
|
05-17-2012, 03:02 AM | #14 |
Bookmaker & Cat Slave
Posts: 11,460
Karma: 158448243
Join Date: Apr 2010
Location: Phoenix, AZ
Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2
|
I can vouch for Toxie's "clean html" macro, works very well for those of you struggling with cleaning Word-->exported to filtered html.
Hitch |
05-17-2012, 10:36 AM | #15 | |
Wanderer
Posts: 106
Karma: 472218
Join Date: Jan 2011
Device: Kindle 3, PaperWhite 2
|
Quote:
Code:
<html> <head> <meta http-equiv=Content-Type content="text/html; charset=unicode"> <meta name=Generator content="Microsoft Word 12 (filtered)"> <style> <!-- /* Font Definitions */ . . . A few hundred lines of styles . . . --> </style> </head> <body lang=EN-US link=blue vlink=purple> Bob |
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Getting rid of a blank page | iroquois | Sigil | 4 | 04-01-2012 12:33 AM |
Is there a way to get rid of this annoyance? | TonytheBookworm | Amazon Kindle | 7 | 12-26-2010 08:59 PM |
get rid of all tags - how ? | cybmole | Calibre | 4 | 09-29-2010 08:50 AM |
hi, i am currently getting rid of the | russellmz00 | Introduce Yourself | 6 | 05-25-2010 01:42 PM |
Just to get rid of the message | pshrynk | Introduce Yourself | 10 | 04-17-2009 01:47 AM |