MS Word "crap" at beginning of html files
Is there an easy way to clean up the styling "crap" that Word puts at the top of all html files? Like maybe through a regex search and replace?
Why does Word put a gazillion font items at the start of html files even when those fonts are not being used in the file? It makes it much harder to edit the files in Sigil or any other editor. I manually selected the Word stuff in one file and deleted it and there didn't seem to be any negative impact, so I assume it's safe to delete it. But how does one do it for every file?
I try to stay away from Word for epubs, but I find that when a file needs heavy editing and also needs a new TOC, Word is the easiest thing to use. I save my files as filtered html.
Maybe if I run the file through another utility first, it will clean up that "crap"?
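For the regex route, here is a rough sketch in Python of what I have in mind. It assumes the Word stuff all lives inside `<style>` elements, that nothing in the body depends on those rules (which matched what I saw when I deleted it by hand), and that the files sit in one folder. The names `strip_style_blocks` and `clean_folder` and the `cp1252` default encoding are my own guesses for illustration, not anything Word or Sigil provides.

```python
import re
from pathlib import Path

# Matches a whole <style>...</style> element, even when it spans many lines.
# Assumption: all of the unwanted Word styling sits inside <style> elements.
STYLE_BLOCK = re.compile(r"<style\b[^>]*>.*?</style>\s*", re.IGNORECASE | re.DOTALL)


def strip_style_blocks(html: str) -> str:
    """Return the html text with every <style> element removed."""
    return STYLE_BLOCK.sub("", html)


def clean_folder(folder: str, encoding: str = "cp1252") -> int:
    """Rewrite each .htm/.html file in folder in place; return how many changed.

    The default encoding is an assumption; pass the charset your files use.
    """
    changed = 0
    for path in Path(folder).iterdir():
        if not path.is_file() or path.suffix.lower() not in {".htm", ".html"}:
            continue
        original = path.read_text(encoding=encoding)
        cleaned = strip_style_blocks(original)
        if cleaned != original:
            path.write_text(cleaned, encoding=encoding)
            changed += 1
    return changed
```

It would be called as `clean_folder("path/to/html")`, on a copy of the files rather than the originals, since it overwrites them in place.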
Anyone have any suggestions?