Quote:
Originally Posted by buffaloseven
I only briefly scanned this and thought it worth mentioning: using an <i> tag is no longer a valid way to present italics in an HTML file. Italics now use <em> and any converter that is converting an <i> tag to a span is doing what it should (since it's inline within a paragraph). Try using <em> instead and see if the problem persists 
|
And that is a painfully wrong decision. The <i> tag is about style and the <em> about semantics. It is something different. That in most (or perhaps all current) readers/browsers <em> is represented as <i> is another story.
If you really wanted to do this right, identify the semantic use of the <i> tag and create a class in the stylesheet for that use and determine its style. It might be emphasis, it might be thoughts, it might be a letter, etc. If you really want to make semantic use, follow it through.
The <i> tag will be supported for a long time, even if it is deprecated. I would not even be surprised if it is restored.
WordML/OpenXML is not that difficult to understand. It is just very big with a lot of options and functions. It is all documented quite well. The reason for these splitups can be various. Most likely the word was edited with a slightly different style. That would cause this behavior. It would be correct, only not very useful for further processing. That is the main reason I use a different way of creating the HTML export from Word...