Quote:
Originally Posted by burbleburble
Beacause of the complexity involved in dealing with more types and levels of tags, is there any pressing reason why a clean ebook should need div's or i or b or em etc.? are p, span, lists enough? (I mean, its an ebook, not a pdf or interactive webpage...)
|
Some feedback, (IMO only, of course), I've never needed a <div> tag but I do prefer an <i> or <em> tag to a <span class="italic">. mainly because if I'm cleaning up using regex in a text editor the closing tags </i> and </em> are specific. The </span> tag could be closing anything. Particularly difficult when they are nested several layers deep. The shorter <i>, <em> tags are also less distracting in a text editor.
Of course, if your new utility is really good at cleaning html then I may not need to use regex in a text editor nearly so often
On the matter of stripping CSS... I would hope that it would be optional. If a <blockquote> tag has been used or the equivalent CSS, to achieve equal left and right indented margins, then I do not want this info stripped out. Similarly CSS margin settings when used for scene-breaks or verse. I also don't want text-indent settings removed.
@Japes,
I find a Calibre conversion to epub usually does a good job of sorting out the font-size of the main bodytext, by putting a font-size in the <body> CSS class and omitting it from the main bodytext class. Calibre also changes all instances of font-sizes small, medium, large etc to em-equivalents. I've found this helps to make my PRS+ CSS work more reliably. I do find that each different font-face has its own 'best font-size' but that's easily added to the relevant PRS+ CSS file. I always convert using Base_font_size=12pt, which puts font-size:1em in the body CSS class, then let PRS+ take care of the rest.
[Edit:] Just thought of something else regarding simplification. This is a finer detail point, but if an html file has a specially formatted first-character of a chapter then it would be good to retain it. I wouldn't care what the formatting was but if it always got cleaned up to:
Code:
<span class="dropcap">T</span>his is the first...
then a standard linked CSS file could format it to taste.
Well you did ask for feedback...