Quote:
Originally Posted by JSWolf
I've dug in the innards of many eBooks and I rarely see <h for chapter headers, sub headers, and section headers.
Interesting. I just did a search on my ebook archive (ebooks before they are loaded into calibre for fixup). Out of 7914 epubs, 7401 had <hx> tags while the remaining 513 did not. I looked at 20 or so of the ones that didn't and they used <p> tags for the headers. That makes building the TOC (whether nav.xhtml or toc.ncx) more of a pain than it needs to be.
Most used at most 2 levels, <h1> and <h2>, while one coded by an anal retentive went all the way from <h1> to <h6>.
Some of them created by Vellum wrapped 4 or more levels of <div> around the <hx> tag. The funny part being that 2 or 3 of the <div> tags had a class that did not exist in the CSS.
That search ran most of the day while I was busily working from home. I suspect that having to unpack the epubs into a temp directory before searching did nothing for the search speed.
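For anyone wanting to repeat this kind of count, here is a rough sketch of how it could be done without unpacking anything to disk. It is not the script I ran. Since an epub is just a zip file, Python's zipfile module can read each content document straight into memory. The function names, the list of file extensions, and the regex are my own assumptions for illustration.

```python
import re
import sys
import zipfile
from pathlib import Path

# Matches an opening <h1> ... <h6> tag, with or without attributes.
# It does not match <hr>, <head>, <header> or <html>.
HEADING_RE = re.compile(rb"<h[1-6][\s>/]", re.IGNORECASE)

# Assumed extensions for content documents inside the epub.
CONTENT_SUFFIXES = (".xhtml", ".html", ".htm")


def epub_has_headings(epub_path):
    """Return True if any content document in the epub contains an <h1>-<h6> tag."""
    with zipfile.ZipFile(epub_path) as zf:
        for name in zf.namelist():
            if name.lower().endswith(CONTENT_SUFFIXES):
                # Read the member into memory; nothing is written to a temp directory.
                if HEADING_RE.search(zf.read(name)):
                    return True
    return False


def scan(root):
    """Split the epubs found under root into (with_headings, without_headings)."""
    with_h, without_h = [], []
    for path in sorted(Path(root).rglob("*.epub")):
        try:
            (with_h if epub_has_headings(path) else without_h).append(path)
        except zipfile.BadZipFile:
            print(f"skipping unreadable file: {path}", file=sys.stderr)
    return with_h, without_h
```

Calling scan() on the archive folder returns the two lists, so len() of each gives the counts. One caveat: a nav.xhtml often has its own <h1> or <h2>, so an epub with <p> headers everywhere else could still be counted as having <hx> tags unless the nav document is skipped. The byte-level regex also assumes the files are UTF-8 or similar, not UTF-16.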