View Single Post
Old 06-12-2020, 11:25 PM   #45
DNSB
Bibliophagist
DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.
 
DNSB's Avatar
 
Posts: 47,308
Karma: 171295426
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
Quote:
Originally Posted by JSWolf View Post
I've dug in the innards of many eBooks and I rarely see <h for chapter headers, sub headers, and section headers.
Interesting. I just did a search on my ebook archive (ebooks before they are loaded into calibre for fixup). Out of 7914 epubs, 7401 had <hx> tags while the remaining 513 did not. I looked at 20 or so of the ones that didn't and they used <p> tags for the headers. Makes building the TOC whether nav.xhtml or toc.ncx) more of a pain than it needs to be.

Most used at most 2 levels <h1> and <h2> while one coded by an anal retentive went all the way from <h1> to <h6>.

Some of them created by Vellum wrapped 4 or more levels of <div> around the <hx> tag. The funny part being that 2 or 3 of the <div> tags had a class that did not exist in the CSS.

That search ran most of the day while I was busily working from home. I suspect that having to unpack the epubs into a temp directory before searching did nothing for the search speed.
DNSB is offline   Reply With Quote