Old 06-12-2020, 11:40 PM   #46
j.p.s
Grand Sorcerer
Quote:
Originally Posted by DNSB
Interesting. I just did a search on my ebook archive (ebooks before they are loaded into calibre for fixup). Out of 7914 epubs, 7401 had <hx> tags while the remaining 513 did not. I looked at 20 or so of the ones that didn't and they used <p> tags for the headers. That makes building the TOC (whether nav.xhtml or toc.ncx) more of a pain than it needs to be.

Most used at most 2 levels <h1> and <h2> while one coded by an anal retentive went all the way from <h1> to <h6>.

Some of them created by Vellum wrapped 4 or more levels of <div> around the <hx> tag. The funny part being that 2 or 3 of the <div> tags had a class that did not exist in the CSS.

That search ran most of the day while I was busily working from home. I suspect that having to unpack the epubs into a temp directory before searching did nothing for the search speed.

But is that more statistically significant than finding two books that didn't use <Hn>?

Isn't the existence of multiple tools that automatically map headers into TOC entries just a coincidental fluke?
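
For anyone who wants to repeat that kind of count on their own archive, here is a rough sketch in Python. It is not DNSB's actual search. It reads each epub as a zip file in memory rather than unpacking it to a temp directory, and it uses a regex as a stand-in for a real XHTML parser. The function names `headings_in_epub` and `count_archive` are made up for illustration, and the assumption that content files end in .xhtml, .html, or .htm is mine.

```python
import re
import zipfile
from pathlib import Path

# Approximate match for <h1>...</h1> through <h6>...</h6>.
# A regex is a shortcut here, not a proper XHTML parser.
HEADING_RE = re.compile(rb"<h([1-6])\b[^>]*>(.*?)</h\1\s*>",
                        re.IGNORECASE | re.DOTALL)
TAG_RE = re.compile(rb"<[^>]+>")

# Assumption: content documents use one of these extensions.
CONTENT_SUFFIXES = (".xhtml", ".html", ".htm")


def headings_in_epub(epub):
    """Return [(level, text), ...] for each heading tag found in the epub."""
    found = []
    with zipfile.ZipFile(epub) as zf:
        for name in zf.namelist():
            if not name.lower().endswith(CONTENT_SUFFIXES):
                continue
            data = zf.read(name)  # read straight from the zip, no temp dir
            for m in HEADING_RE.finditer(data):
                # Drop inline markup inside the heading and collapse whitespace.
                text = TAG_RE.sub(b"", m.group(2)).decode("utf-8", "replace")
                found.append((int(m.group(1)), " ".join(text.split())))
    return found


def count_archive(root):
    """Count epubs under root with and without any <h1>-<h6> tag."""
    with_h = without_h = 0
    for path in Path(root).rglob("*.epub"):
        try:
            has_headings = bool(headings_in_epub(path))
        except zipfile.BadZipFile:
            continue  # skip files that are not valid zip archives
        if has_headings:
            with_h += 1
        else:
            without_h += 1
    return with_h, without_h
```

The (level, text) pairs are roughly what a TOC generator starts from when it maps headers to entries. Note that this also picks up any heading inside the nav document itself, so a stricter count would need to exclude that file.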