10-13-2025, 04:06 PM   #6
KevinH
Sigil Developer
Note that HTML is far more tolerant of embedded < and > characters and has a much looser syntax, but the price is that it needs a very forgiving, specialized parser. So what the Internet Archive gets away with in spaghetti HTML is not always valid in an EPUB.

The EPUB spec uses the much stricter XHTML/XML parsing rules, and can therefore get by with a simpler and much faster parser than HTML requires.
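
To make the difference concrete, here is a rough sketch using only Python's standard library. It is not how Sigil or any EPUB reader does it: the snippet and the helper names (`TextCollector`, `strict_parse_ok`, `forgiving_text`, `repair`) are made up for illustration. It just shows a strict XML parser rejecting a bare < that a forgiving HTML parser accepts as text, and that escaping the character makes the markup well-formed.

```python
import xml.etree.ElementTree as ET
from html import escape
from html.parser import HTMLParser

# Hypothetical sample: a bare '<' inside text content.
# HTML parsers tolerate this; it is not well-formed XML/XHTML.
SNIPPET = "<p>if a < b then</p>"


class TextCollector(HTMLParser):
    """Forgiving HTML parser that just gathers the text it sees."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def strict_parse_ok(markup):
    """Return True if a strict XML parser accepts the markup."""
    try:
        ET.fromstring(markup)
        return True
    except ET.ParseError:
        return False


def forgiving_text(markup):
    """Return the text content recovered by the forgiving HTML parser."""
    parser = TextCollector()
    parser.feed(markup)
    parser.close()
    return "".join(parser.chunks)


def repair(text):
    """Re-wrap recovered text in a <p>, escaping '<', '>' and '&'."""
    return "<p>" + escape(text, quote=False) + "</p>"


if __name__ == "__main__":
    print("strict parser accepts original:", strict_parse_ok(SNIPPET))
    recovered = forgiving_text(SNIPPET)
    print("forgiving parser recovered:", repr(recovered))
    fixed = repair(recovered)
    print("repaired markup:", fixed)
    print("strict parser accepts repaired:", strict_parse_ok(fixed))
```

The repair step here only handles this one toy case (a single paragraph of plain text). A real cleanup also has to deal with unclosed tags, bad nesting, unquoted attributes and so on.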

Sigil's Mend is a forgiving HTML parser that can create valid XHTML, which is why it is recommended to enable Mend when opening HTML files.