MobileRead Forums - View Single Post

Valloric · 02-25-2009, 02:02 PM

Quote:

Originally Posted by kovidgoyal

Actually, building HTML parsers is trivial, since almost every language has ready-made open source HTML parsing libraries.

So you're saying it's easy to create an HTML parser because there are a lot of open source HTML parsers out there?

Do I need to respond to that? Or do you see how silly that sentence is?

Quote:

Originally Posted by kovidgoyal

And I don't know of a single ebook viewing application that will not read HTML in an EPUB container. So let's not get too religious about standards.

But you have to take into account the complexity of a full-fledged HTML parser. Portable device makers would love to be able to expect XHTML content in EPUB documents (and they should be able to, because plain HTML is not supported by OPS), since then they could use a lean, fast, lightweight XML parser. That would require less memory, CPU time and power.
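To make the difference concrete, here's a rough sketch in Python. The two sample documents and the `parses_as_xml` helper are made up for illustration, and the standard library's XML parser is only standing in for whatever lean parser a device maker might actually ship. Well-formed XHTML goes straight through a strict XML parser, while ordinary tag-soup HTML gets rejected and would need a far more forgiving (and heavier) HTML parser.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: well-formed XHTML, every tag closed, attributes quoted.
XHTML_DOC = (
    '<html xmlns="http://www.w3.org/1999/xhtml">'
    "<head><title>Chapter 1</title></head>"
    "<body><p>First line<br/>second line</p></body>"
    "</html>"
)

# Hypothetical sample: typical tag soup, with an unquoted attribute value
# and unclosed <p> and <br> elements.
HTML_DOC = "<html><body><p class=intro>First line<br>second line</body></html>"


def parses_as_xml(markup: str) -> bool:
    """Return True if a strict XML parser accepts the markup as-is."""
    try:
        ET.fromstring(markup)
        return True
    except ET.ParseError:
        return False


print(parses_as_xml(XHTML_DOC))  # True
print(parses_as_xml(HTML_DOC))   # False
```

A strict parser can simply bail out at the first error; an HTML parser has to carry all the recovery rules for input like the second sample, and that's where the extra complexity comes from.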

It's this "let's not get too religious about standards" attitude that makes that outcome less likely. So at the very least, don't recommend to other people that they flat-out ignore the standard and put HTML in source documents. XHTML is not that difficult to produce, and Tidy can convert the majority of HTML into conforming XHTML.