Thread: PDF vs LRF etc
Old 02-25-2009, 01:02 PM   #30
Valloric
Created Sigil, FlightCrew
Quote:
Originally Posted by kovidgoyal
Actually, building HTML parsers is trivial, since almost every language has ready-made open source HTML parsing libraries.
So you're saying it's easy to create an HTML parser because there are a lot of open source HTML parsers out there?

Do I need to respond to that? Or do you see how silly that sentence is?

Quote:
Originally Posted by kovidgoyal
And I don't know of a single ebook viewing application that will not read HTML in an EPUB container. So let's not get too religious about standards.
But you have to take into account the complexity of a full-fledged HTML parser, which has to recover from unclosed tags, bad nesting and every other kind of tag soup. Portable device makers would really love it if they could just expect XHTML content in EPUB documents (and they should, because plain HTML is not supported by OPS), because then they could use a lean, fast, lightweight XML parser. It would require less memory, CPU and power.

It's this "let's not get too religious about standards" attitude that makes it less likely device makers will ever be able to count on XHTML. So at the very least, don't recommend to other people that they flat-out ignore the standard and put HTML in source documents. XHTML is not that difficult to produce, and Tidy can convert the majority of HTML into conforming XHTML.
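
To make the parser point concrete, here is a rough Python sketch. It is only an illustration, not what any reader firmware actually runs: the two sample strings and the is_well_formed helper are made up for this example. It uses the standard library's strict XML parser, which is the kind of small parser a device could get away with if EPUB content were always XHTML. Well-formed XHTML goes straight through; ordinary tag-soup HTML is rejected, and coping with it is exactly the extra work a full HTML parser has to do.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample documents, invented for this illustration.
XHTML = (
    '<html xmlns="http://www.w3.org/1999/xhtml">'
    "<body><p>Line one<br/>Line two</p></body></html>"
)
# Typical loose HTML: <br> and <p> are never closed.
TAG_SOUP = "<html><body><p>Line one<br>Line two</body></html>"


def is_well_formed(markup: str) -> bool:
    """Return True if a strict XML parser accepts the markup."""
    try:
        ET.fromstring(markup)
        return True
    except ET.ParseError:
        # A strict parser does no error recovery; it simply refuses.
        return False


if __name__ == "__main__":
    print("XHTML accepted:", is_well_formed(XHTML))
    print("Tag soup accepted:", is_well_formed(TAG_SOUP))
```

A browser-grade HTML parser would accept the second string too, but only by carrying all the recovery rules for guessing where the missing close tags belong. Running the HTML through Tidy first moves that work to the person producing the book instead of the device reading it.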