Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Formats > ePub

Notices

Reply
 
Thread Tools Search this Thread
Old 07-09-2025, 02:57 PM   #1
sueneu
eBook Designer
sueneu began at the beginning.
 
sueneu's Avatar
 
Posts: 29
Karma: 10
Join Date: Jun 2010
Location: New Jersey
Device: ipad, nook, kindle
Question Allowing HTML in EPUBS

I am reaching out here because we want to hear from people making EPUBs and making tools for EPUBs.

What if, in the future, EPUBs could contain HTML files in addition to XHTML?

As you know, technology is constantly evolving. One such technology, currently central to EPUB files, is XML. More exactly the XML syntax of HTML (XHTML). Use of this syntax is declining outside of publishing. In order to provide for the future of the EPUB ecosystem, we plan to remove the requirement for the XML syntax in the EPUB package.

This change will not affect the document formats currently allowed in an EPUB publication, it would simply remove the restriction to the XML syntax.

We hope to understand how this might affect the community. That way, we can identify potential obstacles and make an easier transition to accepting HTML.

Your input is valuable, and will help shape the future of ebooks.
Please take this brief four-question survey: https://www.w3.org/wbs/1/epubhtml/
  • You do not need a W3C account
  • Responses are anonymous
  • The survey is in English and Japanese
  • You may share this link with anyone you believe could give us a helpful perspective
  • Survey closes in September
  • The anonymized information we collect will be made public to ensure transparency in our process.

Who We Are
This survey was created by the Publishing Maintenance Working Group, part of the World Wide Web Consortium (W3C). Our mission is to maintain the Recommendations for EPUB files and ereaders. We work incrementally to improve the Recommendations, provide additional clarity, and grow the Recommendations for the future of Ebooks.
sueneu is offline   Reply With Quote
Old 07-09-2025, 03:33 PM   #2
KevinH
Sigil Developer
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 8,760
Karma: 5706256
Join Date: Nov 2009
Device: many
I am already on record as being totally against adding html parsing rules to the epub3 standard since it will further fracture the epub3 base, will make backwards compatibility with old epub2 only readers next to impossible, and is just a really bad idea in any epub3 spec. Save it for epub 4, if you must.

Second, your proposition that xml is dying is simply incorrect and quite misleading. It should not be used in the lead in to your survey. Talk about adding a bias to a survey! Most current word processors (commonly used by fiction and non-fiction authors) use XML. Xml is also heavily used in text storage and archival and databases. Although html can omit some implied end tags, and handle some void tags differently, using open and closed tags is still part of the standard. Xhtml as a spec has long ago been replaced by xml parsing rules applied to html5 and its current living spec versions. This is not going to change no matter what the W3C thinks.

Also in the lead up to your survey link, you neglect to say this change would be made in the epub3 spec, and are thereby hiding the fractures and splits it would make to our current standard. I would have a very different response if this were being proposed for an epub 4 spec, as I suspect many others would.

Last edited by KevinH; 07-09-2025 at 03:53 PM.
KevinH is offline   Reply With Quote
Advert
Old 07-09-2025, 11:13 PM   #3
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 45,345
Karma: 27182818
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
I have to agree that this is rather unnecessary. XHTML as a spec might be dead, but using a valid XML serialization of HTML 5 is perfectly fine and supports evolving web standards perfectly. This is what happens in practice today anyway. Anything you can do in HTML 5 works fine when the HTML 5 is serialized as valid XML.

Forcing EPUB software developers to support HTML 5 parsing is just creating unnecessary busy work for no good reason.

The *only* advantage I can see for the HTML 5 serialization over the XML serialization is that the former is easier to write by hand. I dont think that is an advantage that justifies the cost. Most EPUB editing tools already have some kind of functionality to either flag or auto-correct invalid XML making writing it not that hard.

And I agree with KevinH that this disruptive of a change should be in EPUB 4 otherwise it will just end up getting ignored and un-used like the yoyo-ing that was done with EPUB metadata.
kovidgoyal is offline   Reply With Quote
Old 07-10-2025, 04:50 AM   #4
Martinoptic
Bibliophist
Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.
 
Martinoptic's Avatar
 
Posts: 7,011
Karma: 7173892
Join Date: Dec 2021
Location: England
Device: none
I don't see what improvement it would make, and it would likely break backward compatibility with epub2 as Kevin says. Would readers want that - I think not. Why is there no question for readers rather than developers?
Edit: And why in an anonymous poll do you want my name, surname, employer and job title as well as my email address? Not very anonymous is it?

Last edited by Martinoptic; 07-10-2025 at 04:53 AM.
Martinoptic is offline   Reply With Quote
Old 07-10-2025, 08:36 AM   #5
DiapDealer
Grand Sorcerer
DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.
 
DiapDealer's Avatar
 
Posts: 28,569
Karma: 204127028
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
I'm from the "Don't unnecessarily make API-breaking changes without a new major version (and a new soname)" camp myself. I know that analogy isn't exactly one-to-one, but it fits well enough.

But even that code of the school yard is being broken more and more often these days, it seems (*cough* libxml2 *cough*).

I did submit my feelings via the "anonymous" survey.
DiapDealer is offline   Reply With Quote
Advert
Old 07-10-2025, 10:54 AM   #6
PeterT
Grand Sorcerer
PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.
 
Posts: 13,511
Karma: 78910202
Join Date: Nov 2007
Location: Toronto
Device: Libra H2O, Libra Colour
There are some interesting discussions out there. See for instance "Allow pure HTML 5 in EPUB 3" https://www.edrlab.org/2025/07/06/al...ml5-in-epub-3/
PeterT is offline   Reply With Quote
Old 07-10-2025, 11:21 AM   #7
KevinH
Sigil Developer
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 8,760
Karma: 5706256
Join Date: Nov 2009
Device: many
Interesting article. Yes a new standard would be a big improvement. Trying to shoehorn it into epub3 would not.

That said, Thorium will not work with pure html5 directly (I tested it) but it sounds like they are working on it.
KevinH is offline   Reply With Quote
Old 07-10-2025, 11:29 AM   #8
PeterT
Grand Sorcerer
PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.PeterT ought to be getting tired of karma fortunes by now.
 
Posts: 13,511
Karma: 78910202
Join Date: Nov 2007
Location: Toronto
Device: Libra H2O, Libra Colour
Another source worth following is the Public Mailing list of the "W3 Publishing Maintenance Group" especially the Agenda and Minutes from various meetings. See https://lists.w3.org/Archives/Public/public-pm-wg/
PeterT is offline   Reply With Quote
Old 07-10-2025, 11:55 AM   #9
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 45,345
Karma: 27182818
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Quote:
Originally Posted by KevinH View Post
Interesting article. Yes a new standard would be a big improvement. Trying to shoehorn it into epub3 would not.
That's highly debatable. If a new standard is made by someone else than the people that gave us EPUB, maybe. Otherwise it will just be the same old mess hiding behind a new name.

Quote:
That said, Thorium will not work with pure html5 directly (I tested it) but it sounds like they are working on it.
They appear to be doing some weird self contradictory things. First they say that the XML serialization of HTML 5 is fine and works in thorium. Then they go on to say that if an ebook has the non XML serialization for HTML 5 they will disable features such as pagination in their reader. I dont follow why the serialization format should make any difference to what features reader software supports. The actual DOM that is being serialized is independent of the format chosen for serialization and ebook reader features should depend on the DOM + CSS + JS not the serialization.

All-in-all it looks like there is a ton of confusion in this space. Starting with the people proposing this change who dont seem to understand that the serialization of HTML is irrelevant to what features it supports.

Like I said in my previous post the *only* pro for non XML serialization is that it is easier to author using a plain text editor.
kovidgoyal is offline   Reply With Quote
Old 07-10-2025, 05:08 PM   #10
sueneu
eBook Designer
sueneu began at the beginning.
 
sueneu's Avatar
 
Posts: 29
Karma: 10
Join Date: Jun 2010
Location: New Jersey
Device: ipad, nook, kindle
Thanks to everyone for replying, and thank you to those who filled out the survey.

I’ll be sure to share the anonymized results.

The survey is open until mid-September, so I’ll have complete results sometime after that.
sueneu is offline   Reply With Quote
Old 07-11-2025, 12:31 PM   #11
sueneu
eBook Designer
sueneu began at the beginning.
 
sueneu's Avatar
 
Posts: 29
Karma: 10
Join Date: Jun 2010
Location: New Jersey
Device: ipad, nook, kindle
Anonymous poll and EPUB user input

Quote:
Originally Posted by Martinoptic View Post
I don't see what improvement it would make, and it would likely break backward compatibility with epub2 as Kevin says. Would readers want that - I think not. Why is there no question for readers rather than developers?
Edit: And why in an anonymous poll do you want my name, surname, employer and job title as well as my email address? Not very anonymous is it?
Thank you for bringing this to our attention. The survey was made through the W3C’s survey system. We’ve sent a query to the system admins with this complaint. We will not publish people’s personal information when we release the results of the survey.

When you say users, do you mean people who use EPUB testing and editing tools, and/or user-readers who read Ebooks?

What question would you propose?
sueneu is offline   Reply With Quote
Old 07-11-2025, 02:19 PM   #12
Martinoptic
Bibliophist
Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.Martinoptic ought to be getting tired of karma fortunes by now.
 
Martinoptic's Avatar
 
Posts: 7,011
Karma: 7173892
Join Date: Dec 2021
Location: England
Device: none
Thank you for your reply. I was meaning people who read ebooks and may occasionally tweak them.
As I didn't get to the first question in the survey (as I'm not prepared to give all the details requested) it may well be that a further question isn't needed? As long as it isn't a requirement to have a job in e-publishing (which is what I took the sign up to suggest) then maybe the survey is ok as it stands. What do other people think?

Last edited by Martinoptic; 07-11-2025 at 02:22 PM.
Martinoptic is offline   Reply With Quote
Reply

Tags
eprdctn, epub, epub app, epub application, html code


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Issue converting EPUBs to HTML wolf123 Conversion 2 10-29-2021 12:23 PM
Epubs with oversize html files adrian1944 Conversion 6 01-01-2014 11:15 AM
Help! Problem with HTML tables in epubs nazzing ePub 3 06-05-2013 10:15 AM
Touch Kobo Touch and html links in epubs leaperk Kobo Reader 1 07-05-2011 02:56 PM
Is it possible to change how Calibre formats HTML for Epubs chief Calibre 8 07-07-2010 03:04 PM


All times are GMT -4. The time now is 02:14 AM.


MobileRead.com is a privately owned, operated and funded community.