Quote:
Originally Posted by Solitaire1
it seems like the issue with EPUB is that it is XML based.
|
The XML format is not the issue; the problem is that the XML container hosts HTML content.
Quote:
XML is that it is not designed to be directly edited by humans. Instead, it is supposed to be directly edited by tools used by humans.
|
Correct. But it was purposely designed to be human-readable and handcraftable.
Quote:
Could many of the issues with EPUB be the result of incorrect / incomplete tagging rather than a result of the format itself?
|
The real problem is that HTML cannot enforce any kind of 'correct' tagging. You may recommend that people use <p> for, you know, paragraphs, but there will always be a book that uses <div>s for that, and there is no way you can mark it as invalid.
With a purpose-built format that defines which elements are allowed, that kind of tagging would simply not be possible.
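To make the point concrete, here is a rough Python sketch. It is only an illustration: the ALLOWED_BLOCKS rule and both helper functions are made up for this post and do not come from any real EPUB tool or spec. It shows that a generic well-formedness check accepts both the <p> and the <div> version, while a format with its own closed list of elements could reject the second one.

```python
import xml.etree.ElementTree as ET

# The same two paragraphs marked up two ways; both are well-formed markup.
WITH_P = "<body><p>First paragraph.</p><p>Second paragraph.</p></body>"
WITH_DIV = "<body><div>First paragraph.</div><div>Second paragraph.</div></body>"

# Hypothetical rule for an imaginary book-specific format:
# the body may contain only these block elements.
ALLOWED_BLOCKS = {"p", "h1", "h2", "blockquote"}


def is_well_formed(markup):
    """Return True if the markup parses as XML, which is all a generic check sees."""
    try:
        ET.fromstring(markup)
        return True
    except ET.ParseError:
        return False


def disallowed_blocks(markup):
    """Return the tags of top-level children that the hypothetical format forbids."""
    root = ET.fromstring(markup)
    return [child.tag for child in root if child.tag not in ALLOWED_BLOCKS]


if __name__ == "__main__":
    for name, markup in (("p version", WITH_P), ("div version", WITH_DIV)):
        print(name, is_well_formed(markup), disallowed_blocks(markup))
```

With plain HTML both versions pass, so nothing tells you the second book is tagged badly. Only the made-up stricter rule catches it, and that is the kind of rule a dedicated format could build in.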