HTML to Mobi conversions (DocBook XSL, and content.opf?)
Hi,
I recently pre-ordered the Kindle 3 like everyone else, and set about consolidating my various electronic books into a suitable format.
A big source for me is actually the O Reilly iphone books, which are purchased in a Stanza shell. The actual contents are composed as HTML, with a Table of Contents file and then each chapter/section described as separate html files. The images are duly linked in these html files in a flat folder structure.
Anyway, was curious if this was a standardized format. The only hint I've found in the files are that they've been generated via the "DocBook XSL Stylesheets," which I'm looking into now. But XSL is a giant pain in the ass from experience, so just wanted to ask here before I go down that painful, painful road again.
For reference, the naming scheme is fairly consistent, but semi arbitrary (eg ch01s02 for chapter 1 section 2), and the actual structure seems defined by an XML file called content.opf
I'm working on the idea here that since I purchased the books in the format, I'm entitled to the content and being Canadian, I can "strip" the "DRM" (i.e. remove the content from the stanza app).
Soooooooo, any suggestions?
|