08-08-2010, 11:13 PM | #1 |
Senior Wrangler
Posts: 22
Karma: 10
Join Date: Jul 2006
Device: iPhone 3GS, want Kindle 3
|
HTML to Mobi conversions (DocBook XSL, and content.opf?)
Hi,
I recently pre-ordered the Kindle 3 like everyone else, and set about consolidating my various electronic books into a suitable format. A big source for me is actually the O Reilly iphone books, which are purchased in a Stanza shell. The actual contents are composed as HTML, with a Table of Contents file and then each chapter/section described as separate html files. The images are duly linked in these html files in a flat folder structure. Anyway, was curious if this was a standardized format. The only hint I've found in the files are that they've been generated via the "DocBook XSL Stylesheets," which I'm looking into now. But XSL is a giant pain in the ass from experience, so just wanted to ask here before I go down that painful, painful road again. For reference, the naming scheme is fairly consistent, but semi arbitrary (eg ch01s02 for chapter 1 section 2), and the actual structure seems defined by an XML file called content.opf I'm working on the idea here that since I purchased the books in the format, I'm entitled to the content and being Canadian, I can "strip" the "DRM" (i.e. remove the content from the stanza app). Soooooooo, any suggestions? |
08-09-2010, 02:07 AM | #2 | |
US Navy, Retired
Posts: 9,865
Karma: 13806776
Join Date: Feb 2009
Location: North Carolina
Device: Icarus Illumina XL HD, Nexus 7
|
Quote:
Remove DRM from O'Reilly books? I didn't think O'Reilly used DRM. |
|
Advert | |
|
08-09-2010, 02:08 AM | #3 |
Wizard
Posts: 4,553
Karma: 950151
Join Date: Nov 2008
Device: Sony PRS-950, iphone/ipad (Marvin/iBooks/QuickReader)
|
Since Stanza is primarily an ePub format reader it is more than likely that these are ePub format books encapsulated in a Stanza shell to provide the reading facility. Nothing you said contradicts this assumption. An ePub book is basically a Zip file with a standardized set of files inside.
It might be worth taking all the files, zipping them up into a single file, and then changing the file extension to ePub and see if Calibre (or an ePub editor like Sigil) is happy with the results. |
09-04-2010, 09:02 PM | #4 |
Senior Wrangler
Posts: 22
Karma: 10
Join Date: Jul 2006
Device: iPhone 3GS, want Kindle 3
|
Worked brilliantly, thanks! It was just an uncompressed epub.
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Default book language to be saved in content.opf? | moriakaice | Calibre | 7 | 05-21-2011 05:02 PM |
Calibre Recipe HTML content differs from raw html of index.html. | krunk | Calibre | 4 | 09-20-2010 09:48 PM |
cleaning the content.opf file | Adjust | ePub | 6 | 09-01-2010 05:54 PM |
HTML, NCX & OPF --> MESS | pakiyabhai | Workshop | 2 | 12-22-2009 10:43 AM |
DocBook XSL 1.74.0 adds ePub support! | Alexander Turcic | News | 1 | 06-14-2008 07:06 AM |