Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Software > Calibre

Notices

Reply
 
Thread Tools Search this Thread
Old 08-08-2010, 11:13 PM   #1
AndrewLB
Senior Wrangler
AndrewLB began at the beginning.
 
AndrewLB's Avatar
 
Posts: 22
Karma: 10
Join Date: Jul 2006
Device: iPhone 3GS, want Kindle 3
HTML to Mobi conversions (DocBook XSL, and content.opf?)

Hi,
I recently pre-ordered the Kindle 3 like everyone else, and set about consolidating my various electronic books into a suitable format.

A big source for me is actually the O Reilly iphone books, which are purchased in a Stanza shell. The actual contents are composed as HTML, with a Table of Contents file and then each chapter/section described as separate html files. The images are duly linked in these html files in a flat folder structure.

Anyway, was curious if this was a standardized format. The only hint I've found in the files are that they've been generated via the "DocBook XSL Stylesheets," which I'm looking into now. But XSL is a giant pain in the ass from experience, so just wanted to ask here before I go down that painful, painful road again.

For reference, the naming scheme is fairly consistent, but semi arbitrary (eg ch01s02 for chapter 1 section 2), and the actual structure seems defined by an XML file called content.opf

I'm working on the idea here that since I purchased the books in the format, I'm entitled to the content and being Canadian, I can "strip" the "DRM" (i.e. remove the content from the stanza app).

Soooooooo, any suggestions?
AndrewLB is offline   Reply With Quote
Old 08-09-2010, 02:07 AM   #2
DoctorOhh
US Navy, Retired
DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.DoctorOhh ought to be getting tired of karma fortunes by now.
 
DoctorOhh's Avatar
 
Posts: 8,809
Karma: 12535517
Join Date: Feb 2009
Location: North Carolina
Device: Nexus 7
Quote:
Originally Posted by AndrewLB View Post
A big source for me is actually the O Reilly iphone books, which are purchased in a Stanza shell. The actual contents are composed as HTML, with a Table of Contents file and then each chapter/section described as separate html files. The images are duly linked in these html files in a flat folder structure.
I'll be the first to admit ignorance in this area, not having an iPhone or using Stanza myself, but it sounds as if you are talking about books in an ePub format, which is one of the formats O'Reilly uses and one that Stanza reads.

Quote:
Originally Posted by AndrewLB View Post
I'm working on the idea here that since I purchased the books in the format, I'm entitled to the content and being Canadian, I can "strip" the "DRM" (i.e. remove the content from the stanza app).
Remove DRM from O'Reilly books? I didn't think O'Reilly used DRM.
DoctorOhh is offline   Reply With Quote
Old 08-09-2010, 02:08 AM   #3
itimpi
Wizard
itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.
 
Posts: 4,046
Karma: 777825
Join Date: Nov 2008
Device: Sony PRS-950, iphone/ipad (Marvin/iBooks/QuickReader)
Since Stanza is primarily an ePub format reader it is more than likely that these are ePub format books encapsulated in a Stanza shell to provide the reading facility. Nothing you said contradicts this assumption. An ePub book is basically a Zip file with a standardized set of files inside.

It might be worth taking all the files, zipping them up into a single file, and then changing the file extension to ePub and see if Calibre (or an ePub editor like Sigil) is happy with the results.
itimpi is offline   Reply With Quote
Old 09-04-2010, 09:02 PM   #4
AndrewLB
Senior Wrangler
AndrewLB began at the beginning.
 
AndrewLB's Avatar
 
Posts: 22
Karma: 10
Join Date: Jul 2006
Device: iPhone 3GS, want Kindle 3
Worked brilliantly, thanks! It was just an uncompressed epub.
AndrewLB is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Default book language to be saved in content.opf? moriakaice Calibre 7 05-21-2011 05:02 PM
Calibre Recipe HTML content differs from raw html of index.html. krunk Calibre 4 09-20-2010 09:48 PM
cleaning the content.opf file Adjust ePub 6 09-01-2010 05:54 PM
HTML, NCX & OPF --> MESS pakiyabhai Workshop 2 12-22-2009 10:43 AM
DocBook XSL 1.74.0 adds ePub support! Alexander Turcic News 1 06-14-2008 07:06 AM


All times are GMT -4. The time now is 10:24 AM.


MobileRead.com is a privately owned, operated and funded community.