Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Software > Calibre

Notices

Reply
 
Thread Tools Search this Thread
Old 10-27-2009, 05:51 PM   #1
tjlytle
Junior Member
tjlytle began at the beginning.
 
Posts: 5
Karma: 10
Join Date: Oct 2009
Device: Kindle DX
Converting Online Documentation

I'd like to convert the documentation I read online into ebook format. For example:

Zend Framework Reference
Doctrine Project

I've had moderate success grabbing documentation from the Zend source code (simpler HTML than the public site), and running that through Calibre as single book (from a directory of files). However, for documentation that isn't available in a 'plain' HTML format, I'm not quite sure what to do.

It seems that I might be able to treat it as a new source, and write a recipe that parses an 'index' page for all the 'chapters' and then strips all but the actual content from the individual pages.

Has anyone done this kind of recipe? Am I on the right track?

Also, is there any way to get a recipe to generate a multi-level TOC (main TOC links to individual section TOCs)?
tjlytle is offline   Reply With Quote
Old 10-27-2009, 06:49 PM   #2
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 26,436
Karma: 5383257
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
I would suggest writing a recipe to generate an EPUB file. Then unzip thye EPUB file and use the Automatic TOC creation options to create a multilevel TOC. recipe TOCS are limited to two levels
kovidgoyal is online now   Reply With Quote
Old 10-30-2009, 12:14 PM   #3
tjlytle
Junior Member
tjlytle began at the beginning.
 
Posts: 5
Karma: 10
Join Date: Oct 2009
Device: Kindle DX
I'll give that a shot. Another quick question - any references on converting docbook to ebook formats?
tjlytle is offline   Reply With Quote
Old 11-13-2009, 08:32 AM   #4
donald1
Enthusiast
donald1 began at the beginning.
 
Posts: 28
Karma: 10
Join Date: Nov 2009
Device: Sony PRS-600
Talking recipe converting online documentation

Quote:
Originally Posted by tjlytle View Post
I'd like to convert the documentation I read online into ebook format. For example:

Zend Framework Reference
Doctrine Project

I've had moderate success grabbing documentation from the Zend source code (simpler HTML than the public site), and running that through Calibre as single book (from a directory of files). However, for documentation that isn't available in a 'plain' HTML format, I'm not quite sure what to do.

It seems that I might be able to treat it as a new source, and write a recipe that parses an 'index' page for all the 'chapters' and then strips all but the actual content from the individual pages.

Has anyone done this kind of recipe? Am I on the right track?

Also, is there any way to get a recipe to generate a multi-level TOC (main TOC links to individual section TOCs)?
Did you write a recipe to convert online documentation ?
I could use this, it would be very helpfull.

Thanks for your reply in advance
donald1 is offline   Reply With Quote
Old 11-21-2009, 12:34 AM   #5
tjlytle
Junior Member
tjlytle began at the beginning.
 
Posts: 5
Karma: 10
Join Date: Oct 2009
Device: Kindle DX
Haven't had a chance to do anything yet. Got as far as reading documentation on how to use python in calibre to convert documentation so I could read it.

I'll post if/when I get somethign done.
tjlytle is offline   Reply With Quote
Old 09-05-2010, 02:39 AM   #6
tjlytle
Junior Member
tjlytle began at the beginning.
 
Posts: 5
Karma: 10
Join Date: Oct 2009
Device: Kindle DX
Refreshing this old thread - hope that's not taboo. Finally got a chance to read up on python and modify a recipe. Here's a simple one that grabs the Doctrine2 documentation.

Now I'm wondering how to make calibre format this as a normal book and not a collection of 'articles'. While the article navigation makes sense for real periodicals/blogs/news feeds, I'd prefer a standard TOC for converted documentation.

Last edited by tjlytle; 09-05-2010 at 02:40 AM. Reason: added reasoning
tjlytle is offline   Reply With Quote
Old 09-05-2010, 12:13 PM   #7
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 26,436
Karma: 5383257
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
yes you can override the method in basicnewsrecipe that creates the OPF and NCX files and create any kind of TOC you like.
kovidgoyal is online now   Reply With Quote
Old 09-05-2010, 06:01 PM   #8
tjlytle
Junior Member
tjlytle began at the beginning.
 
Posts: 5
Karma: 10
Join Date: Oct 2009
Device: Kindle DX
Quote:
Originally Posted by kovidgoyal View Post
yes you can override the method in basicnewsrecipe that creates the OPF and NCX files and create any kind of TOC you like.
Is there a recipe that illustrates that? I see that the 'publication_type' is set to 'periodical:' (plus newspaper/magazine/blog). Is it as easy as changing that?

Sorry for what probably are simple questions, I really did check out the 'create_opf' method - it's all just a bit new to me.
tjlytle is offline   Reply With Quote
Old 09-05-2010, 06:26 PM   #9
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 26,436
Karma: 5383257
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
No publication_type only sets a single field in the OPF. You have to override create_opf
kovidgoyal is online now   Reply With Quote
Reply

Tags
conversion, recipe, website

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Documentation for Sigil Valloric Sigil 15 11-12-2010 07:37 PM
Converting online html ebook with Calibre andavane Calibre 6 10-04-2009 05:17 PM
Technical Documentation Gremlin HanLin eBook 4 07-16-2009 03:24 AM
Documentation for FBReader MishaS Reading and Management 5 07-29-2008 02:00 PM
More Detailed Documentation? TedPark Sony Reader Dev Corner 3 06-10-2008 10:06 PM


All times are GMT -4. The time now is 10:51 AM.


MobileRead.com is a privately owned, operated and funded community.