I'd like to convert the documentation I read online into ebook format. For example:
Zend Framework Reference
I've had moderate success grabbing documentation from the Zend source code (simpler HTML than the public site), and running that through Calibre as single book (from a directory of files). However, for documentation that isn't available in a 'plain' HTML format, I'm not quite sure what to do.
It seems that I might be able to treat it as a new source, and write a recipe that parses an 'index' page for all the 'chapters' and then strips all but the actual content from the individual pages.
Has anyone done this kind of recipe? Am I on the right track?
Also, is there any way to get a recipe to generate a multi-level TOC (main TOC links to individual section TOCs)?