Hello,
I started to attempt this myself, by I know from experience that I suck at python scripting (seems to be some sort of mental block).
Anyway, I am moving from a Palm TX to an iPad and need to move two site scrapers to calibre. Both are html page scrapes (not rss). Here they are:
http://www.macintouch.com/
I have been taking the main page and including the links to the reader reports.
http://www.theregister.co.uk/week.html
I would like to have this indexed by the dates on the page so the table of contents would have the dates with the articles as sub-titles (much like the way the one for the Calgary Herald works). It would also be great if it would also include the links that go to reghardware.com. This is definitely beyond my script-fu.
These both would be greatly appreciated and save me (literally) days of futzing around trying to learn python.
Thank you in advanced!