View Single Post
Old 08-16-2014, 06:39 AM   #2
knowledgecrawler
Member
knowledgecrawler began at the beginning.
 
Posts: 12
Karma: 10
Join Date: Aug 2014
Device: kindle
Quote:
Originally Posted by knowledgecrawler View Post
Hi,

Aim: Create a single ebook for news from multiple source.

Source: Hindu, pib.nic.in,ET,business line,business standard, downtoearth.. etc

challenge: Each of these sites have different layout, many have to parsed from index rather than rss.

I wish to run a single recipe for the whole of this task.
I am stuck with the a question "How to have preprocess / postpreocess_html for each source within a recipe? "
or if i can override the download and mangle procedure myself? (which classmethods to override?)

Thanks in advance..
Restated the problem..
knowledgecrawler is offline   Reply With Quote