Need to create custom recipe for downloading from different source
Hi,
Aim: Create a single ebook for news from multiple source.
Source: Hindu, pib.nic.in,ET,business line,business standard, downtoearth.. etc
challenge: Each of these sites have different layout, many have to parsed from index rather than rss.
I wish to run a single recipe for the whole of this task.
I am stuck with the a question "How to have preprocess / postpreocess_html for each source within a recipe? "
or if i can override the download and mangle procedure myself? (which classmethods to override?)
Thanks in advance..
Last edited by knowledgecrawler; 08-16-2014 at 06:38 AM.
Reason: restating the issue at hand
|