Old 08-15-2014, 01:56 PM   #1
knowledgecrawler
Member
Posts: 12
Karma: 10
Join Date: Aug 2014
Device: kindle
Need to create a custom recipe for downloading from different sources

Hi,

Aim: Create a single ebook of news from multiple sources.

Sources: The Hindu, pib.nic.in, ET, Business Line, Business Standard, Down To Earth, etc.

Challenge: Each of these sites has a different layout, and many have to be parsed from an index page rather than from RSS.

I would like to run a single recipe for the whole task.
I am stuck on this question: "How can I have a separate preprocess_html / postprocess_html for each source within one recipe?"
Or can I override the download and mangle procedure myself? (Which class methods should I override?)
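
To make the question concrete, here is a rough sketch of the per-source dispatch I am picturing. It is plain Python, not a working recipe: the hostnames are placeholders, and clean_hindu, clean_pib, CLEANERS and clean_for_source are hypothetical names I made up for illustration. My assumption is that something like this could be called from a recipe hook that receives the article URL (preprocess_raw_html(raw_html, url) looks like a candidate), but I have not verified that this is the right place.

```python
from urllib.parse import urlparse


def clean_hindu(html):
    # Hypothetical cleanup for one source: drop a made-up ad marker.
    return html.replace('<!--ad-->', '')


def clean_pib(html):
    # Hypothetical cleanup for another source: trim surrounding whitespace.
    return html.strip()


# Map each source's hostname to its own cleanup function.
# The hostnames here are illustrative placeholders.
CLEANERS = {
    'www.thehindu.com': clean_hindu,
    'pib.nic.in': clean_pib,
}


def clean_for_source(html, url):
    """Pick the cleanup function from the URL's hostname.

    HTML from hosts that are not in CLEANERS is returned unchanged.
    """
    host = urlparse(url).netloc.lower()
    cleaner = CLEANERS.get(host)
    return cleaner(html) if cleaner else html
```

The idea is that each source only needs one entry in the table, so adding a new site would not touch the logic for the others.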

Thanks in advance.

Last edited by knowledgecrawler; 08-16-2014 at 06:38 AM. Reason: restating the issue at hand