Quote:
Originally Posted by TonytheBookworm
I would still like to know how to parse for links though if anyone can guide me
Reason I'd like to know is even on this page not all the feeds have feeds. More specifically have a look at
http://www.fieldandstream.com/blogs
and notice "The Wild Chef" takes you to feeds.feedburner.com and nothing else 
And the recipe blog was one of the main ones I wanted haha cause man gotta eat
|
Let's start at:
http://www.fieldandstream.com/blogs That page has links to feeds (as pointed out by dwanthny) and non-rss links (as in "The Wild Chef").
If all links were to rss feeds, you would use this page to manually get the feed links for your recipe, then the recipe would do all the work thereafter. Let's assume there are no RSS feeds. Then you would normally manually get all the other links from that page (and the title of the feed), and store them in a manually created dictionary of feed title and URL in your recipe. Each URL would be fed into
parse_index.
Each time one of those URLs was fed into parse_index, it would parse the page, find all article links and build a feed structure for the matching feed title/URL that would then be appended to the feed list and be passed back into the recipe. How you build the feed structure depends on the pages, but basically, you need:
'title' : article title,
'url' : URL of article
'date' : The publication date of the article as a string,
'description' : A summary of the article
I suggest you search the recipes for "parse_index." There are dozens of examples of how this is done.