View Single Post
Old 08-29-2010, 02:45 PM   #2555
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by TonytheBookworm View Post
I would still like to know how to parse for links though if anyone can guide me


Reason I'd like to know is even on this page not all the feeds have feeds. More specifically have a look at
http://www.fieldandstream.com/blogs
and notice "The Wild Chef" takes you to feeds.feedburner.com and nothing else
And the recipe blog was one of the main ones I wanted haha cause man gotta eat
Let's start at: http://www.fieldandstream.com/blogs That page has links to feeds (as pointed out by dwanthny) and non-rss links (as in "The Wild Chef").

If all links were to rss feeds, you would use this page to manually get the feed links for your recipe, then the recipe would do all the work thereafter. Let's assume there are no RSS feeds. Then you would normally manually get all the other links from that page (and the title of the feed), and store them in a manually created dictionary of feed title and URL in your recipe. Each URL would be fed into parse_index.

Each time one of those URLs was fed into parse_index, it would parse the page, find all article links and build a feed structure for the matching feed title/URL that would then be appended to the feed list and be passed back into the recipe. How you build the feed structure depends on the pages, but basically, you need:

'title' : article title,
'url' : URL of article
'date' : The publication date of the article as a string,
'description' : A summary of the article

I suggest you search the recipes for "parse_index." There are dozens of examples of how this is done.

Last edited by Starson17; 08-29-2010 at 02:49 PM.
Starson17 is offline