Thread: web2lrf
View Single Post
Old 03-14-2008, 01:01 AM   #199
Deputy-Dawg
Groupie
Deputy-Dawg has learned how to read e-booksDeputy-Dawg has learned how to read e-booksDeputy-Dawg has learned how to read e-booksDeputy-Dawg has learned how to read e-booksDeputy-Dawg has learned how to read e-booksDeputy-Dawg has learned how to read e-booksDeputy-Dawg has learned how to read e-books
 
Deputy-Dawg's Avatar
 
Posts: 153
Karma: 799
Join Date: Dec 2007
Device: sony prs505
Quote:
Originally Posted by ddavtian View Post
Hi guys.

I'd like to get my local newspaper into the reader but couldn't do it. I used get_feeds to get articles from rss page but not much luck. I'm only getting the first article with tons of unnecessary pages (tried different patterns, couldn't clean the text).

If anybody has some time, please take a look at this one feed (http://feeds.contracostatimes.com/mn...571/200819.xml).

Thanks a lot in advance,
David
The attached script will download the "Most Viewed" feed. I have thus far been unable to capture more than the lead article from the other feeds. There is some subtle difference in them that is eluding me.

But in any event it shows you how to clean up the file so that you get rid of the extra garbage, including the embedded "Advertisement" block.
Attached Files
File Type: zip C_Costa.py.zip (1.1 KB, 423 views)
Deputy-Dawg is offline   Reply With Quote