Quote:
Originally Posted by ddavtian
Hi guys.
I'd like to get my local newspaper into the reader but couldn't do it. I used get_feeds to get articles from rss page but not much luck. I'm only getting the first article with tons of unnecessary pages (tried different patterns, couldn't clean the text).
If anybody has some time, please take a look at this one feed ( http://feeds.contracostatimes.com/mn...571/200819.xml).
Thanks a lot in advance,
David
|
The attached script will download the "Most Viewed" feed. I have thus far been unable to capture more than the lead article from the other feeds. There is some subtle difference in them that is eluding me.
But in any event it shows you how to clean up the file so that you get rid of the extra garbage, including the embedded "Advertisement" block.