Originally Posted by SteffenH
Yes, of course, needless links. What is a recipe wizard?
That depends on which links you want to remove. If they are links on the RSS page itself, for instance a Gallery link full of photos that you don't want in the fetch, then you would simply do something along these lines.
Say you had this at http://blah.com/mypage.rss:
- Lovely article
- Another Beautiful article
- Gallery: Lovely photos of a dumpster
- More Stuff
- And More stuff
- Gallery: Dirt samples
In your code you could go through the links in the RSS feed and check each one: if the link doesn't contain "gallery" it is returned, otherwise it is skipped.
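As a rough illustration of that filter, here is a minimal standalone sketch. It assumes the feed entries are already available as plain dictionaries; the function name `keep_non_gallery`, the `title`/`link` keys and the item URLs are made up for the example, and the check is done case-insensitively on the title.

```python
def keep_non_gallery(entries, keyword="gallery"):
    """Return only the entries whose title does not mention the keyword."""
    needle = keyword.lower()
    return [entry for entry in entries if needle not in entry["title"].lower()]


# Hypothetical entries mirroring the example feed; the links are invented.
feed_entries = [
    {"title": "Lovely article", "link": "http://blah.com/1"},
    {"title": "Another Beautiful article", "link": "http://blah.com/2"},
    {"title": "Gallery: Lovely photos of a dumpster", "link": "http://blah.com/3"},
    {"title": "More Stuff", "link": "http://blah.com/4"},
    {"title": "And More stuff", "link": "http://blah.com/5"},
    {"title": "Gallery: Dirt samples", "link": "http://blah.com/6"},
]

kept = keep_non_gallery(feed_entries)
for entry in kept:
    print(entry["title"])
```

In a real recipe you would run the same test on whatever article objects your feed parser hands you, rather than on dictionaries like these.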
If it is actually links inside the articles themselves, for instance if you have
this little piggy went to the market. this little piggy stayed home. gallery: this little piggy's home
then you could do something along these lines. Find all the head and h2 tags in the soup (you will have to change that to suit your needs) and store them in weblinks. If any tags were found, move on to a for loop that checks each tag stored in weblinks by doing a regular expression search for "Gallery:" inside the link. If the search finds it, get rid of that tag. Then return the soup.
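The steps above can be sketched roughly as follows. Note this is not the soup-based code itself: to keep the example self-contained it uses the standard library's `xml.etree.ElementTree` as a stand-in for the soup, so it only works on well-formed markup. The function name `remove_gallery_tags`, the default of checking only `h2` tags, and the sample markup are assumptions for illustration.

```python
import re
import xml.etree.ElementTree as ET

# Case-insensitive search for the "Gallery:" marker.
GALLERY = re.compile(r"gallery:", re.IGNORECASE)


def remove_gallery_tags(markup, tag_names=("h2",)):
    """Drop every listed tag whose text contains 'Gallery:' and return the markup."""
    root = ET.fromstring(markup)
    for parent in list(root.iter()):
        for child in list(parent):
            text = "".join(child.itertext())
            if child.tag in tag_names and GALLERY.search(text):
                # Keep any text that followed the removed tag.
                index = list(parent).index(child)
                tail = child.tail or ""
                if index == 0:
                    parent.text = (parent.text or "") + tail
                else:
                    previous = parent[index - 1]
                    previous.tail = (previous.tail or "") + tail
                parent.remove(child)
    return ET.tostring(root, encoding="unicode")


# Hypothetical article body based on the example above.
article = (
    "<div>"
    "<h2>this little piggy went to the market</h2>"
    "<h2>this little piggy stayed home</h2>"
    "<h2>gallery: this little piggy's home</h2>"
    "</div>"
)

cleaned = remove_gallery_tags(article)
print(cleaned)
```

With a soup object the shape is the same: collect the candidate tags, test each one with the regular expression, remove the matches, and return the tree. Change `tag_names` to whichever tags actually wrap the gallery links in your articles.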