Originally Posted by sdow1
I created a (fairly basic) recipe for Vanity Fair, and it seems to work pretty well as far as getting the full article content without "too much" extraneous stuff, but I would love it if someone who is better at this than I am wanted to run with it (i.e., adding covers, cleaning it up further, etc.). (Note: I also don't have the fourth VF RSS feed, for their Soccer Blog, in here because I had no interest in it, but it obviously might be of interest to a more general audience.)
Use remove_tags.
For instance, to get rid of the print options at the top, add a remove_tags entry to your recipe. I always put it before the feed section, but you can put it pretty much anywhere inside the class block; just make sure your indents are correct.
If you inspect the page with Firebug in Firefox, you can see that the element you wish to remove is
<div id="printoptions">, so a remove_tags entry that matches a div with that id will get rid of it.
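Here is a minimal sketch of what that could look like inside the recipe. The class name and title are placeholders for whatever your recipe already uses, and the try/except stand-in is only there so the snippet can be read and run outside calibre; inside calibre the real BasicNewsRecipe is imported.

```python
try:
    from calibre.web.feeds.news import BasicNewsRecipe
except ImportError:
    # Stand-in so this sketch can be imported outside calibre (illustration only).
    class BasicNewsRecipe(object):
        pass


class VanityFair(BasicNewsRecipe):  # placeholder class name
    title = 'Vanity Fair'

    # Each dict describes a tag to strip from every downloaded article.
    # This one matches <div id="printoptions">.
    remove_tags = [
        dict(name='div', attrs={'id': 'printoptions'}),
    ]

    # feeds = [...]  # your existing feed list stays here, unchanged
```

To remove more elements, add more dict(...) entries to the same list, one per element you spot in Firebug.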
As for the cover, it depends on what cover you wish to use. Again, use Firefox to figure out which element of the article (soup) you want to use as your image source. For instance, let's say our cover is in the
<div class="spread-image">; we would do something like override get_cover_url in the recipe, find that div in the soup, and return the src of the image inside it.
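A rough sketch of that idea follows. It assumes the cover image is an img tag inside the spread-image div, and that the div appears on the page named in cover_page; both cover_page and the helper find_cover_src are made-up names for illustration, and the URL is a guess you should replace with the page you actually inspected. The try/except stand-in only exists so the snippet runs outside calibre.

```python
try:
    from calibre.web.feeds.news import BasicNewsRecipe
except ImportError:
    # Stand-in so this sketch can be imported outside calibre (illustration only).
    class BasicNewsRecipe(object):
        pass


def find_cover_src(soup):
    """Return the src of the first <img> inside <div class="spread-image">, or None."""
    div = soup.find('div', attrs={'class': 'spread-image'})
    if div is None:
        return None
    img = div.find('img')
    if img is None:
        return None
    return img.get('src')


class VanityFair(BasicNewsRecipe):  # placeholder class name
    title = 'Vanity Fair'

    # Assumption: the page whose markup contains the cover image.
    cover_page = 'http://www.vanityfair.com/'

    def get_cover_url(self):
        # index_to_soup downloads the page and parses it into a soup object.
        soup = self.index_to_soup(self.cover_page)
        return find_cover_src(soup)
```

If the src you get back is a relative path, you would need to prepend the site address before returning it. Returning None just means no cover was found.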
If, however, you just want a static cover (one that never changes), then simply put the following in the class:
cover_url = 'PUT THE URL TO THE IMAGE HERE'
and that's it.
Good luck, and let me know if you need any help. Just post your code and indicate where you seem to be having issues. Also, wrap your code in the forum's code tags when you post it: put [code] before it and [/code] after it. This will keep the thread cleaner and keep the formatting correct, because Python is picky about indents.