View Single Post
Old 11-13-2011, 08:02 AM   #17
NotTaken
Connoisseur
NotTaken is fluent in JavaScript as well as Klingon.NotTaken is fluent in JavaScript as well as Klingon.NotTaken is fluent in JavaScript as well as Klingon.NotTaken is fluent in JavaScript as well as Klingon.NotTaken is fluent in JavaScript as well as Klingon.NotTaken is fluent in JavaScript as well as Klingon.NotTaken is fluent in JavaScript as well as Klingon.NotTaken is fluent in JavaScript as well as Klingon.NotTaken is fluent in JavaScript as well as Klingon.NotTaken is fluent in JavaScript as well as Klingon.NotTaken is fluent in JavaScript as well as Klingon.
 
Posts: 65
Karma: 4640
Join Date: Aug 2011
Device: kindle
A few changes I made to the guardian recipe...

Made a few changes to the guardian recipe:
  • Removed the adverts that appeared at the bottom of some articles - found that there were two html sections in the soup, with all the relevant stuff in the first.
  • Removed some of the info under the headline. This includes: the author mugshot, "a version of this article appeared" spiel and the link to article history. I have put a comment by each of these in the remove_tags list to make it easy to re-enable if you choose.
  • Removed the number next to the ratings stars (that appear in reviews) - you will probably want to remove this if you disable the images (just remove the relevant stuff in preprocess_html)
Attached Files
File Type: zip guardian.recipe.zip (2.1 KB, 243 views)

Last edited by NotTaken; 11-13-2011 at 08:15 AM.
NotTaken is offline   Reply With Quote