View Single Post
Old 03-28-2010, 11:37 AM   #1668
olaf
Enthusiast
olaf is on a distinguished road
 
Posts: 43
Karma: 50
Join Date: May 2009
Device: Kindle3
I can not for the life of me figure out how to remove an image file at the top of each article of this newspaper. The image file has "Share - Larger Text - Smaller Text - Print" at the top of each article, pushing the main picture off to the next page and leaving the current page mostly blank. Any advice on how I get rid of that image? It seems to be embedded in code I can't seem to get at.

import string, re

class AdvancedUserRecipe1252944207(BasicNewsRecipe):
title = u'Telegram & Gazette'
oldest_article = 1
max_articles_per_feed = 50
timefmt = ''
no_stylesheets = True

keep_only_tags = [dict(id=['frontpage_section', 'articleWell', 'headline', 'subheadline', 'SuperHeading', 'byline', 'articleBody', 'zoom1'])]
remove_tags = [dict(id=['factBoxes'])]
preprocess_regexps = [(re.compile(r'<!-- This code displays columnist headshots: -->.*?<p>', re.DOTALL|re.IGNORECASE), lambda match: '')]
preprocess_regexps = [(re.compile(r'<div class="verdana11">.*?<!-- END ARTICLE COMMENTS -->', re.DOTALL|re.IGNORECASE), lambda match: '')]
encoding = 'cp1252'
remove_tags_after = [dict(id='leaderboardBot')]

feeds = [(u'Front Page News', u'http://www.telegram.com/apps/pbcs.dll/section?Category=RSS03&MIME=xml'),
(u'World & Regional', u'http://www.telegram.com/apps/pbcs.dll/section?Category=rss01&MIME=xml&profile=1052'),
(u'Living', u' http://www.telegram.com/apps/pbcs.dl...l&profile=1011'),
(u'Local News', u' http://www.telegram.com/apps/pbcs.dl...l&profile=1101'),
(u'Business', u'http://www.telegram.com/apps/pbcs.dll/section?Category=rss01&MIME=xml&profile=1002'),
(u'Opinion', u'http://www.telegram.com/apps/pbcs.dll/section?Category=rss01&MIME=xml&profile=1017'),
(u'Deaths', u'http://www.telegram.com/apps/pbcs.dll/section?Category=rss01&MIME=xml&profile=1001'),
(u'As I See It', u'http://www.telegram.com/apps/pbcs.dll/section?Category=rss01&MIME=xml&profile=1054')]
olaf is offline