View Single Post
Old 04-01-2022, 02:46 AM   #5
unkn0wn
Guru
unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.
 
Posts: 624
Karma: 85520
Join Date: May 2021
Device: kindle
Quote:
masthead_url = 'https://epaper.hindustantimes.com//Images/logo/logo.png'
Quote:
extra_css = '''
button { display: none; }
#container { font-style: italic; font-size: 75%; }
'''
add the container part. The 'about author' part is taking huge space

Quote:
keep_only_tags = [
Spoiler:
dict(name='h1'),
dict(name='div', attrs={'class':'sortDec'}),
dict(name='picture'),
dict(name='figcaption'),

classes(' fullStory dateTime storyBy storyDetails detail freemiumText paywall'),
]
remove spoiler part and add fullStory to classes.

Quote:
remove_tags = [
classes('htsWrapper shareArticle new__newsletter__signup signup__box subscribe freemium-card adMinHeight313'
' storyTopics embed_div shareIcons close-btn mainCoachmarks sharePopUp'),
add mainCoachmarks and sharePopUP to remove tags

If there are Also read: links that are present within article text.. can we remap it to take it to that article (if its available) within the epub like it does in the economist.

Quote:
def canonicalize_internal_url(self, url, is_link=True):
if url.endswith('/print'):
url = url.rpartition('/')[0]
return BasicNewsRecipe.canonicalize_internal_url(self, url, is_link=is_link)
Above is from economist. Is this how we should do it?

Last edited by unkn0wn; 04-01-2022 at 03:53 AM.
unkn0wn is offline   Reply With Quote