View Single Post
Old 02-23-2022, 05:17 AM   #1
unkn0wn
Guru
unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.unkn0wn understands the Henderson-Hasselbalch Equation.
 
Posts: 625
Karma: 85520
Join Date: May 2021
Device: kindle
Update Indian express

fixing some tags and removing unnecessary banners

https://github.com/kovidgoyal/calibr...00db42e010677d

Code:
remove_attributes = ['style','height','width']
    ignore_duplicate_articles = {'url'}
    
    keep_only_tags = [
        classes('heading-part full-details')
    ]
    remove_tags = [
        dict(name='nav', attrs={'class':'ie-breadcrumb'}),
        dict(name='div', attrs={'id':'ie_story_comments'}),
        dict(name='div', attrs={'class':['ie-int-campign-ad','custom_read_button','unitimg','copyright']}),
        dict(name='img', attrs={'src':'https://images.indianexpress.com/2021/06/explained-button-300-ie.jpeg'}),
        dict(name='a', attrs={'href':'https://indianexpress.com/section/explained/?utm_source=newbanner'}),
        dict(name='img', attrs={'src':'https://images.indianexpress.com/2021/06/opinion-button-300-ie.jpeg'}),
        dict(name='a', attrs={'href':'https://indianexpress.com/section/opinion/?utm_source=newbanner'}),
        classes('share-social appstext storytags pdsc-related-modify news-guard'),

Last edited by unkn0wn; 02-23-2022 at 05:20 AM.
unkn0wn is offline   Reply With Quote