Hi, I'd like to propose the following additions to the Internazionale recipe (this) so that some text and images after the article are removed. They do not belong to the article; they only point to other articles.
In "class Volkskrant(BasicNewsRecipe):" I propose adding the classes marked with comments:
Code:
    remove_tags = [
        dict(
            attrs={
                'class': [
                    'item-banner',
                    'hentryfeed__side',
                    'magazine-article-share-tools',
                    'magazine-article-share-popup',
                    'article_next',
                    'cta_nl_ext_container',
                    'article_others_authors',  # Remove links to other articles at the bottom of the article
                    'item_note2',  # Remove 'lettere' and 'numero' at the bottom of the article
                    'article_spotlight',  # Remove 'Da non perdere' at the end of the article
                ]
            }
        ),
        dict(name=['script', 'style']),
    ]
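To illustrate what the three new entries are meant to do, here is a minimal standalone sketch. It is not calibre's actual code: it only assumes that an element is dropped when any of its whitespace-separated class names appears in the list, which is how I understand the 'class' match to work. The helper name would_be_removed and the class names 'wide' and 'article_body' are made up for the example.

```python
# The three classes proposed for removal in this post.
REMOVED_CLASSES = {
    'article_others_authors',
    'item_note2',
    'article_spotlight',
}


def would_be_removed(class_attr):
    """Return True if any class token of an element is in REMOVED_CLASSES.

    class_attr is the raw value of an HTML class attribute,
    e.g. 'article_spotlight wide' (hypothetical).
    """
    return any(token in REMOVED_CLASSES for token in class_attr.split())


# Hypothetical elements: the first carries one of the listed classes,
# the second does not.
print(would_be_removed('article_spotlight wide'))
print(would_be_removed('article_body'))
```

Under that assumption, the block at the end of the article with class 'article_spotlight' would be stripped, while the article body itself is left alone.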
Thank you