Business Standard update
Made some of changes..
reduced oldest article to 1
changes to tags (presently every article loads some kind of 'dear readers' letter box)
cover_url
feeds
removed conversion_options.. (keep it maybe)
Today's paper articles don't load wholly .. they are to be pre_process'ed to parse json
just saw that there's this text in every article: (Only the headline and picture of this report may have been reworked by the Business Standard staff; the rest of the content is auto-generated from a syndicated feed.)
add this in remove_tags dict(name='p', attrs={'id':'auto_disclaimer'}),
Last edited by unkn0wn; 04-20-2022 at 08:56 AM.
|