|
|
#1 |
|
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 644
Karma: 85520
Join Date: May 2021
Device: kindle
|
economic times india update
https://github.com/kovidgoyal/calibr...s_india.recipe
feed based, remove_tags and preprocess images Code:
remove_tags = [
classes(
'story_title storyCollection shareBar sr_widget_free jsSrWidgetFree srwidgetfree_3'
' sr_paid jsSrWidgetPaid ar_wrp arwd_ld_chk adBox custom_ad mgid orn_free_r bold'
),
]
Code:
def preprocess_html(self, soup):
for image in soup.findAll('img', attrs={'src': True}):
image['src'] = image['src'].replace("width-300", "width-640")
for img in soup.findAll('img', attrs={'data-original': True}):
img['src'] = img['data-original'].replace('photo', 'thumb').replace('quality-100', 'quality-100,width-600,resizemode-4')
return soup
https://github.com/kovidgoyal/calibr...edition.recipe |
|
|
|
![]() |
| Thread Tools | Search this Thread |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| The week(India) update | unkn0wn | Recipes | 2 | 06-29-2022 04:26 AM |
| New Recipe: Times of India | sskashish | Recipes | 1 | 06-05-2014 01:37 AM |
| Recipe Times of India Newspaper Mumbai Edition | honmore | Recipes | 0 | 02-01-2012 02:50 AM |
| Times Of India, DNA recipes broken? | mihirp | Recipes | 1 | 09-23-2011 04:09 PM |
| Recipe for hindustan times and India Today | agbpatro | Recipes | 1 | 09-11-2011 06:02 PM |