Quote:
Originally Posted by kovidgoyal
Many news websites lately are moving to using paywalls. Visit the website and check foryourself. It may just be that the website has changed and the recipe needs adjusting in calibre.
|
I'm having the same problem with the New Yorker. Calibre downloads the online version but many articles do not show the text only a link to the online edition (where, if you click on it you can see the whole article). I checked the current recipe and it includes the code for showing articles where there's a link. I think that's what this is:
def preprocess_html(self, soup):
for item in soup.findAll(style=True):
del item['style']
auth = soup.find(attrs={'id':'articleauthor'})
if auth:
alink = auth.find('a')
if alink and alink.string is not None:
txt = alink.string
alink.replaceWith(txt)
return soup
I'm not very comfortable editing code but would try it if there's something new that would work.
Many thanks for your help.