Quote:
Originally Posted by kovidgoyal
|
Thanks Kovid! You are awesome!
The only hiccup is that the file is over 60 MB (too big to automatically email to my kindle)
I figured all the images were probably the issue. I wasn't sure how to reduce the number of articles it downloaded so I just deleted what I thought is the image part:
Quote:
def preprocess_html(self, soup):
for img in soup.findAll('img', attrs={'data-baseurl': True}):
img['src'] = img['data-baseurl']
return soup
|
It is now under 2 MBs and it transferred to my kindle just fine. Thanks again for your help!