View Single Post
Old 03-05-2018, 11:11 PM   #5
snji
Junior Member
snji began at the beginning.
 
Posts: 7
Karma: 10
Join Date: Feb 2018
Device: Kindle Paperwhite
Quote:
Originally Posted by bobbysteel View Post
to test edit the recipe where the line has index_to_soup and a url with 'todays-paper' (i think 63) change that URL to the date you want
like
http://www.thehindu.com/archive/print/2018/03/01/
Hi, Thanks for the suggestion. I tried the modification as follows

Code:
# Modified on 2018Mar06 To download specific date paper        
        soup = self.index_to_soup('http://www.thehindu.com/archive/print/2018/03/02/')
        #soup = self.index_to_soup('http://www.thehindu.com/todays-paper/')
        nav_div = soup.find(id='subnav-tpbar-latest')
        section_list = []
But Calibre is not downloading any article with the modification.
I have compared code of pages on 'todays-paper' and 'print/2018/03/02/' & it looks like the id = subnav-tpbar-latest contains complete URL in first case but only links in 2nd case which is causing Calibre to not able to find any article

I have attached Calibre logs with relevant screenshots. Requesting your further help in clarification if possible . Thanks
Attached Thumbnails
Click image for larger version

Name:	Calibre_01.png
Views:	273
Size:	69.3 KB
ID:	162666   Click image for larger version

Name:	Calibre_02.png
Views:	254
Size:	35.8 KB
ID:	162667  
Attached Files
File Type: txt Calibre_03.txt (4.3 KB, 245 views)
snji is offline   Reply With Quote