Old 12-01-2018, 12:57 PM   #1
xiatian
Join Date: Oct 2018
Device: kindle
parse_index for a .jpg link as an article URL

Hi,
I ran into a problem when using parse_index. Some of a website's sections are just pictures, i.e. direct .jpg links (like https://www.irishtimes.com/polopoly_fs/1.3520128.1528206115!/image/image.jpg_gen/derivatives/box_620_330/image.jpg ) rather than normal HTML web pages. If I still put such a link into the list of article dictionaries, like {'url': jpg_url}, it fails because the target is not an HTML tree that can be parsed. How can I work around this issue?
Thanks!
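
To make the situation concrete, here is a minimal sketch of the kind of index I mean. It is plain Python that runs without calibre and only shows the shape of the data (a list of section/article-list pairs, each article a dictionary with 'title' and 'url'). The helper names build_index and is_image_url, the section title "Pictures" and the article titles are placeholders I made up for illustration, not part of calibre, and the extension check is just one assumed way to spot such links.

```python
from urllib.parse import urlparse

# The example link from the post: a direct image, not an HTML page.
JPG_URL = (
    "https://www.irishtimes.com/polopoly_fs/1.3520128.1528206115!"
    "/image/image.jpg_gen/derivatives/box_620_330/image.jpg"
)

# Assumption: an article URL counts as "just a picture" if its path
# ends in one of these extensions.
IMAGE_EXTENSIONS = (".jpg", ".jpeg", ".png", ".gif")


def is_image_url(url):
    """Return True if the URL path ends in a common image extension."""
    return urlparse(url).path.lower().endswith(IMAGE_EXTENSIONS)


def build_index(section_title, urls):
    """Hypothetical helper: build a parse_index-style result,
    i.e. [(section title, [article dicts])]."""
    articles = [
        {"title": "Picture %d" % number, "url": url}
        for number, url in enumerate(urls, 1)
    ]
    return [(section_title, articles)]


index = build_index("Pictures", [JPG_URL])

# Report which entries point at an image instead of an HTML page.
for section, articles in index:
    for article in articles:
        kind = "image" if is_image_url(article["url"]) else "html"
        print(section, article["title"], kind)
```

Entries flagged as "image" here are the ones that cause the failure, so they are the ones that need some different handling.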