Thank you very much for your response!
I tried to follow your suggestions with whatever little knowledge I have, and it works well to some extent.
But I want to do the following:
1. I need the images related to the articles, which is not there right now. Like your 'The Hindu' recipe, I would like to place all the images at the beginning of the articles.
2. The articles show all the 'p' tags with 'span', which is understandable because I am directly feeding the XHR requests. I want to get rid of this.
3. I want to have more control on the json structure, I have no experience working with this. I tried to clean up things using 'remove_tags' and 'keep_only_tags', but got no luck yet.
4. And most importantly, I would like to know whether the whole process of getting the XHR links can be automated through the recipe itself or not.
You may also suggest some references, I would love to spend some time on this.
Thanks!
|