There are many examples of recipes that deal with responsive images.
First you need to remove auto_cleanup and set
use_embedded_content = False
Then examine the downloaded html, find the responsive images, which typically have something like a data-src or srcset attribute with the real image url and dummy src attribute. Once you understand the structure, write preprocess_html to fix the images. See for example the preprocess_html function in the CNN or National Geographic recipes.
|