skip_ad_pages & bad image links
Hi,
I have a problem with the skip_ad_pages method.
The feed I want to parse returns me a "wrong" article URL like
"http://bad/advertisement/page/story01.htm"
which refers to an advertisement page containing the right article URL like
"http://right/article/url/article.shtml"
I use the skip_ad_pages method to get the right page and it works except for img links in the real page.
Calibre prepend the wrong article URL to all the img tag which have "src" attribute like "path/to/image.jpg" so that the final image URL is
"http://bad/advertisement/page/path/to/image.jpg"
and not
"http://right/article/url/path/to/image.jpg"
This causes calibre fail when it tries fetching the image because it follows the wrong link.
Which is the best way to solve this?
Thankyou all in advance
|