View Single Post
Old 04-15-2011, 04:18 AM   #1
DarkElf
Junior Member
DarkElf began at the beginning.
 
Posts: 6
Karma: 10
Join Date: Apr 2011
Device: Kindle 3
skip_ad_pages & bad image links

Hi,
I have a problem with the skip_ad_pages method.
The feed I want to parse returns me a "wrong" article URL like
"http://bad/advertisement/page/story01.htm"
which refers to an advertisement page containing the right article URL like
"http://right/article/url/article.shtml"

I use the skip_ad_pages method to get the right page and it works except for img links in the real page.
Calibre prepend the wrong article URL to all the img tag which have "src" attribute like "path/to/image.jpg" so that the final image URL is
"http://bad/advertisement/page/path/to/image.jpg"
and not
"http://right/article/url/path/to/image.jpg"

This causes calibre fail when it tries fetching the image because it follows the wrong link.
Which is the best way to solve this?

Thankyou all in advance
DarkElf is offline   Reply With Quote