That's more or less what I used. The article itself is extracted fine; the problem is the picture within the article. It's a normal JPG, but it still fails to be included. I also tried Bookit to get the whole page, but it fails to include the article's picture too.
e.g.:
http://diepresse.com/home/panorama/r...ex.do?from=rss
There is a picture of the pope in there, but no picture is included in the final ebook.
Code:
remove_tags_before = dict(id='content')
remove_tags_after = dict(id='content')
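
One thing that might be worth checking (an assumption, not something confirmed for this page) is whether the article's <img> actually sits inside the element with id='content', since those two settings cut everything before and after that element. Below is a rough, standard-library-only sketch of such a check. The names ImageLocator and locate_images and the sample HTML are made up for illustration and are not part of calibre; the depth counting also assumes reasonably well-formed HTML, so unclosed tags on a real page could throw it off.

```python
from html.parser import HTMLParser

# Void elements never get a closing tag, so they must not change nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class ImageLocator(HTMLParser):
    """Hypothetical helper: sorts <img> tags by whether they are inside a container."""

    def __init__(self, container_id="content"):
        super().__init__()
        self.container_id = container_id
        self.depth = 0      # nesting depth inside the container; 0 means outside
        self.inside = []    # src values of images inside the container
        self.outside = []   # src values of images everywhere else

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "img":
            target = self.inside if self.depth > 0 else self.outside
            target.append(attrs.get("src"))
            return
        if tag in VOID_TAGS:
            return
        if self.depth > 0:
            self.depth += 1
        elif attrs.get("id") == self.container_id:
            self.depth = 1

    def handle_endtag(self, tag):
        if tag not in VOID_TAGS and self.depth > 0:
            self.depth -= 1


def locate_images(html, container_id="content"):
    parser = ImageLocator(container_id)
    parser.feed(html)
    parser.close()
    return parser.inside, parser.outside


# Made-up sample page, not the real article markup.
sample = """
<html><body>
<div id="header"><img src="logo.png"></div>
<div id="content"><h1>Title</h1><p>Text<br>more</p><img src="pope.jpg"></div>
<div id="footer"><img src="ad.gif"></div>
</body></html>
"""

inside, outside = locate_images(sample)
print("inside content:", inside)
print("outside content:", outside)
```

If the picture turns up in the "outside" list for the real page, the id used in the recipe would need to point at an element that also wraps the image.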