Okay. I see your problem.
The return value of parse_index(self) has to look like this:
Code:
[
    ('title', [
        {'title': ..., 'url': ..., 'description': ..., 'date': ...},
        # more dictionaries as above ...
    ]),
    # more tuples like the one above, one per genre
]
Each url has to point to an HTML page.
Each of these pages is then downloaded, and remove_tags and the other cleanup settings of the recipe are applied to it, resulting in a cleaned HTML page.
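To make the structure above concrete, here is a minimal sketch in plain Python that builds such a list outside of any recipe. The helper name build_index, the section names and the example.com URLs are all made up for illustration; inside a recipe, parse_index(self) would return a list of this shape.

```python
def build_index(sections):
    """Convert {section_title: [(article_title, url), ...]} into a list of
    (section_title, [article_dict, ...]) tuples.

    build_index is a hypothetical helper, not part of any library.
    """
    index = []
    for section_title, articles in sections.items():
        entries = [
            # description and date are left empty in this sketch
            {'title': title, 'url': url, 'description': '', 'date': ''}
            for title, url in articles
        ]
        index.append((section_title, entries))
    return index


# Made-up sample data: every url points to an HTML page.
sections = {
    'News': [('First article', 'http://example.com/news/1.html')],
    'Sport': [
        ('Match report', 'http://example.com/sport/1.html'),
        ('Interview', 'http://example.com/sport/2.html'),
    ],
}

index = build_index(sections)
for section_title, entries in index:
    print(section_title, [entry['url'] for entry in entries])
```

Each tuple becomes one section, and each dictionary becomes one article whose url is fetched and cleaned.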
A working example would be: