I don't think I ever used unicode in regular expressions. Did you just copy my code or did you try to replace the „“ chars in it with the ones copied from the source webpage?
Otherwise this variant might work better:
Code:
preprocess_regexps = [
(re.compile(r'„|“'),
lambda match: '"'),
]
Or you could post the whole recipe here, so I can test it.