Error messages often dont mean what they say. People rarely update error messages to cover every condition that can trigger the error.
Article.__init__ in feeds/__init__.py
"Garbage characters" doesn't mean much. The fact that the characters are not interpretable by a human does not neccessarily mean anything. lxml chokes on unicode strings that ascii control codes or null characters. Both those are taken care of already. If there is some other well defined character set that lxml chokes on, that is not valid in a unicode text string, then that can be added, though I suspect that is not the case.
|