Old 11-27-2014, 02:41 AM   #3
cyttorak

Thanks, kovidgoyal, but the solution turned out to be this:

Quote:
import re

preprocess_regexps = [
# replace the non-breaking space character (U+00A0) with a normal space
(re.compile(u'\xa0'), lambda match: ' '),
# replace the &nbsp; entity with a normal space
(re.compile(r'&nbsp;', re.DOTALL|re.IGNORECASE), lambda match: ' '),
# drop paragraphs that are empty or contain only whitespace
(re.compile(r'\s*<p[^>]*>\s*</p>\s*', re.DOTALL|re.IGNORECASE), lambda match: '')
]
I found it here: http://stackoverflow.com/questions/1...a0-from-string
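In case it helps anyone, here is a rough sketch of what those rules do to a page, run outside calibre on a made-up string. Two things are assumptions on my part: `apply_rules` is just a hypothetical helper that applies each (pattern, function) pair in order, which is how I understand calibre uses the list, and the second pattern is written as the literal `&nbsp;` entity.

```python
import re

# Same (pattern, replacement function) pairs as in the recipe.
# Assumption: the second pattern targets the literal "&nbsp;" entity.
preprocess_regexps = [
    (re.compile(u'\xa0'), lambda match: ' '),
    (re.compile(r'&nbsp;', re.DOTALL | re.IGNORECASE), lambda match: ' '),
    (re.compile(r'\s*<p[^>]*>\s*</p>\s*', re.DOTALL | re.IGNORECASE),
     lambda match: ''),
]


def apply_rules(html, rules=preprocess_regexps):
    # Hypothetical helper: apply every rule in list order, so the
    # empty-paragraph rule sees the text after the spaces are normalised.
    for pattern, func in rules:
        html = pattern.sub(func, html)
    return html


# Made-up sample: one U+00A0 character, one paragraph holding only
# an &nbsp; entity, and one upper-case entity.
sample = u'<p>Hello\xa0world</p>\n<p class="x">&nbsp;</p>\n<p>Next&NBSP;line</p>'
cleaned = apply_rules(sample)
print(cleaned)
```

The order matters: the paragraph that contains only `&nbsp;` becomes a whitespace-only paragraph after the second rule, and only then can the third rule remove it.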