Quote:
Originally Posted by Barty
Don't quote me on this (haha), but try
Code:
preprocess_regexps = [
(re.compile(r"\\'", re.IGNORECASE | re.DOTALL), lambda match: "'")]
|
hi there.
You'd think that would work...
here's a piece of the web page source - i can't see why that wouldn't work
"caption": "<p>This American actress is setting tinseltown alight with her pretty looks and impressive acting ability. She rose to fame for her role in \'True Grit\' and was even nominated for an Academy Award and a BAFTA. The teen star is creating quite a splash in the fashion arena too, just recently her Miu Miu advert got banned for being \'irresponsible\'. Eek! We predict front row action at February\'s international fashion weeks</p>"