I recommend using the editor to find split paragraphs and extra line feeds. This is the basic search and replace I use:
Search
Code:
</p>\s*<p[^>]+>([a-z])
replace:
Note the space at the beginning of the replace expression.
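To see what this search does outside the editor, here is a minimal Python sketch. The replace expression is not shown above, so the ` \1` used here (a space followed by the captured lowercase letter) is only my assumption based on the note about the leading space. The helper name join_split_paragraphs and the sample markup are made up for illustration.

```python
import re

# Search pattern from the post: a closing </p>, optional whitespace, then an
# opening <p ...> tag whose text starts with a lowercase letter.
SPLIT_PARAGRAPH = re.compile(r'</p>\s*<p[^>]+>([a-z])')


def join_split_paragraphs(html: str) -> str:
    """Join a paragraph to the previous one when it starts in lowercase."""
    # ASSUMPTION: the replacement is a space plus the captured letter.
    return SPLIT_PARAGRAPH.sub(r' \1', html)


# Hypothetical sample: one sentence split across two paragraphs.
sample = '<p class="calibre1">The quick brown</p>\n<p class="calibre1">fox jumps.</p>'
joined = join_split_paragraphs(sample)
print(joined)
```

Because the pattern uses `<p[^>]+>`, it only matches opening tags that carry at least one attribute; a bare `<p>` is left alone. Paragraphs that start with a capital letter are also left untouched.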
For extra line feeds I use a regex-function.
search:
Code:
<p class="(.*?)">(.*?)</p>|<div class="(.*?)">(.*?)</div>
regex-function:
Code:
def replace(match, number, file_name, metadata, dictionaries, data, functions, *args, **kwargs):
    # Replace every line feed inside the matched <p> or <div> with a space.
    return match.group().replace('\n', ' ')
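Here is a rough way to try the same function on a string with plain Python. It is only a sketch: the re.DOTALL flag is my assumption so that `.` can cross the line feeds inside a paragraph, and the extra arguments passed to replace (1, 'sample.html' and the None values) are placeholders standing in for what the editor would supply.

```python
import re

# Search pattern from the post. ASSUMPTION: re.DOTALL is set so that "."
# also matches the line feeds we want to remove.
BLOCK = re.compile(
    r'<p class="(.*?)">(.*?)</p>|<div class="(.*?)">(.*?)</div>',
    re.DOTALL,
)


def replace(match, number, file_name, metadata, dictionaries, data, functions, *args, **kwargs):
    # Replace every line feed inside the matched <p> or <div> with a space.
    return match.group().replace('\n', ' ')


# Hypothetical sample: the first paragraph contains a stray line feed.
sample = '<p class="calibre1">one line\nbroken in two</p>\n<p class="calibre1">next</p>'

# re.sub passes only the match object, so the remaining arguments are
# filled with placeholder values here.
cleaned = BLOCK.sub(
    lambda m: replace(m, 1, 'sample.html', None, None, None, None),
    sample,
)
print(cleaned)
```

Only line feeds inside a matched paragraph or div are replaced; the line feed between the two paragraphs stays as it is.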
bernie