I was working on a book this morning and found that, after stripping out all the classes, styles and useless spans/divs, I was once again left with sentences broken up across paragraphs like this:
<p>this is</p>
<p>part of a</p>
<p>paragraph.</p>
<p> </p>
So I came up with:
FIND: ([a-z,’”.?!-])</p>\n\n\s\s<p>([a-z,A-Z“-])
REPLACE: \1 \2
\n = newline
\s = any single whitespace character (space, tab, etc.)
The empty <p> </p> paragraphs are not matched by this, so I just strip them out once all the paragraphs are back together.
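For anyone who wants to try this outside an editor's find/replace dialog, here is a minimal sketch using Python's standard re module. The FIND and REPLACE strings are the ones given above. The function names, the sample text and the EMPTY_P clean-up pattern are my own assumptions for illustration. The sample also assumes the file layout the pattern expects: a blank line after each </p>, then a two-space indent before the next <p>.

```python
import re

# FIND pattern from the post, unchanged. A trailing "-" inside [...] is a literal hyphen.
FIND = re.compile(r"([a-z,’”.?!-])</p>\n\n\s\s<p>([a-z,A-Z“-])")
REPLACE = r"\1 \2"

# Assumed clean-up step (not from the post): a paragraph holding only whitespace,
# together with any whitespace in front of it.
EMPTY_P = re.compile(r"\s*<p>\s*</p>")


def join_paragraphs(html: str) -> str:
    """Join a paragraph to the next one wherever FIND matches."""
    return FIND.sub(REPLACE, html)


def strip_empty_paragraphs(html: str) -> str:
    """Remove whitespace-only paragraphs left over after joining."""
    return EMPTY_P.sub("", html)


# Hypothetical sample in the assumed layout: blank line, then two-space indent.
sample = (
    "<p>this is</p>\n\n"
    "  <p>part of a</p>\n\n"
    "  <p>paragraph.</p>\n\n"
    "  <p> </p>\n"
)

joined = join_paragraphs(sample)
cleaned = strip_empty_paragraphs(joined)
print(cleaned)
```

One thing to watch: because the first character class includes . ? and !, the pattern also joins a paragraph that ends a sentence to the one after it when no empty paragraph sits between them, so it is worth checking the result before stripping the empty paragraphs.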