Quote:
Originally Posted by alansplace
i've run into a document that is full of broken paragraphs. in order to repair this problem i've tried to search for new paragraphs that begin with a lower case character, but have had no success as the regex expression ^[a-z] returns nothing.
|
This works for me in epp.
Code:
Find: ([^.”:?'!>—’)])</p>\s+<p>
Replace: \1 space
I search for paragraphs that do not end with a proper terminal followed by crlfs and a new paragraph.
Code:
<p>Now is the time</p>
<p>for all good men to come.</p>
becomes
Code:
<p>Now is the time for all good men to come.</p>