View Single Post
Old 08-23-2011, 06:25 PM   #4
crutledge
eBook FANatic
crutledge ought to be getting tired of karma fortunes by now.crutledge ought to be getting tired of karma fortunes by now.crutledge ought to be getting tired of karma fortunes by now.crutledge ought to be getting tired of karma fortunes by now.crutledge ought to be getting tired of karma fortunes by now.crutledge ought to be getting tired of karma fortunes by now.crutledge ought to be getting tired of karma fortunes by now.crutledge ought to be getting tired of karma fortunes by now.crutledge ought to be getting tired of karma fortunes by now.crutledge ought to be getting tired of karma fortunes by now.crutledge ought to be getting tired of karma fortunes by now.
 
crutledge's Avatar
 
Posts: 18,301
Karma: 16078357
Join Date: Apr 2008
Location: Alabama, USA
Device: HP ipac RX5915 Wife's Kindle
Quote:
Originally Posted by alansplace View Post
i've run into a document that is full of broken paragraphs. in order to repair this problem i've tried to search for new paragraphs that begin with a lower case character, but have had no success as the regex expression ^[a-z] returns nothing.
This works for me in epp.

Code:
Find: ([^.”:?'!>—’)])</p>\s+<p>

Replace: \1 space
I search for paragraphs that do not end with a proper terminal followed by crlfs and a new paragraph.

Code:
<p>Now is the time</p>
<p>for all good men to come.</p>
becomes
Code:
<p>Now is the time for all good men to come.</p>
crutledge is offline   Reply With Quote