Find this NOT that
I'm trying to do a search but a narrow one. Basically I've converted some PDFs to ePub but some paragraphs are broken up say one ends with half a sentence and the other paragraph continues on with the sentence.
I want to do search for any 2 characters and </p> but don't find ."</p>, .</p>, ?</p>, ."</p>, ?"</p>, !</p>, !"</p> as those should be proper sentence enders.
Right now I have [^.,^\?,^\!][a-z,A-Z,”,\,, ,+]</p> and it seems to work but is there a simpler way of doing this?
|