one pattern to rule them all would be good, except that you can construct so many pathological borderline cases of mid sentence breaks, with speech and punctuation marks stuff, that's it's probably impossible.
the use of full stops for other than sentence ends does help either.
e.g. I doubt any one automated rule could detect and fix a mid sentence break like
"Is that Mr.
Smith ? "
she said
It's been a while since I've wanted to fix up one of these - nowadays I just read well formed books - but my preference was to use 2 or 3 passes with different simple rules
e.g. one that focuses on para start checks, one that focused on on para ends, and maybe another one for speech issues.
I'm sure there's a big old thread somewhere, for borderline case solutions
|