Adding/extending the post from theducks
Quote:
Originally Posted by theducks
(?sm)</p>\s+(.+?)\s+<p>
Should work to remove things outside those tags
Don't try this on any copy you want to be usable after you are done,
BUT YOU WERE WARNED that there are other valid things between the closing </p> and the Next <p> that should not be removed: The list is big, so I am not wasting my time typing it.
|
Because it's highlighting/selecting the p tags, just change the replace to </p>\n<p>
Overall problem - The way I'd do it, is convert in calibre to txtz->textile output, the convert that back to epub.
keeping copies of original just in case.