Old 10-18-2022, 08:46 AM   #1
ColMac
Join Date: Apr 2012
Device: Kindle Fire
False paragraph breaks & RegEx

I wonder if someone can help an absolute beginner.

I have a number of books that contain many false paragraph breaks part way through a sentence.

An example of the type of code that appears in the middle of a sentence is:

</p>

<p class="calibre7">

This is, of course, also the valid code for a correctly placed paragraph break in this book, so it cannot simply be removed everywhere.

It normally appears with no space after the last word and no space before the next word, and in most (I'm reluctant to say all) cases the last character before the false break is lower case. However, the first character of the following word may well be upper case, for example when it is a name.

Would there be a way to search for that code (the class name calibre7 would presumably need to be treated as a variable) wherever it does not follow one of the common sentence-ending punctuation marks?
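
To make the idea concrete, here is a rough sketch in Python of the kind of check I have in mind. It is only an assumption about how this might be done, not a tested solution: it treats a break as false when the character before </p> is a lower-case letter or a comma, it accepts any attributes on the following <p> tag (so calibre7 is not hard-coded), and it joins the two halves with a single space. The function name join_false_breaks and the sample text are made up for illustration.

```python
import re

# Assumption: a false break is a lower-case letter or comma, then </p>,
# optional whitespace, then an opening <p> tag with any attributes.
# The lookbehind keeps the preceding character out of the match.
FALSE_BREAK = re.compile(r'(?<=[a-z,])\s*</p>\s*<p\b[^>]*>\s*')


def join_false_breaks(html: str) -> str:
    """Replace each suspected false paragraph break with a single space."""
    return FALSE_BREAK.sub(' ', html)


# Hypothetical sample: the first break splits a sentence, the second is genuine.
sample = (
    '<p class="calibre7">He walked to the</p>\n\n'
    '<p class="calibre7">station with Anna.</p>\n\n'
    '<p class="calibre7">A new paragraph.</p>'
)

print(join_false_breaks(sample))
```

Breaks that follow a full stop, question mark, closing quote and so on are left alone because the lookbehind only accepts a lower-case letter or a comma. A real paragraph that ends without punctuation would be joined by mistake, so each match would still need checking by eye.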

Thanks in hope, as this is way beyond my skills.

Colin