View Single Post
Old 10-21-2022, 12:55 AM   #7
Tex2002ans
Wizard
Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.
 
Posts: 2,306
Karma: 13057279
Join Date: Jul 2012
Device: Kobo Forma, Nook
Quote:
Originally Posted by ColMac View Post
Wonder if someone can help an absolute beginner.

I have a number of books that contain many false paragraph breaks part way through a sentence.

An example of the type of code that appears is

</p>

<p class="calibre7">

This of course is the valid code for a correctly placed paragraph break in this book.

It normally appears with no space after the last word, and no space before the next word, and in most (I'm reluctant to say all) cases the last character before the error will be lower case. However, the first character of the following word may well be upper case when a name is used for example.
I have explained this exact "broken paragraphs" question many times over the years.

Here are 4 such topics where I go step-by-step and break down the Regular Expressions:

I even discussed it way back in:

(Of course, my newer methods are better and fix more things, but those regex are still useful to see/learn from.)

Last edited by Tex2002ans; 10-21-2022 at 01:42 AM.
Tex2002ans is offline   Reply With Quote