View Single Post
Old 12-23-2019, 09:18 PM   #939
theducks
Well trained by Cats
theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.
 
theducks's Avatar
 
Posts: 31,167
Karma: 60406498
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
Quote:
Originally Posted by snarkophilus View Post
Great idea, but I can see this getting hairy quickly! Some books use lower case chapter names, so an algorithm that was smart enough to pick lower case letters at the start of a paragraph style instead of a chapter name style would be nice. Maybe something that counts the number of time a style was used? I've also seen cases where a paragraph that finishes with a lower case letter or a comma and the next starts with an upperc case character are still "bad breaks".

I'm sure there's someone in the Sigil world who has built up a fancy regex to find many of these. (Quick search...) There are some examples here, here, here and here.

Definitely a handy one if it could be implemented.
I have about 5 or 6 that I use in Sigil. Only 2, do I ever run using the 'All" button. The rest I step thru (it is still a very fast operation) , some times I do skip the replace .
There are still EXCEPTIONS . (lots of Publishers Boilerplate should not be touched).
I am currently reading a book that has a acronym that starts with a lower case letter.
A.M. or P.M. will fail. (One of my searches dos fix Mr. Mrs. ... splits )
Still, Nothing beats the human eyeball for spotting errors
theducks is offline   Reply With Quote