View Single Post
Old 12-23-2019, 06:31 PM   #938
snarkophilus
Wannabe Connoisseur
snarkophilus ought to be getting tired of karma fortunes by now.snarkophilus ought to be getting tired of karma fortunes by now.snarkophilus ought to be getting tired of karma fortunes by now.snarkophilus ought to be getting tired of karma fortunes by now.snarkophilus ought to be getting tired of karma fortunes by now.snarkophilus ought to be getting tired of karma fortunes by now.snarkophilus ought to be getting tired of karma fortunes by now.snarkophilus ought to be getting tired of karma fortunes by now.snarkophilus ought to be getting tired of karma fortunes by now.snarkophilus ought to be getting tired of karma fortunes by now.snarkophilus ought to be getting tired of karma fortunes by now.
 
Posts: 426
Karma: 2516674
Join Date: Apr 2011
Location: Geelong, Australia
Device: Kobo Libra 2, Kobo Aura 2, Sony PRS-T1, Sony PRS-350, Palm TX
Quote:
Originally Posted by kboogie222 View Post
USER STATEMENT
As a Calibre user, I would like to be able to detect books with "Bad Breaks" so that I can repair or replace them with more readable versions.
Great idea, but I can see this getting hairy quickly! Some books use lower case chapter names, so an algorithm that was smart enough to pick lower case letters at the start of a paragraph style instead of a chapter name style would be nice. Maybe something that counts the number of time a style was used? I've also seen cases where a paragraph that finishes with a lower case letter or a comma and the next starts with an upperc case character are still "bad breaks".

I'm sure there's someone in the Sigil world who has built up a fancy regex to find many of these. (Quick search...) There are some examples here, here, here and here.

Definitely a handy one if it could be implemented.
snarkophilus is offline   Reply With Quote