![]() |
#1 |
Enthusiast
![]() Posts: 33
Karma: 10
Join Date: Sep 2017
Location: Royston, Georgia
Device: Moon Reader & Prestigio on Android
|
Plugin that can detect broken sentences
Does anyone know if there is a plugin that can detect broken sentences within the text structure of an epub? When I say 'broken sentences', I mean a sentence that breaks off on one line and picks up two lines below.
FOR EXAMPLE: [He soon realized that they did not have any past experiences or original thoughts and ideas to share with Him. Nor did they show any signs of wanting to learn more beyond the music they loved and were so proficient at.] ** I'm don't expect any plugin to fix this type of situation. I'm just looking for a way to scan and detect such books, because I have a lot of old epubs that were scanned many years ago, thus creating the problem, but other than that, all the text, indentations, etc. are still fine and readable, and I hate to throw them out. Perhaps a plugin like 'Quality Check' could add that ability to it's list of epub structure items that it scans for. All thoughts are welcome. Last edited by Frizzell; 03-24-2019 at 07:49 PM. |
![]() |
![]() |
![]() |
#2 |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 30,880
Karma: 59840450
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
I have a number of REGEX (Sigil saved searches below)
Code:
71\Name=Cleanup/Joins/Join to lower 71\Find="([[:alpha:],][\"\xe2\x80\x9d]*)</p>\\s*<p\\b[^>]*>([a-z\xe2\x80\x9c\"])" 71\Replace=\\1 \\2 72\Name=Cleanup/Joins/Join to upper 72\Find="([[:alpha:],]\xe2\x80\x9d*)</p>\\s*<p\\b[^>]*>([\"\xe2\x80\x9c]*[A-Z])" 72\Replace=\\1 \\2 74\Name=Cleanup/Joins/Initials 74\Find=([A-Z]\\.)</p>\\s*<p\\b[^>]*>([\"\xe2\x80\x9c]*[A-Z]) 74\Replace=\\1 \\2 80\Name=Cleanup/Joins/Honorifics 80\Find="(Mr|Mrs|Ms|Dr|Prof)\\.</p>\\s+<p class=\"calibre\\d+\">([A-Z])" 80\Replace=\\1. \\2 Note that some need the class name corrected I always do a Search, replace Next because there are always exceptions that I have not trapped (did not even try. It take a few minutes to run the series of S&R's) |
![]() |
![]() |
Advert | |
|
![]() |
#3 |
null operator (he/him)
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 21,599
Karma: 29709834
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Maybe one of the Sigil plugins can do that ==>> Sigil Plugin Index
It'll be one at a time, not en-masse. BR |
![]() |
![]() |
![]() |
Tags |
broken, plugin development, quality check, scanned, sentence structure |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Plugin error: IOError: [Errno 32] Broken pipe | stoduk | Development | 6 | 12-30-2015 07:14 AM |
UI plugin detect device connected? | jgoguen | Development | 7 | 02-28-2014 06:13 AM |
[Old Thread] Android plugin 0.8.3 does not detect Samsung Galaxy SII I9100 | hakan42 | Devices | 22 | 08-28-2011 03:37 AM |
.rtf - a way to find broken sentences? | plumtoad | Other formats | 3 | 07-05-2011 06:09 AM |
Fixing broken sentences. | Vanguard3000 | Sigil | 18 | 01-23-2011 12:45 PM |