11-07-2012, 03:35 PM | #1 |
Groupie
Posts: 191
Karma: 5484808
Join Date: Sep 2012
Device: Sony PRS-T2, Kindle
|
Next Misspelled Word
I have discovered a problem with the Next Misspelled Word function of Sigil, but I don't know what's causing it.
I am trying to clean up the Google-Books epub of Henry Adams's History of the United States to eliminate some distractions. Mainly these are spelling mistakes, scan errors, or words split in two. The split is only visible in Sigil in Code View, or in my Reader when the first part of the split comes at the end of a line. For example, here the word "learning" is split by code whose function, as a beginner, I don't understand: On learn<a id="GBS.PA58.w.1.0.0"></a><span class="gtxt_body" id="para.70.1.0.box.248.234.1006.310.q.60">ing the sale of Louisiana [etc.] To find these errors, I have been using Sigil's "Next Misspelled Word"-function, which generally works as I would expect. However, a few times now I have found that it will skip from one section to the next, even though there are still misspelled words farther down in the first section. Say that I am checking for errors in content-0020.xml, it will skip to content-0021.xml, even though there are still plenty of words underlined in red in content-0020.xml. Here's an example of where that happens: <p class="gtxt_footnote" id="para.367.2.0.box.242.1770.999.79.q.40" style="text-indent:1em;"><sup>1</sup> Mémoire, etc., lu à l'Institut National le 15 Germinal, An v. (April 4, 1797).</p> In section content-0020.xml the words "Mémoire, etc., lu à l'Institut" and "le" are all underlined in red. Clicking on the Next Misspelled Word button highlights each of them, one after the other, in blue - up to and including the word "à". If I click on the Next Misspelled Word button again, the spelling check skips to section content-0021.xml, ignoring "l'Institut" and "le", and a bunch of other red-underlined words farther down the page. If I insert the cursor after "l'Institut," the spelling check continues in content-0020.xml, instead of skipping to the next section - until, that is, I reach this line: <p class="gtxt_footnote" id="para.384.2.0.box.226.1767.1003.72.q.50" style="text-indent:1em;"><sup>1</sup> Rapport à l'Empereur, 28 Brumaire, An xiii. (Nov. 19,1804); Archives des Aff. Étr. MSS.</p> The words "à l'Empereur" are underlined in red. As before, "à" gets highlighted in blue, but when I click on the Next Misspelled Word button again, it skips to content-0021.xml, even though it hasn't reached the bottom of content-0020.xml yet. Again, if I insert the cursor after "l'Empereur," the spelling check continues in content-0020.xml, instead of skipping to the next section. |
11-07-2012, 04:03 PM | #2 |
Well trained by Cats
Posts: 29,662
Karma: 54369090
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
<a id="GBS.PA58.w.1.0.0"></a><span class="gtxt_body" id="para.70.1.0.box.248.234.1006.310.q.60"> (and it is missing the closing </span> )
That is an anchor(point) FROM the footnote/Index to allow a return to the middle of the word? While HTML visually correct, I would never expect a spell check to wade through such odd usage |
11-07-2012, 04:33 PM | #3 |
Sigil developer
Posts: 1,275
Karma: 1101600
Join Date: Jan 2011
Location: UK
Device: Kindle PW, K4 NT, K3, Kobo Touch
|
Well that is pretty heavily tagged text, but the issue is also shown by just
<p>test à this wrd</p> It finds à as misspelled but then won't move forward. |
11-07-2012, 05:23 PM | #4 |
calibre/Sigil Developer
Posts: 4,588
Karma: 2092208
Join Date: Oct 2010
Location: Australia
Device: Kindle Oasis
|
Fixed for 0.6.1.
The issue is any time it reaches a 1-character misspelled word it will not jump to any other misspelled words further on the page for the next check. And as stated the workaround for now is placing the cursor several characters after that 1-character misspelled word. |
11-07-2012, 06:58 PM | #5 | |||
Groupie
Posts: 191
Karma: 5484808
Join Date: Sep 2012
Device: Sony PRS-T2, Kindle
|
Quote:
Quote:
Quote:
forest covered every portion, except here and there a str<a id="GBS.PA1.w.0.1.0.1"></a>ip of cultivated soil |
|||
11-07-2012, 06:59 PM | #6 |
Groupie
Posts: 191
Karma: 5484808
Join Date: Sep 2012
Device: Sony PRS-T2, Kindle
|
Thanks meme and kiwidude.
|
11-07-2012, 07:17 PM | #7 |
Well trained by Cats
Posts: 29,662
Karma: 54369090
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
|
11-07-2012, 08:27 PM | #8 | |
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
Quote:
Dale |
|
Thread Tools | Search this Thread |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
I have a misspelled genre tag | mdietz39 | Library Management | 1 | 06-04-2012 08:13 PM |
Romance Ebers, Georg: A Word, Only a Word. V1. 20 Mar 2009 | crutledge | IMP Books | 0 | 03-20-2009 08:12 AM |
Romance Ebers, Georg: A Word, Only a Word. V1. 20 Mar 2009 | crutledge | ePub Books | 0 | 03-20-2009 08:09 AM |