|
|
#1 |
|
Member
![]() Posts: 16
Karma: 10
Join Date: Sep 2012
Device: sony prs-t2
|
Next Misspelled Word
I am trying to clean up the Google-Books epub of Henry Adams's History of the United States to eliminate some distractions. Mainly these are spelling mistakes, scan errors, or words split in two. The split is only visible in Sigil in Code View, or in my Reader when the first part of the split comes at the end of a line. For example, here the word "learning" is split by code whose function, as a beginner, I don't understand: On learn<a id="GBS.PA58.w.1.0.0"></a><span class="gtxt_body" id="para.70.1.0.box.248.234.1006.310.q.60">ing the sale of Louisiana [etc.] To find these errors, I have been using Sigil's "Next Misspelled Word"-function, which generally works as I would expect. However, a few times now I have found that it will skip from one section to the next, even though there are still misspelled words farther down in the first section. Say that I am checking for errors in content-0020.xml, it will skip to content-0021.xml, even though there are still plenty of words underlined in red in content-0020.xml. Here's an example of where that happens: <p class="gtxt_footnote" id="para.367.2.0.box.242.1770.999.79.q.40" style="text-indent:1em;"><sup>1</sup> Mémoire, etc., lu à l'Institut National le 15 Germinal, An v. (April 4, 1797).</p> In section content-0020.xml the words "Mémoire, etc., lu à l'Institut" and "le" are all underlined in red. Clicking on the Next Misspelled Word button highlights each of them, one after the other, in blue - up to and including the word "à". If I click on the Next Misspelled Word button again, the spelling check skips to section content-0021.xml, ignoring "l'Institut" and "le", and a bunch of other red-underlined words farther down the page. If I insert the cursor after "l'Institut," the spelling check continues in content-0020.xml, instead of skipping to the next section - until, that is, I reach this line: <p class="gtxt_footnote" id="para.384.2.0.box.226.1767.1003.72.q.50" style="text-indent:1em;"><sup>1</sup> Rapport à l'Empereur, 28 Brumaire, An xiii. (Nov. 19,1804); Archives des Aff. Étr. MSS.</p> The words "à l'Empereur" are underlined in red. As before, "à" gets highlighted in blue, but when I click on the Next Misspelled Word button again, it skips to content-0021.xml, even though it hasn't reached the bottom of content-0020.xml yet. Again, if I insert the cursor after "l'Empereur," the spelling check continues in content-0020.xml, instead of skipping to the next section. |
|
|
|
|
|
#2 |
|
Staff to 4 Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 10,715
Karma: 2485850
Join Date: Aug 2009
Location: The (original) Silicon Valley, USA
Device: Galaxy Tab 2,Black Astak PEz, K4NT(now Wifes)
|
<a id="GBS.PA58.w.1.0.0"></a><span class="gtxt_body" id="para.70.1.0.box.248.234.1006.310.q.60"> (and it is missing the closing </span> )
That is an anchor(point) FROM the footnote/Index to allow a return to the middle of the word? While HTML visually correct, I would never expect a spell check to wade through such odd usage
__________________
Using: Ubuntu(32 bit):Oneric,Precise and XPpro SP3, W7HP(64)- - Libre Office w/Writer2EPUB
|
|
|
|
|
Enthusiast
|
|
|
|
#3 |
|
Sigil developer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,261
Karma: 1101600
Join Date: Jan 2011
Location: UK
Device: Kindle PW, K4 NT, K3, Kobo Touch
|
Well that is pretty heavily tagged text, but the issue is also shown by just
<p>test à this wrd</p> It finds à as misspelled but then won't move forward.
__________________
See the Sigil User Guide and its tutorials for details about Sigil. |
|
|
|
|
|
#4 |
|
calibre/Sigil Developer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,087
Karma: 1211084
Join Date: Oct 2010
Location: London, UK
Device: Kindle 3 3G, iPad 2, iPad 3
|
Fixed for 0.6.1.
The issue is any time it reaches a 1-character misspelled word it will not jump to any other misspelled words further on the page for the next check. And as stated the workaround for now is placing the cursor several characters after that 1-character misspelled word.
__________________
Like my calibre plugins or Sigil work? Say thanks with PayPal |
|
|
|
|
|
#5 | |||
|
Member
![]() Posts: 16
Karma: 10
Join Date: Sep 2012
Device: sony prs-t2
|
Quote:
Quote:
Quote:
forest covered every portion, except here and there a str<a id="GBS.PA1.w.0.1.0.1"></a>ip of cultivated soil |
|||
|
|
|
|
|
#6 |
|
Member
![]() Posts: 16
Karma: 10
Join Date: Sep 2012
Device: sony prs-t2
|
Thanks meme and kiwidude.
|
|
|
|
|
|
#7 | |
|
Staff to 4 Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 10,715
Karma: 2485850
Join Date: Aug 2009
Location: The (original) Silicon Valley, USA
Device: Galaxy Tab 2,Black Astak PEz, K4NT(now Wifes)
|
Quote:
__________________
Using: Ubuntu(32 bit):Oneric,Precise and XPpro SP3, W7HP(64)- - Libre Office w/Writer2EPUB
|
|
|
|
|
|
|
#8 | |
|
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 8,702
Karma: 3644259
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2
|
Quote:
Dale
__________________
Dale DePriest http://pages.suddenlink.net/dalede or http://daledepriest.wikispaces.com currently using an EZ Reader or a Literati or my iPad. |
|
|
|
|
![]() |
| Thread Tools | Search this Thread |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| I have a misspelled genre tag | mdietz39 | Library Management | 1 | 06-04-2012 08:13 PM |
| Romance Ebers, Georg: A Word, Only a Word. V1. 20 Mar 2009 | crutledge | IMP Books | 0 | 03-20-2009 08:12 AM |
| Romance Ebers, Georg: A Word, Only a Word. V1. 20 Mar 2009 | crutledge | ePub Books | 0 | 03-20-2009 08:09 AM |