View Single Post
Old 11-12-2017, 08:38 PM   #1
Tex2002ans
Wizard
Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.
 
Posts: 2,306
Karma: 13057279
Join Date: Jul 2012
Device: Kobo Forma, Nook
Suggestion: Spellcheck Enhancement (Numbers)

Currently, Sigil does not consider Numbers as "words".

This means that Spellcheck can't catch entire classes of errors, because many "words" don't display in the Spellcheck List (Tools > Spellcheck > Spellcheck).

I've attached an Example EPUB to show the issue.

The Problem

This includes the "Current Spellcheck List" vs. "Proposed Spellcheck List" in Spoilers.

Example 1: Centuries or Years:

Code:
In the 21st century, [...]
In the 1800’s, there was [...]
Spoiler:
Code:
In          In
the         the
st          21st
century     century
s           1800’s
there       there
was         was


Example 2: Pounds/Shillings/Pence/Money

Code:
The device cost £14 8s 2d.
Spoiler:
Code:
The         The
device      device
cost        cost
            14
s           8s
d           2d


Example 3: Hyphenated Years or Age:

Code:
In the 10-year period between [...]
The 10-year-old girl [...]
Spoiler:
Code:
In         In
the        the
year       10-year
period     period
between    between
The        The
year-old   10-year-old


Example 4: Weights/Measures

Code:
It weighs 100.5lbs.
The length is 100.5km and 2ft.
Spoiler:
Code:
It         It
weighs     weighs
lbs        100.5lbs
The        The
length     length
is         is
km         100.5km
and        and
ft         2ft


Example 5: Indexes/Footnotes

Code:
Dogs, 123n., 125, 130n.
See p. 123ff.
Spoiler:
Code:
Dogs      Dogs
n         123n
          125
n         130n
See       See
p         p
ff        123ff


Example 6: A very common typo (especially because of OCR):

Code:
In the 196os, the president was [...]
In l941, the samples were [...]
Good argument, h0wever, you are [...]
Spoiler:
Code:
os        196os
l         l941
h0wever   h0wever


It in Action

Calibre already includes numbers in their Spellcheck:

Click image for larger version

Name:	CalibreNumberSpellcheck.png
Views:	355
Size:	25.3 KB
ID:	159978

and it is extremely helpful.

Proposal

There is one downside to the Calibre-method though, because the Spellcheck List gets flooded with numbers. Especially when dealing with HTML tables full of data (or in Indexes):

Click image for larger version

Name:	CalibreNumberFlood.png
Views:	365
Size:	28.8 KB
ID:	159979

To get around that issue:
  • Ignore all "words" made completely of numbers + punctuation
    • Although I could see a usage for still keeping this (catching typos)
  • Include the "Numbers as Words" as a checkbox/toggle.
    • Similar to "Show All Words"
    • Probably default to OFF.
    • This could allow the user to choose whether they want to display those full-numbers or not.
Attached Files
File Type: epub NumbersAsWords[Test].epub (3.6 KB, 234 views)
Tex2002ans is offline   Reply With Quote