View Single Post
Old 07-08-2016, 11:16 PM   #18
Tex2002ans
Wizard
Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.
 
Posts: 2,306
Karma: 13057279
Join Date: Jul 2012
Device: Kobo Forma, Nook
Quote:
Originally Posted by Doitsu View Post
Unfortunately, neither the default LibreOffice grammar checker nor LanguageTool caught the error in the paragraph that you posted. Do you use custom settings or did you get the warning from the MS Word grammar checker?
That typo was caught by Microsoft Word's grammar check.

I must admit, I haven't touched LibreOffice in a while (I just use Notepad++ for all my writing). But the more types of tools you can throw at it, the better (certain tools might catch errors that others might miss).

Quote:
Originally Posted by Doitsu View Post
Shameless plug: In my quest to corner the Sigil wrapper plugin market, I released a LanguageTool (grammar check) validation plugin. (Validation plugin means that it'll display warnings like FlightCrew and not like LibreOffice or the standalone LanguageTool version.)
Hmmmm, very interesting. Well you convinced me to download the latest version of Sigil and test it out (I was holding out on 0.8.6 for a while while the dust settled).

The plugin felt quite rough:
  • If you doubleclick on the error, it jumps you to the paragraph (not necessarily the EXACT position the error is located)
    • Long paragraphs make it very hard to spot where the error actually occurred.
    • Is there any possible way for it to highlight the exact position in the text?
  • Is there a way to split the messages into more columns? It is quite hard to read these errors and figure out WHAT exactly it is complaining about.
    • Currently: File + Line + MessageJammedIntoOneGiantLine
    • Potential: File + Line + Reason + Sentence + Suggestion
  • Is there any possible way for it to run on the entire EPUB at once? Or am I just crazy? (Or didn't read your instructions properly). Currently I am just running it a chapter at a time.
  • Any stats/thoughts on adding the n-gram data?
    • Is there anywhere I could test this beforehand before trying to download the 8 GB beast? :P
    • About how many more errors will this point out? How many more false positives might I have to sift through, or does it do a pretty good job?

Quote:
Originally Posted by GrannyGrump View Post
@Doitsu --- I am looking forward to trying the plugin this weekend. Glad I will no longer have to schlepp my files to work and paste into MS Word for their grammar check.
PASTE into MS Word? You do know Toxaris's Tools now have "Import EPUB"? Or you could do what I did before Toxaris introduced that... Calibre convert EPUB -> RTF/HTMLZ/DOCX + Open it up in your Word Processor of choice.

Quote:
Originally Posted by BetterRed View Post
a) eyeball the Words list filtered on '-' (this morning I found 'them-selves' and 'Wag-nails'),
You are welcome.

Quote:
Originally Posted by BetterRed View Post
b) scan the Character list for 'odd-ball' characters.
Yes, this is one of the first steps I do after I OCR the book. Who knows what crazy characters might have snuck in (or accents on characters). I then go through the book and check every odd/accented character to doublecheck they are correct. Doing this pass also helps you potentially catch inconsistencies like "vis-à-vis" + "vis-a-vis" existing in the same book.

Side Note: Before Toxaris comes swooping in here, yes, his EPUB Tools also has "Check Accents".

Quote:
Originally Posted by BetterRed View Post
The ability to sort the various lists on frequency is helpful, as is the facility to save a list to a csv.
Exporting to CSV has also been recently added to my repertoire (within the last few months). If anything of substance comes out of that research, I will also post that info on MobileRead. (Already caught a few typos that slipped by in my previous passes). :P

Again, just a different way to visualize the data might make discrepancies stand out like a sore thumb.

Last edited by Tex2002ans; 07-09-2016 at 12:38 AM.
Tex2002ans is offline   Reply With Quote