04-21-2014, 09:15 AM | #61 |
creator of calibre
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
@jackie_w: calibre uses the ICU word break iteration algorithm, which as far as I recall, splits up most hyphenated words into two words (the details are language dependent), so, for example, abc-def will show up in the words list as two words, abc and def
See http://userguide.icu-project.org/boundaryanalysis for details |
04-21-2014, 09:34 AM | #62 |
Grand Sorcerer
Posts: 6,212
Karma: 16534894
Join Date: Sep 2009
Location: UK
Device: Kobo: KA1, ClaraHD, Forma, Libra2, Clara2E. PocketBook: TouchHD3
|
Oh, well. I was hoping this would be one of the Sigil features which would be replicated. A quick list of all hyphenated words is useful when dealing with a scanned source which has not been 'de-hyphenated' very well.
The new spellchecker is a welcome addition, though. Thanks |
Advert | |
|
04-21-2014, 09:55 AM | #63 | ||
null operator (he/him)
Posts: 20,575
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
At http://www.unicode.org/reports/tr29/#WB14 there is this with respect to word boundaries and hyphens Quote:
BR |
||
04-21-2014, 10:38 AM | #64 |
creator of calibre
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
@BR: You'll have to take it up with the developers of ICU, this is one swamp I have no intention of wading into.
|
04-21-2014, 11:48 AM | #65 |
Color me gone
Posts: 2,089
Karma: 1445295
Join Date: Apr 2008
Location: Central Oregon Coast
Device: PRS-300
|
Kovid, when you create a user dictionary, is it active on all documents, or is it possible to select ones to activate?
|
Advert | |
|
04-21-2014, 04:55 PM | #66 | |
Dead account. Bye
Posts: 587
Karma: 668244
Join Date: Mar 2011
Device: none
|
Hi Kovid, in the release notes for 1.33.0 it says:
Quote:
Code:
calibre, version 1.33.0 ERROR: Failed to check spelling: Failed to check spelling, click "Show details" for the full error information. Traceback (most recent call last): File "M:\DVD Personal 55\Calibre\SourceCode\src\calibre\gui2\tweak_book\spell.py", line 1102, in get_words File "M:\DVD Personal 55\Calibre\SourceCode\src\calibre\gui2\tweak_book\spell.py", line 1102, in <dictcomp> File "M:\DVD Personal 55\Calibre\SourceCode\src\calibre\spell\dictionary.py", line 339, in recognized File "M:\DVD Personal 55\Calibre\SourceCode\src\calibre\spell\dictionary.py", line 205, in dictionary_for_locale File "M:\DVD Personal 55\Calibre\SourceCode\src\calibre\spell\dictionary.py", line 174, in load_dictionary IOError: [Errno 2] No such file or directory: u'M:\\DVD Personal 55\\Calibre\\SourceCode\\resources\\dictionaries\\es-ES\\es-AR.dic' Code:
IOError: [Errno 2] No such file or directory: u'C:\\Program Files\\Calibre2\\resources\\dictionaries\\es-ES\\es-AR.dic' What am I doing wrong? (Nevertheless if this is the "default" warning you get if you don't have the needed dictionary installed, I do really think it's veeeeeery un-userfriendly). |
|
04-21-2014, 06:11 PM | #67 | |
null operator (he/him)
Posts: 20,575
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
BR Last edited by BetterRed; 04-21-2014 at 06:14 PM. |
|
04-21-2014, 10:54 PM | #68 |
creator of calibre
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
@BR: I have ICU 52.1 on my system and hyphenated words are still split up by the break iterator. IIRC the break iterator has been present for a very long time in ICU and I doubt the algorithm for english has changed anytime in the recent past. Not to mention, that I highly doubt sigil uses ICU for word iteration. I'd guess the only reason sigil includes ICU is because WebKit requires it.
@mrmikel: You can mark a user dictionary active or inactive. Active ones apply to all documents, inactive ones to none. |
04-21-2014, 11:12 PM | #69 |
creator of calibre
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
|
04-22-2014, 10:29 AM | #70 | |
Dead account. Bye
Posts: 587
Karma: 668244
Join Date: Mar 2011
Device: none
|
Quote:
Another doubt or possible feature. Looking in the Manage Dictionaries window I can see that there are three built-in ones:
And I can see that, as simple example, Philippines English is associated to US English dictionary, and New Zealand to British one. But I cannot change that association (or I don't know how). Could it be a good idea? I mean, OK, I don't have a Kiwi English dictionary installed, why should I use the British one instead of the US one? Who has decided that Australian English is closer to British than to US one? (Please keep in mind that I'm hypothetically speaking. In fact I don't really know if Ghana English is SO much closer to the British flavour that the previous possible choice just makes no sense at all. No offence meant in any way). |
|
04-22-2014, 10:47 AM | #71 |
creator of calibre
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
The associations come from the maintainers of the dictionaries. I am not going to get involved in second guessing them. If you find you prefer a US english dictionary for say ghana instead of the GB one, then just install a custom US english one and set it to be the one used for ghana. Or change your book to use the language code en-US instead of en-GH or whatever.
|
04-22-2014, 01:43 PM | #72 |
Guru
Posts: 655
Karma: 64171
Join Date: Sep 2010
Location: Kent, England, Sol 3, ZZ9 plural Z Alpha
Device: Sony PRS-300, Kobo Aura HD, iPad (Marvin)
|
@arspr, I think which association has to do with some of the root words, and their spelling.
(think colour/color or honour/honor) |
04-25-2014, 02:55 AM | #73 |
Wizard
Posts: 1,161
Karma: 1404241
Join Date: Nov 2010
Location: Germany
Device: Sony PRS-650
|
Hi Kovid,
Thank you very much for integrating the import function. I made a first test it with my wordlists. Now I am missing some additional things (sorry). Exporting words out of a user dictionary to clipboard and/or File. I can select more than one word, but I cannot copy them together to clipboard (only the first one in a list of more than one word). Sometimes there is a need to make corrections in the word list. Is it possible to implement a function for edit a word, change of language for selected words and delete selected words? |
04-25-2014, 03:24 AM | #74 |
creator of calibre
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
There are already buttons to add and remove words. If you wish to edit, remove and then re-add the changed word.
|
04-25-2014, 03:25 AM | #75 |
Interested in the matter
Posts: 421
Karma: 426094
Join Date: Dec 2011
Location: Spain, south coast
Device: Pocketbook InkPad 3
|
Import list of words
The feature Import list of words works correctly, word by word and paste from the clipboard.
But ... simultaneously it opens the attached error window: Thanks for all Kovid. |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Spelling anomalies | DMB | General Discussions | 71 | 06-19-2012 07:55 AM |
Are DRM books with check in/check out allowed? | i8abug | Library Management | 4 | 05-31-2012 02:27 PM |
Spelling errors and such | starrlamia | General Discussions | 29 | 11-29-2010 03:59 AM |
Seriously thoughtful Spelling contractions | SameOldStory | Lounge | 47 | 09-08-2010 09:08 PM |
Spelling Macro | PieOPah | Workshop | 36 | 12-13-2008 02:27 AM |