![]() |
#61 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,209
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
@jackie_w: calibre uses the ICU word break iteration algorithm, which as far as I recall, splits up most hyphenated words into two words (the details are language dependent), so, for example, abc-def will show up in the words list as two words, abc and def
See http://userguide.icu-project.org/boundaryanalysis for details |
![]() |
![]() |
![]() |
#62 |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 6,249
Karma: 16539642
Join Date: Sep 2009
Location: UK
Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3
|
Oh, well. I was hoping this would be one of the Sigil features which would be replicated. A quick list of all hyphenated words is useful when dealing with a scanned source which has not been 'de-hyphenated' very well.
The new spellchecker is a welcome addition, though. Thanks ![]() |
![]() |
![]() |
Advert | |
|
![]() |
#63 | ||
null operator (he/him)
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 21,638
Karma: 29710510
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
At http://www.unicode.org/reports/tr29/#WB14 there is this with respect to word boundaries and hyphens Quote:
BR |
||
![]() |
![]() |
![]() |
#64 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,209
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
@BR: You'll have to take it up with the developers of ICU, this is one swamp I have no intention of wading into.
|
![]() |
![]() |
![]() |
#65 |
Color me gone
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 2,089
Karma: 1445295
Join Date: Apr 2008
Location: Central Oregon Coast
Device: PRS-300
|
Kovid, when you create a user dictionary, is it active on all documents, or is it possible to select ones to activate?
|
![]() |
![]() |
Advert | |
|
![]() |
#66 | |
Dead account. Bye
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 587
Karma: 668244
Join Date: Mar 2011
Device: none
|
Hi Kovid, in the release notes for 1.33.0 it says:
Quote:
Code:
calibre, version 1.33.0 ERROR: Failed to check spelling: Failed to check spelling, click "Show details" for the full error information. Traceback (most recent call last): File "M:\DVD Personal 55\Calibre\SourceCode\src\calibre\gui2\tweak_book\spell.py", line 1102, in get_words File "M:\DVD Personal 55\Calibre\SourceCode\src\calibre\gui2\tweak_book\spell.py", line 1102, in <dictcomp> File "M:\DVD Personal 55\Calibre\SourceCode\src\calibre\spell\dictionary.py", line 339, in recognized File "M:\DVD Personal 55\Calibre\SourceCode\src\calibre\spell\dictionary.py", line 205, in dictionary_for_locale File "M:\DVD Personal 55\Calibre\SourceCode\src\calibre\spell\dictionary.py", line 174, in load_dictionary IOError: [Errno 2] No such file or directory: u'M:\\DVD Personal 55\\Calibre\\SourceCode\\resources\\dictionaries\\es-ES\\es-AR.dic' Code:
IOError: [Errno 2] No such file or directory: u'C:\\Program Files\\Calibre2\\resources\\dictionaries\\es-ES\\es-AR.dic' ![]() ![]() ![]() What am I doing wrong? (Nevertheless if this is the "default" warning you get if you don't have the needed dictionary installed, I do really think it's veeeeeery un-userfriendly). |
|
![]() |
![]() |
![]() |
#67 | |
null operator (he/him)
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 21,638
Karma: 29710510
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
BR Last edited by BetterRed; 04-21-2014 at 06:14 PM. |
|
![]() |
![]() |
![]() |
#68 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,209
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
@BR: I have ICU 52.1 on my system and hyphenated words are still split up by the break iterator. IIRC the break iterator has been present for a very long time in ICU and I doubt the algorithm for english has changed anytime in the recent past. Not to mention, that I highly doubt sigil uses ICU for word iteration. I'd guess the only reason sigil includes ICU is because WebKit requires it.
@mrmikel: You can mark a user dictionary active or inactive. Active ones apply to all documents, inactive ones to none. |
![]() |
![]() |
![]() |
#69 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,209
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
|
![]() |
![]() |
![]() |
#70 | |
Dead account. Bye
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 587
Karma: 668244
Join Date: Mar 2011
Device: none
|
Quote:
![]() Another doubt or possible feature. Looking in the Manage Dictionaries window I can see that there are three built-in ones:
And I can see that, as simple example, Philippines English is associated to US English dictionary, and New Zealand to British one. But I cannot change that association (or I don't know how). Could it be a good idea? I mean, OK, I don't have a Kiwi ![]() (Please keep in mind that I'm hypothetically speaking. In fact I don't really know if Ghana English is SO much closer to the British flavour that the previous possible choice just makes no sense at all. No offence meant in any way). |
|
![]() |
![]() |
![]() |
#71 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,209
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
The associations come from the maintainers of the dictionaries. I am not going to get involved in second guessing them. If you find you prefer a US english dictionary for say ghana instead of the GB one, then just install a custom US english one and set it to be the one used for ghana. Or change your book to use the language code en-US instead of en-GH or whatever.
|
![]() |
![]() |
![]() |
#72 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 657
Karma: 64171
Join Date: Sep 2010
Location: Kent, England, Sol 3, ZZ9 plural Z Alpha
Device: Sony PRS-300, Kobo Aura HD, iPad (Marvin)
|
@arspr, I think which association has to do with some of the root words, and their spelling.
(think colour/color or honour/honor) |
![]() |
![]() |
![]() |
#73 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,166
Karma: 1410083
Join Date: Nov 2010
Location: Germany
Device: Sony PRS-650
|
Hi Kovid,
Thank you very much for integrating the import function. I made a first test it with my wordlists. Now I am missing some additional things ![]() Exporting words out of a user dictionary to clipboard and/or File. I can select more than one word, but I cannot copy them together to clipboard (only the first one in a list of more than one word). Sometimes there is a need to make corrections in the word list. Is it possible to implement a function for edit a word, change of language for selected words and delete selected words? |
![]() |
![]() |
![]() |
#74 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,209
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
There are already buttons to add and remove words. If you wish to edit, remove and then re-add the changed word.
|
![]() |
![]() |
![]() |
#75 |
Interested in the matter
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 421
Karma: 426094
Join Date: Dec 2011
Location: Spain, south coast
Device: Pocketbook InkPad 3
|
Import list of words
The feature Import list of words works correctly, word by word and paste from the clipboard.
But ... simultaneously it opens the attached error window: Thanks for all Kovid. |
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Spelling anomalies | DMB | General Discussions | 71 | 06-19-2012 07:55 AM |
Are DRM books with check in/check out allowed? | i8abug | Library Management | 4 | 05-31-2012 02:27 PM |
Spelling errors and such | starrlamia | General Discussions | 29 | 11-29-2010 03:59 AM |
Seriously thoughtful Spelling contractions | SameOldStory | Lounge | 47 | 09-08-2010 09:08 PM |
Spelling Macro | PieOPah | Workshop | 36 | 12-13-2008 02:27 AM |