Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Editor

Notices

Reply
 
Thread Tools Search this Thread
Old 04-21-2014, 09:15 AM   #61
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
@jackie_w: calibre uses the ICU word break iteration algorithm, which as far as I recall, splits up most hyphenated words into two words (the details are language dependent), so, for example, abc-def will show up in the words list as two words, abc and def

See http://userguide.icu-project.org/boundaryanalysis for details
kovidgoyal is offline   Reply With Quote
Old 04-21-2014, 09:34 AM   #62
jackie_w
Grand Sorcerer
jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.
 
Posts: 6,212
Karma: 16534894
Join Date: Sep 2009
Location: UK
Device: Kobo: KA1, ClaraHD, Forma, Libra2, Clara2E. PocketBook: TouchHD3
Oh, well. I was hoping this would be one of the Sigil features which would be replicated. A quick list of all hyphenated words is useful when dealing with a scanned source which has not been 'de-hyphenated' very well.

The new spellchecker is a welcome addition, though. Thanks
jackie_w is offline   Reply With Quote
Advert
Old 04-21-2014, 09:55 AM   #63
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 20,575
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Quote:
Originally Posted by kovidgoyal View Post
@jackie_w: calibre uses the ICU word break iteration algorithm, which as far as I recall, splits up most hyphenated words into two words (the details are language dependent), so, for example, abc-def will show up in the words list as two words, abc and def

See http://userguide.icu-project.org/boundaryanalysis for details
that's a valuable link - explains some things I've been puzzling about wrt leading & trailing apostrophes.

At http://www.unicode.org/reports/tr29/#WB14 there is this with respect to word boundaries and hyphens

Quote:
The correct interpretation of hyphens in the context of word boundaries is challenging ... it is better overall to keep the hyphen out of the default definition
Is that to be interpreted as... a hyphen should or should not constitute a word boundary... I'm inclined to read it as should not.

BR
BetterRed is offline   Reply With Quote
Old 04-21-2014, 10:38 AM   #64
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
@BR: You'll have to take it up with the developers of ICU, this is one swamp I have no intention of wading into.
kovidgoyal is offline   Reply With Quote
Old 04-21-2014, 11:48 AM   #65
mrmikel
Color me gone
mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.mrmikel ought to be getting tired of karma fortunes by now.
 
Posts: 2,089
Karma: 1445295
Join Date: Apr 2008
Location: Central Oregon Coast
Device: PRS-300
Kovid, when you create a user dictionary, is it active on all documents, or is it possible to select ones to activate?
mrmikel is offline   Reply With Quote
Advert
Old 04-21-2014, 04:55 PM   #66
arspr
Dead account. Bye
arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.
 
Posts: 587
Karma: 668244
Join Date: Mar 2011
Device: none
Hi Kovid, in the release notes for 1.33.0 it says:
Quote:
It comes with builtin dictionaries for English and Spanish.
But when I try to run the check on Spanish books I get:

Code:
calibre, version 1.33.0
ERROR: Failed to check spelling: Failed to check spelling, click "Show details" for the full error information.

Traceback (most recent call last):
  File "M:\DVD Personal 55\Calibre\SourceCode\src\calibre\gui2\tweak_book\spell.py", line 1102, in get_words
  File "M:\DVD Personal 55\Calibre\SourceCode\src\calibre\gui2\tweak_book\spell.py", line 1102, in <dictcomp>
  File "M:\DVD Personal 55\Calibre\SourceCode\src\calibre\spell\dictionary.py", line 339, in recognized
  File "M:\DVD Personal 55\Calibre\SourceCode\src\calibre\spell\dictionary.py", line 205, in dictionary_for_locale
  File "M:\DVD Personal 55\Calibre\SourceCode\src\calibre\spell\dictionary.py", line 174, in load_dictionary
IOError: [Errno 2] No such file or directory: u'M:\\DVD Personal 55\\Calibre\\SourceCode\\resources\\dictionaries\\es-ES\\es-AR.dic'
(I've also tested with "official" 1.33 and I get the same error but loading from
Code:
IOError: [Errno 2] No such file or directory: u'C:\\Program Files\\Calibre2\\resources\\dictionaries\\es-ES\\es-AR.dic'
)





What am I doing wrong?

(Nevertheless if this is the "default" warning you get if you don't have the needed dictionary installed, I do really think it's veeeeeery un-userfriendly).
arspr is offline   Reply With Quote
Old 04-21-2014, 06:11 PM   #67
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 20,575
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Quote:
Originally Posted by kovidgoyal View Post
@BR: You'll have to take it up with the developers of ICU, this is one swamp I have no intention of wading into.
Before I put on me wellies on and get me canoe out - I note that Sigil appears to be using a later version of the icu libraries than the one's that calibre uses.

BR
Attached Thumbnails
Click image for larger version

Name:	Capture.JPG
Views:	238
Size:	74.0 KB
ID:	121954  

Last edited by BetterRed; 04-21-2014 at 06:14 PM.
BetterRed is offline   Reply With Quote
Old 04-21-2014, 10:54 PM   #68
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
@BR: I have ICU 52.1 on my system and hyphenated words are still split up by the break iterator. IIRC the break iterator has been present for a very long time in ICU and I doubt the algorithm for english has changed anytime in the recent past. Not to mention, that I highly doubt sigil uses ICU for word iteration. I'd guess the only reason sigil includes ICU is because WebKit requires it.

@mrmikel: You can mark a user dictionary active or inactive. Active ones apply to all documents, inactive ones to none.
kovidgoyal is offline   Reply With Quote
Old 04-21-2014, 11:12 PM   #69
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
@arspr: https://github.com/kovidgoyal/calibr...b6581d78c46f01
kovidgoyal is offline   Reply With Quote
Old 04-22-2014, 10:29 AM   #70
arspr
Dead account. Bye
arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.arspr ought to be getting tired of karma fortunes by now.
 
Posts: 587
Karma: 668244
Join Date: Mar 2011
Device: none
Quote:
Originally Posted by kovidgoyal View Post
Fixed

Another doubt or possible feature. Looking in the Manage Dictionaries window I can see that there are three built-in ones:
  • British English
  • US English
  • Spanish (whatever flavour)

And I can see that, as simple example, Philippines English is associated to US English dictionary, and New Zealand to British one. But I cannot change that association (or I don't know how). Could it be a good idea? I mean, OK, I don't have a Kiwi English dictionary installed, why should I use the British one instead of the US one? Who has decided that Australian English is closer to British than to US one?

(Please keep in mind that I'm hypothetically speaking. In fact I don't really know if Ghana English is SO much closer to the British flavour that the previous possible choice just makes no sense at all. No offence meant in any way).
arspr is offline   Reply With Quote
Old 04-22-2014, 10:47 AM   #71
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
The associations come from the maintainers of the dictionaries. I am not going to get involved in second guessing them. If you find you prefer a US english dictionary for say ghana instead of the GB one, then just install a custom US english one and set it to be the one used for ghana. Or change your book to use the language code en-US instead of en-GH or whatever.
kovidgoyal is offline   Reply With Quote
Old 04-22-2014, 01:43 PM   #72
Perkin
Guru
Perkin calls his or her ebook reader Vera.Perkin calls his or her ebook reader Vera.Perkin calls his or her ebook reader Vera.Perkin calls his or her ebook reader Vera.Perkin calls his or her ebook reader Vera.Perkin calls his or her ebook reader Vera.Perkin calls his or her ebook reader Vera.Perkin calls his or her ebook reader Vera.Perkin calls his or her ebook reader Vera.Perkin calls his or her ebook reader Vera.Perkin calls his or her ebook reader Vera.
 
Perkin's Avatar
 
Posts: 655
Karma: 64171
Join Date: Sep 2010
Location: Kent, England, Sol 3, ZZ9 plural Z Alpha
Device: Sony PRS-300, Kobo Aura HD, iPad (Marvin)
@arspr, I think which association has to do with some of the root words, and their spelling.
(think colour/color or honour/honor)
Perkin is offline   Reply With Quote
Old 04-25-2014, 02:55 AM   #73
Divingduck
Wizard
Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.Divingduck ought to be getting tired of karma fortunes by now.
 
Posts: 1,161
Karma: 1404241
Join Date: Nov 2010
Location: Germany
Device: Sony PRS-650
Hi Kovid,
Thank you very much for integrating the import function. I made a first test it with my wordlists. Now I am missing some additional things (sorry).
Exporting words out of a user dictionary to clipboard and/or File. I can select more than one word, but I cannot copy them together to clipboard (only the first one in a list of more than one word).
Sometimes there is a need to make corrections in the word list. Is it possible to implement a function for edit a word, change of language for selected words and delete selected words?
Divingduck is offline   Reply With Quote
Old 04-25-2014, 03:24 AM   #74
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
There are already buttons to add and remove words. If you wish to edit, remove and then re-add the changed word.
kovidgoyal is offline   Reply With Quote
Old 04-25-2014, 03:25 AM   #75
jbacelar
Interested in the matter
jbacelar ought to be getting tired of karma fortunes by now.jbacelar ought to be getting tired of karma fortunes by now.jbacelar ought to be getting tired of karma fortunes by now.jbacelar ought to be getting tired of karma fortunes by now.jbacelar ought to be getting tired of karma fortunes by now.jbacelar ought to be getting tired of karma fortunes by now.jbacelar ought to be getting tired of karma fortunes by now.jbacelar ought to be getting tired of karma fortunes by now.jbacelar ought to be getting tired of karma fortunes by now.jbacelar ought to be getting tired of karma fortunes by now.jbacelar ought to be getting tired of karma fortunes by now.
 
jbacelar's Avatar
 
Posts: 421
Karma: 426094
Join Date: Dec 2011
Location: Spain, south coast
Device: Pocketbook InkPad 3
Import list of words

The feature Import list of words works correctly, word by word and paste from the clipboard.
But ... simultaneously it opens the attached error window:

Click image for larger version

Name:	Error.jpg
Views:	228
Size:	45.8 KB
ID:	122060

Thanks for all Kovid.
jbacelar is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Spelling anomalies DMB General Discussions 71 06-19-2012 07:55 AM
Are DRM books with check in/check out allowed? i8abug Library Management 4 05-31-2012 02:27 PM
Spelling errors and such starrlamia General Discussions 29 11-29-2010 03:59 AM
Seriously thoughtful Spelling contractions SameOldStory Lounge 47 09-08-2010 09:08 PM
Spelling Macro PieOPah Workshop 36 12-13-2008 02:27 AM


All times are GMT -4. The time now is 02:10 PM.


MobileRead.com is a privately owned, operated and funded community.