|
View Poll Results: Do you want sorting as described in the first post? | |||
Yes | 5 | 23.81% | |
No | 6 | 28.57% | |
Don't care | 10 | 47.62% | |
Voters: 21. You may not vote on this poll |
|
Thread Tools | Search this Thread |
10-03-2010, 09:07 AM | #1 |
Grand Sorcerer
Posts: 11,961
Karma: 7225107
Join Date: Jan 2010
Location: Notts, England
Device: Kobo Libra 2
|
Sorting with accented characters
Mixx has suggested in this post that calibre should sort accented characters as equivalent to their non-accented ASCII version.
What the suggestion means is that e, è, and é would be sorted as if they are exactly the same letter. There is no guarantee that one would come in front of the other. The same is true for c and ç, s and ß, A and Å, etc. I have no idea what it means for non-latin characters such as Greek, Cyrillic or Chinese, but my guess is that they would sort using the letters that are used when creating file system names. It should (at least) be consistent. Do you, the calibre users, want this? |
10-03-2010, 09:19 AM | #2 |
Wizard
Posts: 3,130
Karma: 91256
Join Date: Feb 2008
Location: Germany
Device: Cybook Gen3
|
I'd like to elaborate beyond just voting: Personally, I don't care either way, as I most often use search to find the book(s) I want. But, as I see it, this would make sorting more consistent for international users (for example I personally feel that "ö" should follow "o", or, at least be sorted in the same area), and that, in my eyes would be a Good Thing. One modifier applies, though: If it's worth the trouble for you developers. If this means you have to rip apart everything and rewrite most of it, my vote would change to no.
Another comment regarding sorting that goes slightly off-topic: Personally, I'd much more like to see an option to include non- english articles in the list that Calibre ignores when sorting, like the german "der, die, das". Optimally, this would be implemented as a tweak, pre-filled with the most common articles, say english, french, spanish and german. For this, the same modifier as above applies. |
Advert | |
|
10-03-2010, 09:47 AM | #3 |
Addict
Posts: 254
Karma: 69786
Join Date: May 2006
Location: Oslo, Norway
Device: Kobo Aura, Sony PRS-650
|
I'm all for new features, but please make this one optional, or at least with selectable collation algorithms as you can do in mySQL.
In Norwegian, common collation of oø and aå is wildly incorrect, in fact 'a' and 'å' resides respectively at the beginning and end of our alfabet For names you might also consider language-specific collation. For instance "van Eeden, Frederik Willem" should be collated under E. I presume you would use some sort of library for this? Last edited by Man Eating Duck; 10-03-2010 at 09:50 AM. |
10-03-2010, 09:50 AM | #4 | ||
Grand Sorcerer
Posts: 11,961
Karma: 7225107
Join Date: Jan 2010
Location: Notts, England
Device: Kobo Libra 2
|
Quote:
Quote:
I have looked at using unicode sorting libraries. The amount of work to do so is enormous, and I have no desire to do it. Edit: for names, that is under your control. Enter the sort string using the 'manage authors' function available by right-clicking on any author in the left-hand tag pane. Last edited by chaley; 10-03-2010 at 09:52 AM. |
||
10-03-2010, 10:00 AM | #5 |
Addict
Posts: 254
Karma: 69786
Join Date: May 2006
Location: Oslo, Norway
Device: Kobo Aura, Sony PRS-650
|
|
Advert | |
|
10-03-2010, 10:03 AM | #6 |
Grand Sorcerer
Posts: 11,961
Karma: 7225107
Join Date: Jan 2010
Location: Notts, England
Device: Kobo Libra 2
|
|
10-03-2010, 03:01 PM | #7 | |
Grand Sorcerer
Posts: 11,961
Karma: 7225107
Join Date: Jan 2010
Location: Notts, England
Device: Kobo Libra 2
|
Quote:
It is 'pre-filled' as today, with A, An, The. You can add what you want, or turn it off completely. |
|
10-03-2010, 03:54 PM | #8 |
Wizard
Posts: 3,130
Karma: 91256
Join Date: Feb 2008
Location: Germany
Device: Cybook Gen3
|
|
10-03-2010, 11:16 PM | #9 |
Groupie
Posts: 155
Karma: 112134
Join Date: May 2009
Location: Kuala Lumpur
Device: iPad, K3, K4, T1
|
|
12-03-2010, 07:38 AM | #10 | ||
Connoisseur
Posts: 73
Karma: 11638
Join Date: Aug 2010
Location: Spain
Device: iPad, Kindle Paperwhite, Kobo Libra 2
|
The poll is closed but my vote should have been a resounding yes for the very minimum, common sense implementation:
Code:
a = á = à e = é = è i = í = ì o = ò = ó u = ú =ù Quote:
Quote:
|
||
12-03-2010, 07:45 AM | #11 |
Wizard
Posts: 4,553
Karma: 950151
Join Date: Nov 2008
Device: Sony PRS-950, iphone/ipad (Marvin/iBooks/QuickReader)
|
Note that you can already control the sorting of Authors (where I expect it is most important) by setting the Author-sort field to only use the non-accented versions of characters.
|
12-03-2010, 08:23 AM | #12 |
Grand Sorcerer
Posts: 11,961
Karma: 7225107
Join Date: Jan 2010
Location: Notts, England
Device: Kobo Libra 2
|
This is a non-issue for me, but it seems that you could be the needed 'someone who cares'. I invite you to do the work and submit it. I am sure that Kovid would entertain such a patch.
|
12-03-2010, 09:49 AM | #13 | |
Connoisseur
Posts: 73
Karma: 11638
Join Date: Aug 2010
Location: Spain
Device: iPad, Kindle Paperwhite, Kobo Libra 2
|
Quote:
If the grid where calibre is displaying the data is actually the result of a database query (some view query) whe should be able to skip any calibre change and use database functions for collation Would the following be applicable? How to sort text in sqlite3 with specified locale? So in the end we could use the calibre locale to build database queries like: Code:
SELECT * FROM authors ORDER BY name COLLATE POLISH; |
|
12-03-2010, 10:06 AM | #14 | |
Well trained by Cats
Posts: 30,409
Karma: 58055234
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
Your example works beautiful as long as the data is mono-lingual Kovid and crew have no easy task making Calibre work across many regional differences crew |
|
12-03-2010, 10:27 AM | #15 |
Connoisseur
Posts: 73
Karma: 11638
Join Date: Aug 2010
Location: Spain
Device: iPad, Kindle Paperwhite, Kobo Libra 2
|
The idea I tried to convey was that you have all locale queries created:
Example: Code:
SELECT * FROM authors ORDER BY name COLLATE POLISH; SELECT * FROM authors ORDER BY name COLLATE SPANISH; SELECT * FROM authors ORDER BY name COLLATE FRENCH; ............. To summarize, is this SQLite/database-based approach possible? Code:
calibre_lang = German // Or English, or korean, or whatever db_query = SELEC foo, bar, baz ... whatever COLLATE $calibre_lang |
Tags |
accent, sorting |
Thread Tools | Search this Thread |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Accented characters on PRS-505 | gandalfbp | Calibre | 4 | 04-19-2010 07:48 AM |
PRS-600 any way to type spanish accented characters? | arielinflux | Sony Reader | 1 | 03-17-2010 04:22 AM |
Foreign accented characters and libprs500 | Stingo | Calibre | 6 | 02-24-2008 07:51 PM |
PRS-500 Accented characters onto reader using Mac | squiggle8 | Sony Reader Dev Corner | 9 | 12-06-2007 04:01 PM |
Accented characters | bingle | Sony Reader | 7 | 07-25-2007 06:36 AM |