![]() |
#1 |
Member
![]() Posts: 21
Karma: 10
Join Date: Feb 2010
Device: HTC Magic
|
Sorting names with polish character - bug
I have discovered today that authors with names started with polish characters ie.: Ł are sorted after authors with names starting with "Z".
Has someone any solution for this problem ? Cheers |
![]() |
![]() |
![]() |
#2 |
Memento Mori
![]() Posts: 36
Karma: 10
Join Date: Apr 2007
Device: eClicto, iPad WiFi, Kindle 3 WiFi
|
It's a common problem and a way most software sorts it - I'd say it's because of the Unicode character codes, where 'z' has a lower code than 'ł' ('z' is 007A, 'ł' is 0142 [both are hex] -- or 122 vs. 322).
|
![]() |
![]() |
Advert | |
|
![]() |
#3 |
eBook Enthusiast
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 85,544
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
|
Trouble is, there's no one sorting order that works for all languages. You really need to be able to specify a language, and then define sorting rules for that language.
|
![]() |
![]() |
![]() |
#4 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 820
Karma: 11012
Join Date: Nov 2007
Location: Warsaw, Poland
Device: Bookeen Cybook
|
Yup, in Polish it's supposed to be right after 'L', like 'Ą' is after 'A' and 'Ę' after 'E'... I wonder if there are two languages where the same two characters are put in different order, or if there could be one order that fits all languages (after filling in all non-Ascii characters).
|
![]() |
![]() |
![]() |
#5 |
Memento Mori
![]() Posts: 36
Karma: 10
Join Date: Apr 2007
Device: eClicto, iPad WiFi, Kindle 3 WiFi
|
I guess that could be done by changing the non-ASCII characters to their ASCII counterparts, like changing "Ł" to "L". If we gave them some "priority", we could have it sorted with "Ł" going with other "L"'s, but then, since we've converted it from non-ASCII, move it after the last "L". Or even get all the non-ASCII, sort them, then add them to the sorted list of those ASCII-only after the ASCII character counterpart.
|
![]() |
![]() |
Advert | |
|
![]() |
#6 |
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,188
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
At some point the calibre database will get support for defining book language and then there could conceivably be language dependent sorting, but it will be a while.
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
[Bug] Calibre using all lower case path names | krunk | Calibre | 0 | 05-01-2010 02:58 PM |
File Names (character lenght) | guyanonymous | Calibre | 8 | 02-26-2010 07:36 PM |
Bug when sorting | mores | Calibre | 8 | 10-13-2009 08:52 AM |
Bug (?) related to polish (non-ascii) chars | parasit | Calibre | 8 | 05-30-2009 03:55 PM |
"ascii' codec can't encode character" bug ? | zelda_pinwheel | Calibre | 2 | 12-21-2008 08:12 PM |