![]() |
#16 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,821
Karma: 19162882
Join Date: Nov 2012
Location: Te Riu-a-Māui
Device: Kobo Glo
|
It should be fixed, they are talking about Koreader, Coolreader, or FBReader which are third-party readers that have nothing to do with the readers that come with the Kobo firmware. (Edit: The Kobo firmware has included Dutch hyphenation dictionaries since version 2.8.1b)
Have a look at the second screenshot in this post. The KEPUB hyphenation problem demonstrated in the In the second paragraph of the screenshot sounds like the problem you describe. It has been fixed in firmware 3.16.0. Last edited by GeoffR; 06-13-2015 at 10:08 PM. Reason: ... Dutch hyphenation dictionaries ... |
![]() |
![]() |
![]() |
#17 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 955
Karma: 149907
Join Date: Jul 2013
Location: Rotterdam
Device: HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
|
The pull request is merged, so koreader now has hyphenation support for Dutch.
You could try to enable hyphenation support for Dutch in Kobo Nickel by editing the metadata of your ebook with Calibre: Rightclick on your ebook in the Calibre Library>edit Metadata>Languages>Dutch The first book I checked had no language defined. So how would Kobo know it should Dutch hyphenation? |
![]() |
![]() |
Advert | |
|
![]() |
#18 | |
Connoisseur
![]() ![]() Posts: 68
Karma: 152
Join Date: Jun 2015
Device: Kobo Glo HD
|
Quote:
|
|
![]() |
![]() |
![]() |
#19 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 2,079
Karma: 14079267
Join Date: Oct 2007
Location: Almere, The Netherlands
Device: Kobo Sage
|
|
![]() |
![]() |
![]() |
#20 | ||
Connoisseur
![]() ![]() Posts: 68
Karma: 152
Join Date: Jun 2015
Device: Kobo Glo HD
|
Quote:
Quote:
As soon as punctuation is involved, it goes wrong. To demonstrate this, I have made 2 screenshots, both from the same e-book (which is a kepub file, set on Dutch): pic 1: http://static.tweakers.net/ext/f/KRl...PUTlx/full.jpg pic 2: http://static.tweakers.net/ext/f/Acz...6PGRi/full.jpg Also, for some reason interview gets cut off wrong too, eventough it is an official Dutch word (http://www.vandale.nl/opzoeken?patte....VYGRNfntlBc): pic 3: http://static.tweakers.net/ext/f/Jn6...L3csk/full.jpg Anyway, isn't 3.16.0 supposed to fix hyphenation when punctuation is involved? Last edited by outlandos; 06-17-2015 at 01:08 PM. Reason: typo's |
||
![]() |
![]() |
Advert | |
|
![]() |
#21 | |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,821
Karma: 19162882
Join Date: Nov 2012
Location: Te Riu-a-Māui
Device: Kobo Glo
|
Quote:
|
|
![]() |
![]() |
![]() |
#22 | |
Connoisseur
![]() ![]() Posts: 68
Karma: 152
Join Date: Jun 2015
Device: Kobo Glo HD
|
Quote:
But how do I change it? And should I let someone at kobo on this forum know that it's Dutch hyphenation dictionary is invalid? Last edited by outlandos; 06-17-2015 at 01:34 PM. |
|
![]() |
![]() |
![]() |
#23 | |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,821
Karma: 19162882
Join Date: Nov 2012
Location: Te Riu-a-Māui
Device: Kobo Glo
|
Quote:
I've been reading kepub books for the last few days and haven't seen any problems like this in English though. However there are still some problems with KEPUB hyphenation in 3.16.0, one being that it doesn't respect the LEFTHYPHENMIN and RIGHTHYPHENMIN settings like the EPUB reader does, so maybe there are other problems remaining too. You can replace the Dutch hyphenation dictionary by creating a TGZ file namd KoboRoot.tgz containing the dictionary named ./usr/local/Kobo/hyphenDicts/hyph_nl.dic (e.g. on Linux: tar czf KoboRoot.tgz ./usr/local/Kobo/hyphenDicts/hyph_nl.dic) and placing KoboRoot.tgz in the .kobo directory of the device and safetly eject. |
|
![]() |
![]() |
![]() |
#24 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,821
Karma: 19162882
Join Date: Nov 2012
Location: Te Riu-a-Māui
Device: Kobo Glo
|
Experimenting a bit more, it seems that the KEPUB reader does in fact read the LEFTHYPHENMIN and RIGHTHYPHENMIN values from the hyphenation dictionary, but for some reason it is miscalculating their value. It seems to be reading LEFTHYPHENMIN 3 as if it were LEFTHYPHENMIN 2.
Also the Dutch hyphenation dictionary is missing the LEFTHYPHENMIN and RIGHTHYPHENMIN values altogether, which could be causing problems. If you or someone else wanted to experiment you could try extracting the Dutch hyphenation dictionary from the 3.16.0 firmware and editing it (with a unix-aware text editor such as Notepad++ if using Windows) so that it starts like this (or whatever values you prefer, increased by one): Code:
UTF-8 LEFTHYPHENMIN 3 RIGHTHYPHENMIN 3 |
![]() |
![]() |
![]() |
#25 | |
Connoisseur
![]() ![]() Posts: 68
Karma: 152
Join Date: Jun 2015
Device: Kobo Glo HD
|
Quote:
GeoffR was so nice to send me an edited 'hyph_nl' with the above code modification. The result: 'onderwerp' (which means 'subject' in Dutch) still was spelled onderwer-p. The other two words ('trainingskamp' and 'interview') did end up in the middle of a line, most likely because at default the value =/ 4, so that doesn't tell us much. So unfortunately, the modification did not solve the problem. |
|
![]() |
![]() |
![]() |
#26 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,821
Karma: 19162882
Join Date: Nov 2012
Location: Te Riu-a-Māui
Device: Kobo Glo
|
I found that the Dutch hyphenation dictionary included in the Kobo firmware is just a copy of the openoffice one available here.
There is a hyphenation program available here that can be used to test the hyphenation of words using the hyphenation dictionary. Using the above program with the Dutch hyphenation dictionary (with no HYPHENMIN values, exactly as it comes with the Kobo firmware) gives the following hyphenations: onderwerp on=der=werp onderwerp, on=der=wer=p, So the hyphenation onderwer-p, is valid according to that dictionary. But adding LEFTHYPHENMIN 3 and RIGHTHYPHENMIN 3 to the dictionary gives the hyphenations: onderwerp onder=werp onderwerp, onder=werp, so I hoped that the HYPHENMIN additions would fix the problem, but there seems to be more to it. Edit: There are rules such as kam1p and wer1p in the Dutch hyphenation dictionary which seem to be responsible for these strange hyphenations, removing them fixes these paricular cases but I don't know Dutch so I don't know what other problems removing them would cause. But it does seem that stripping the trailing punctuation from the words before hyphenating them would solve all these problems. The dictionaries have some rules to handle punctuation, but they only seem to be for apostrophes and such that are a part of the word, not other leading and trailing punctuation. It doesn't make sense to me that trailing punctuation should affect the result of the hyphenation algorithm. Last edited by GeoffR; 06-17-2015 at 05:59 PM. Reason: There are rules ... |
![]() |
![]() |
![]() |
#27 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,821
Karma: 19162882
Join Date: Nov 2012
Location: Te Riu-a-Māui
Device: Kobo Glo
|
Some more testing (I've made a Dutch ebook that contains nothing but "onderwerp" followed by various punctuation) and have found some interesting things:
1. The Dutch hyphenation dictionary supplied with the Kobo firmware is incorrectly encoded: It is marked as UTF-8 but it is actually encoded as ISO8859-1. However converting the file to UTF-8 didn't fix the onderwerp problem. It could be that the hyphenation dictionary needs to be rebuilt from scratch as a UTF-8 dictionary? 2. The bad hephenation of onderwerp occurs in the EPUB reader as well as the KEPUB reader. I've never seen this type of problem in the EPUB reader with the English hyphenation dictionary in over two years of reading epubs, so I suspect the real problem is the Dutch hyphenation dictionary. Edit: I find can add rules such as 4p, to the dictionary to prevent the bad hyphenation in the EPUB reader, but it doesn't work for the KEPUB reader. So as well a problem with the hyphenation dictionary, there are still problems with the KEPUB hyphenation algorithm too. Last edited by GeoffR; 06-18-2015 at 03:51 AM. Reason: I find I can add rules ... |
![]() |
![]() |
![]() |
#28 |
Librarian
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 346
Karma: 72225
Join Date: Apr 2015
Location: Liège - Belgium
Device: kobo gloHD - KA1
|
@outlandos: what about trying this method I posted yesterday on the Firmware 3.16.0 thread ?
First, since the hyphen algo is supposed corrected in 3.16.0, (and if I were you), I should try to update the hyphen dictionary inside the kobo by adding the one of openoffice (link and method in the post linked). If it continues to not working, I should use the other method and not using the hyphen algo of kobo at all but use calibre + kobo_extra.css to hyphenate correctly the books. Do you use calibre to send books into your gloHD ? If it’s the case, with KoboTouch plugin or KoboTouchExtended ? Hence, do you read epubs or kepubs ? Last edited by cramoisi; 06-18-2015 at 04:40 AM. |
![]() |
![]() |
![]() |
#29 |
Resident Curmudgeon
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 79,758
Karma: 145864619
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Why not just fix the Dutch hyphenation dictionary every time an incorrect hyphen is found?
|
![]() |
![]() |
![]() |
#30 | |
Librarian
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 346
Karma: 72225
Join Date: Apr 2015
Location: Liège - Belgium
Device: kobo gloHD - KA1
|
Quote:
![]() |
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Hyphenation | Simboubou | PocketBook | 9 | 09-15-2014 05:21 AM |
Hyphenation | Siard | Kobo Reader | 6 | 08-09-2013 07:40 AM |
Hyphenation - again.. | ItsMee | Conversion | 2 | 07-31-2012 03:00 PM |
Hyphenation | Jini | ePub | 3 | 04-05-2012 11:48 AM |
Hyphenation in Dutch | pini | PocketBook | 0 | 06-28-2011 03:34 AM |