Old 06-13-2015, 10:02 PM   #16
GeoffR
Wizard
Posts: 3,821
Karma: 19162882
Join Date: Nov 2012
Location: Te Riu-a-Māui
Device: Kobo Glo
Quote:
Originally Posted by outlandos View Post
So the problem won't be fixed when 3.16 arrives?
It should be fixed. They are talking about Koreader, Coolreader, or FBReader, which are third-party readers that have nothing to do with the readers that come with the Kobo firmware. (Edit: The Kobo firmware has included Dutch hyphenation dictionaries since version 2.8.1b.)

Have a look at the second screenshot in this post. The KEPUB hyphenation problem demonstrated in the second paragraph of the screenshot sounds like the problem you describe. It has been fixed in firmware 3.16.0.

Last edited by GeoffR; 06-13-2015 at 10:08 PM. Reason: ... Dutch hyphenation dictionaries ...
Old 06-14-2015, 12:51 AM   #17
Markismus
Guru
Posts: 955
Karma: 149907
Join Date: Jul 2013
Location: Rotterdam
Device: HiSenseA5ProCC, Cracked OnyxNotePro, Note5, Kobo Glo, Aura
The pull request is merged, so koreader now has hyphenation support for Dutch.

You could try to enable hyphenation support for Dutch in Kobo Nickel by editing the metadata of your ebook with Calibre:
Right-click on your ebook in the Calibre Library > Edit metadata > Languages > Dutch

The first book I checked had no language defined. So how would Kobo know it should use Dutch hyphenation?
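For anyone who wants to check whether a book declares a language before sending it to the device, here is a minimal Python sketch. It assumes the book is a standard EPUB whose language is stored as a dc:language element in the OPF file; the function name epub_languages is made up for this example, and whether Nickel looks at exactly this field is an assumption, not something confirmed here.

```python
import zipfile
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"
CONTAINER_NS = "urn:oasis:names:tc:opendocument:xmlns:container"


def epub_languages(epub_file):
    """Return the dc:language values declared in an EPUB (hypothetical helper)."""
    with zipfile.ZipFile(epub_file) as zf:
        # META-INF/container.xml points at the OPF package document.
        container = ET.fromstring(zf.read("META-INF/container.xml"))
        rootfile = container.find(f".//{{{CONTAINER_NS}}}rootfile")
        opf = ET.fromstring(zf.read(rootfile.attrib["full-path"]))
    # An empty list means the book declares no language at all.
    return [el.text.strip() for el in opf.iter(f"{{{DC_NS}}}language") if el.text]
```

An empty result would match the "no language defined" case described above; a Dutch book would be expected to report something like nl.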
Old 06-14-2015, 07:01 AM   #18
outlandos
Connoisseur
Posts: 68
Karma: 152
Join Date: Jun 2015
Device: Kobo Glo HD
Quote:
Originally Posted by Markismus View Post
The pull request is merged, so koreader now has hyphenation support for Dutch.

You could try to enable hyphenation support for Dutch in Kobo Nickel by editing the metadata of your ebook with Calibre:
Right-click on your ebook in the Calibre Library > Edit metadata > Languages > Dutch

The first book I checked had no language defined. So how would Kobo know it should use Dutch hyphenation?
This fixed my problem. Obviously Kobo is no Google and thus could not detect the language of the book; manually setting it to Dutch did help. Thanks all!
Old 06-16-2015, 07:00 AM   #19
mbovenka
Wizard
Posts: 2,079
Karma: 14079267
Join Date: Oct 2007
Location: Almere, The Netherlands
Device: Kobo Sage
Quote:
Originally Posted by Ken Maltby View Post
You are probably correct, I should have used Swedish or Norwegian.
Yup, Finnish is a Finno-Ugric language (related to Hungarian and Estonian).
Old 06-17-2015, 11:26 AM   #20
outlandos
Connoisseur
Posts: 68
Karma: 152
Join Date: Jun 2015
Device: Kobo Glo HD
Quote:
Originally Posted by davidfor View Post
Firmware version 3.16.0 has fixes for hyphenation. The changes were for all languages. Hopefully it fixes the problems with Dutch.

As 3.16.0 was not released for the H2O, you have a choice of manually installing it from one of the other firmware packages, or waiting for the next release. Kobo don't preannounce any releases, so we don't know when it will be.
Quote:
Originally Posted by GeoffR View Post
It should be fixed. They are talking about Koreader, Coolreader, or FBReader, which are third-party readers that have nothing to do with the readers that come with the Kobo firmware. (Edit: The Kobo firmware has included Dutch hyphenation dictionaries since version 2.8.1b.)

Have a look at the second screenshot in this post. The KEPUB hyphenation problem demonstrated in the second paragraph of the screenshot sounds like the problem you describe. It has been fixed in firmware 3.16.0.
Unfortunately, 3.16.0 did not fix the Dutch hyphenation completely.

As soon as punctuation is involved, it goes wrong. To demonstrate this, I have made 2 screenshots, both from the same e-book (which is a kepub file, set to Dutch):

pic 1: http://static.tweakers.net/ext/f/KRl...PUTlx/full.jpg

pic 2: http://static.tweakers.net/ext/f/Acz...6PGRi/full.jpg

Also, for some reason "interview" gets split incorrectly too, even though it is an official Dutch word (http://www.vandale.nl/opzoeken?patte....VYGRNfntlBc):

pic 3: http://static.tweakers.net/ext/f/Jn6...L3csk/full.jpg

Anyway, isn't 3.16.0 supposed to fix hyphenation when punctuation is involved?

Last edited by outlandos; 06-17-2015 at 01:08 PM. Reason: typo's
Old 06-17-2015, 01:26 PM   #21
GeoffR
Wizard
Posts: 3,821
Karma: 19162882
Join Date: Nov 2012
Location: Te Riu-a-Māui
Device: Kobo Glo
Quote:
Originally Posted by outlandos View Post
Unfortunately, 3.16.0 did not fix the Dutch hyphenation completely.

As soon as punctuation is involved, it goes wrong. To demonstrate this, I have made 2 screenshots, both from the same e-book (which is a kepub file, set to Dutch):

pic 1: http://static.tweakers.net/ext/f/KRl...PUTlx/full.jpg

pic 2: http://static.tweakers.net/ext/f/Acz...6PGRi/full.jpg
I had a look in the Dutch hyphenation dictionary, and it appears that wer-p and kam-p are listed as valid hyphenations, so if they are wrong then the problem might be with the dictionary. If you have a better dictionary you can install it as a replacement.
Old 06-17-2015, 01:31 PM   #22
outlandos
Connoisseur
Posts: 68
Karma: 152
Join Date: Jun 2015
Device: Kobo Glo HD
Quote:
Originally Posted by GeoffR View Post
I had a look in the Dutch hyphenation dictionary, and it appears that wer-p and kam-p are listed as valid hyphenations, so if they are wrong then the problem might be with the dictionary. If you have a better dictionary you can install it as a replacement.
They are very wrong.

But how do I change it? And should I let someone at Kobo on this forum know that its Dutch hyphenation dictionary is invalid?

Last edited by outlandos; 06-17-2015 at 01:34 PM.
Old 06-17-2015, 01:51 PM   #23
GeoffR
Wizard
Posts: 3,821
Karma: 19162882
Join Date: Nov 2012
Location: Te Riu-a-Māui
Device: Kobo Glo
Quote:
Originally Posted by outlandos View Post
They are very wrong.

But how do I change it? And should I let someone at Kobo on this forum know that its Dutch hyphenation dictionary is invalid?
I might be misreading the dictionary, as I am not sure about the format. Perhaps kam-p and wer-p are only valid if they are in the middle of a word and not at the end.

I've been reading kepub books for the last few days and haven't seen any problems like this in English though.

However there are still some problems with KEPUB hyphenation in 3.16.0, one being that it doesn't respect the LEFTHYPHENMIN and RIGHTHYPHENMIN settings like the EPUB reader does, so maybe there are other problems remaining too.

You can replace the Dutch hyphenation dictionary by creating a TGZ file named KoboRoot.tgz containing the dictionary named ./usr/local/Kobo/hyphenDicts/hyph_nl.dic (e.g. on Linux: tar czf KoboRoot.tgz ./usr/local/Kobo/hyphenDicts/hyph_nl.dic), placing KoboRoot.tgz in the .kobo directory of the device, and then safely ejecting the device.
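For those not on Linux, the packaging step can be sketched in Python with the standard tarfile module. This is only an illustration of the tar command above: the function name build_koboroot and its parameters are made up for this example, and it assumes the replacement dictionary is sitting somewhere on your computer as an ordinary file.

```python
import tarfile
from pathlib import Path

# Path the dictionary must have inside the archive, as given in the post above.
ARCNAME = "./usr/local/Kobo/hyphenDicts/hyph_nl.dic"


def build_koboroot(dictionary_path, output_path="KoboRoot.tgz"):
    """Pack a replacement hyph_nl.dic into a gzip-compressed tar archive."""
    with tarfile.open(output_path, "w:gz") as tar:
        # arcname sets the in-archive path, so the local directory tree
        # does not have to be recreated first.
        tar.add(str(dictionary_path), arcname=ARCNAME)
    return Path(output_path)
```

Calling build_koboroot("hyph_nl.dic") would write KoboRoot.tgz to the current directory; the file still has to be copied into the .kobo directory by hand.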
Old 06-17-2015, 02:45 PM   #24
GeoffR
Wizard
Posts: 3,821
Karma: 19162882
Join Date: Nov 2012
Location: Te Riu-a-Māui
Device: Kobo Glo
Experimenting a bit more, it seems that the KEPUB reader does in fact read the LEFTHYPHENMIN and RIGHTHYPHENMIN values from the hyphenation dictionary, but for some reason it is miscalculating their value. It seems to be reading LEFTHYPHENMIN 3 as if it were LEFTHYPHENMIN 2.

Also the Dutch hyphenation dictionary is missing the LEFTHYPHENMIN and RIGHTHYPHENMIN values altogether, which could be causing problems.

If you or someone else wanted to experiment you could try extracting the Dutch hyphenation dictionary from the 3.16.0 firmware and editing it (with a unix-aware text editor such as Notepad++ if using Windows) so that it starts like this (or whatever values you prefer, increased by one):
Code:
UTF-8
LEFTHYPHENMIN 3
RIGHTHYPHENMIN 3
and then installing it as described in my previous post. It would be interesting to see if that fixed (or at least improved) the problem you are experiencing.
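If hand-editing is awkward, the header change can be sketched in Python. This assumes the first line of the dictionary is the encoding line (as in the snippet above); the function name with_hyphenmin is invented for this example, and the default values of 3 are just the ones suggested in this post.

```python
def with_hyphenmin(dic_text, left=3, right=3):
    """Return dictionary text with LEFTHYPHENMIN/RIGHTHYPHENMIN after the encoding line."""
    lines = dic_text.splitlines()
    # Assumption: line 1 is the encoding declaration, e.g. "UTF-8".
    encoding_line, rest = lines[0], lines[1:]
    # Drop any existing HYPHENMIN lines so the values are not declared twice.
    rest = [ln for ln in rest
            if not ln.startswith(("LEFTHYPHENMIN", "RIGHTHYPHENMIN"))]
    header = [encoding_line, f"LEFTHYPHENMIN {left}", f"RIGHTHYPHENMIN {right}"]
    # Join with "\n" so the result keeps Unix line endings.
    return "\n".join(header + rest) + "\n"
```

The function only works on text that has already been read from the file, so reading and writing with the right encoding is left to the caller.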
Old 06-17-2015, 05:05 PM   #25
outlandos
Connoisseur
Posts: 68
Karma: 152
Join Date: Jun 2015
Device: Kobo Glo HD
Quote:
Originally Posted by GeoffR View Post
Experimenting a bit more, it seems that the KEPUB reader does in fact read the LEFTHYPHENMIN and RIGHTHYPHENMIN values from the hyphenation dictionary, but for some reason it is miscalculating their value. It seems to be reading LEFTHYPHENMIN 3 as if it were LEFTHYPHENMIN 2.

Also the Dutch hyphenation dictionary is missing the LEFTHYPHENMIN and RIGHTHYPHENMIN values altogether, which could be causing problems.

If you or someone else wanted to experiment you could try extracting the Dutch hyphenation dictionary from the 3.16.0 firmware and editing it (with a unix-aware text editor such as Notepad++ if using Windows) so that it starts like this (or whatever values you prefer, increased by one):
Code:
UTF-8
LEFTHYPHENMIN 4
RIGHTHYPHENMIN 4
and then installing it as described in my previous post. It would be interesting to see if that fixed (or at least improved) the problem you are experiencing.
Just a heads-up for anyone interested:

GeoffR was kind enough to send me an edited 'hyph_nl' with the above code modification. The result: 'onderwerp' (which means 'subject' in Dutch) was still split as onderwer-p. The other two words ('trainingskamp' and 'interview') did end up in the middle of a line, most likely because the default value is not 4, so that doesn't tell us much.

So unfortunately, the modification did not solve the problem.
Old 06-17-2015, 05:24 PM   #26
GeoffR
Wizard
Posts: 3,821
Karma: 19162882
Join Date: Nov 2012
Location: Te Riu-a-Māui
Device: Kobo Glo
I found that the Dutch hyphenation dictionary included in the Kobo firmware is just a copy of the openoffice one available here.

There is a hyphenation program available here that can be used to test the hyphenation of words using the hyphenation dictionary.

Using the above program with the Dutch hyphenation dictionary (with no HYPHENMIN values, exactly as it comes with the Kobo firmware) gives the following hyphenations:

onderwerp on=der=werp
onderwerp, on=der=wer=p,

So the hyphenation onderwer-p, is valid according to that dictionary.

But adding LEFTHYPHENMIN 3 and RIGHTHYPHENMIN 3 to the dictionary gives the hyphenations:

onderwerp onder=werp
onderwerp, onder=werp,

so I hoped that the HYPHENMIN additions would fix the problem, but there seems to be more to it.


Edit: There are rules such as kam1p and wer1p in the Dutch hyphenation dictionary which seem to be responsible for these strange hyphenations. Removing them fixes these particular cases, but I don't know Dutch, so I don't know what other problems removing them would cause.

But it does seem that stripping the trailing punctuation from the words before hyphenating them would solve all these problems. The dictionaries have some rules to handle punctuation, but they only seem to be for apostrophes and such that are a part of the word, not other leading and trailing punctuation. It doesn't make sense to me that trailing punctuation should affect the result of the hyphenation algorithm.
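The stripping idea can be sketched in Python as follows. This is not how the Kobo reader works, just an illustration of the suggestion: split_punctuation and hyphenate_token are made-up names, hyphenate_word stands in for whatever hyphenation routine is actually used, and only ASCII punctuation from string.punctuation is handled.

```python
import string


def split_punctuation(token):
    """Split a token into (leading punctuation, core word, trailing punctuation)."""
    stripped_left = token.lstrip(string.punctuation)
    lead = token[:len(token) - len(stripped_left)]
    core = stripped_left.rstrip(string.punctuation)
    trail = stripped_left[len(core):]
    return lead, core, trail


def hyphenate_token(token, hyphenate_word):
    """Hyphenate only the core word, then reattach the punctuation unchanged."""
    lead, core, trail = split_punctuation(token)
    if not core:
        # Token is all punctuation; there is nothing to hyphenate.
        return token
    return lead + hyphenate_word(core) + trail


# Stand-in hyphenator: a lookup table using the result quoted above for the
# bare word. A real one would apply the dictionary's patterns instead.
def toy_hyphenator(word):
    return {"onderwerp": "on=der=werp"}.get(word, word)
```

With this wrapper, "onderwerp," would be hyphenated exactly like "onderwerp", with the comma reattached afterwards. Apostrophes and hyphens inside a word are left alone because only the ends of the token are stripped.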

Last edited by GeoffR; 06-17-2015 at 05:59 PM. Reason: There are rules ...
Old 06-18-2015, 01:35 AM   #27
GeoffR
Wizard
Posts: 3,821
Karma: 19162882
Join Date: Nov 2012
Location: Te Riu-a-Māui
Device: Kobo Glo
I did some more testing (I made a Dutch ebook that contains nothing but "onderwerp" followed by various punctuation) and found some interesting things:

1. The Dutch hyphenation dictionary supplied with the Kobo firmware is incorrectly encoded: It is marked as UTF-8 but it is actually encoded as ISO8859-1. However converting the file to UTF-8 didn't fix the onderwerp problem. It could be that the hyphenation dictionary needs to be rebuilt from scratch as a UTF-8 dictionary?

2. The bad hyphenation of onderwerp occurs in the EPUB reader as well as the KEPUB reader. I've never seen this type of problem in the EPUB reader with the English hyphenation dictionary in over two years of reading epubs, so I suspect the real problem is the Dutch hyphenation dictionary.

Edit: I find I can add rules such as 4p, to the dictionary to prevent the bad hyphenation in the EPUB reader, but it doesn't work for the KEPUB reader. So as well as a problem with the hyphenation dictionary, there are still problems with the KEPUB hyphenation algorithm too.
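For point 1, the encoding check and conversion can be sketched in Python. This only reproduces the re-encoding step, which by itself did not fix the onderwerp problem as noted above. The function names are made up for this example, and ISO-8859-1 as the source encoding is taken from the observation in point 1.

```python
def looks_like_utf8(raw_bytes):
    """Return True if the bytes decode cleanly as UTF-8."""
    try:
        raw_bytes.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def reencode_dictionary(raw_bytes, source_encoding="iso-8859-1"):
    """Decode dictionary bytes from source_encoding and re-encode them as UTF-8."""
    # ISO-8859-1 maps every byte to a character, so this decode cannot fail;
    # it is only meaningful if the file really was saved in that encoding.
    text = raw_bytes.decode(source_encoding)
    return text.encode("utf-8")
```

A file made up purely of ASCII characters passes looks_like_utf8 either way, so the check only says something when the dictionary contains accented letters.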

Last edited by GeoffR; 06-18-2015 at 03:51 AM. Reason: I find I can add rules ...
Old 06-18-2015, 03:53 AM   #28
cramoisi
Librarian
Posts: 346
Karma: 72225
Join Date: Apr 2015
Location: Liège - Belgium
Device: kobo gloHD - KA1
@outlandos: what about trying the method I posted yesterday on the Firmware 3.16.0 thread?

First, since the hyphenation algorithm is supposed to be corrected in 3.16.0, if I were you I would try updating the hyphenation dictionary on the Kobo with the OpenOffice one (link and method are in the linked post). If that still does not work, I would use the other method: skip Kobo's hyphenation algorithm altogether and use Calibre + kobo_extra.css to hyphenate the books correctly.

Do you use Calibre to send books to your Glo HD? If so, with the KoboTouch plugin or KoboTouchExtended? In other words, do you read epubs or kepubs?

Last edited by cramoisi; 06-18-2015 at 04:40 AM.
Old 06-18-2015, 04:21 AM   #29
JSWolf
Resident Curmudgeon
Posts: 79,758
Karma: 145864619
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
Why not just fix the Dutch hyphenation dictionary every time an incorrect hyphen is found?
Old 06-18-2015, 04:42 AM   #30
cramoisi
Librarian
Posts: 346
Karma: 72225
Join Date: Apr 2015
Location: Liège - Belgium
Device: kobo gloHD - KA1
Quote:
Originally Posted by JSWolf View Post
Why not just fix the Dutch hyphenation dictionary every time an incorrect hyphen is found?
Updating the whole dictionary with a correct one (from LibreOffice or OpenOffice) seems easier to me. The idea is to read books, not to kill the pleasure by debugging a system while reading...