Old 12-01-2022, 06:28 AM   #16
Quoth
the rook, bossing Never.
Posts: 6,634
Karma: 54000001
Join Date: Jun 2017
Location: Ireland
Device: Both Kinds: epub based makes and Kindle
Quote:
Originally Posted by jhowell
I don’t think that MOBI supports Japanese.

The old mobi/azw/KF7 format only supports a subset of Greek in addition to Latin (Roman). Lack of fonts on the ereaders is one issue.
No Chinese, Korean, Hindi or Japanese (or any other Asian script).
No Cyrillic, Hebrew or Arabic.
The installed fonts do support the extra letters used in Icelandic, Spanish, French, German etc. that are not used in English.

It was really very bad of Amazon to release such limited GUI/language support in 2007. The underlying OS had supported these scripts more than 10 years earlier. They seem to have just bolted the Mobipocket stuff they bought in 2005 on top of Linux. It was very limited because it had originally supported Palm OS, Symbian, Windows CE and maybe DOS.
Old 12-02-2022, 08:58 PM   #17
colinsky
Addict
Posts: 210
Karma: 3232318
Join Date: Sep 2009
Device: Sony PRS-300, PRS-T1, PRS-T3
Quote:
Originally Posted by jhowell
I don’t think that MOBI supports Japanese.

KF8 (azw3) does but it relies on an included word boundary table (GESW records) that is generated during the publishing process. I do not think that there is any way to add that to a book that was not sold by Amazon.

I am not sure about KFX. It might work better in that format.

Update: Like KF8, KFX format also includes word boundary information, but only in published books.

Thanks. Most everything I have played with has been from an original EPUB source (converted by Calibre), so I'm not sure how the behavior I am seeing is arising.

Basically: MOBI doesn't seem to attempt any word segmentation (you can select any combination of characters). AZW/KFX only let you select along some notion of word boundaries (kanji + inflection). I'll have to do some more experimentation to see whether that derives from a source feature (like a RUBY tag) or from some other process.

Are you aware of documentation (or reverse engineering) that describes the word boundary information in KF8 or KFX?
Old 12-03-2022, 06:55 PM   #18
jhowell
Grand Sorcerer
Posts: 5,787
Karma: 64828995
Join Date: Nov 2011
Location: Tampa Bay, Florida
Device: Oasis 2, iPad, Nexus 7
Quote:
Originally Posted by colinsky
Are you aware of documentation (or reverse engineering) that describes the word boundary information in KF8 or KFX?

I have not seen any documentation on this. I looked into the KF8 GESW years ago but did not write up my findings. If I recall correctly, it is a compressed table of coded instructions for parsing the raw HTML content to determine which bytes make up each word, taking into account that they might not be contiguous due to HTML markup.
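
To illustrate only the problem such a table has to solve (this is not the GESW encoding, which remains undocumented in this thread), here is a minimal Python sketch. It assumes the word list is already known and concatenates to the visible text; the function names `visible_chars` and `word_byte_runs` are hypothetical, and entities, ruby annotations and real HTML parsing are deliberately ignored.

```python
import re
from typing import List, Tuple

# Naive tag matcher; an assumption for illustration, not a real HTML parser.
TAG = re.compile(r"<[^>]*>")


def visible_chars(html: str) -> List[Tuple[str, int, int]]:
    """Return (char, byte_start, byte_end) for each character outside tags.

    Offsets index into html.encode("utf-8").
    """
    out = []
    byte_pos = 0
    char_pos = 0
    for m in TAG.finditer(html):
        # Text between the previous tag and this one.
        for ch in html[char_pos:m.start()]:
            n = len(ch.encode("utf-8"))
            out.append((ch, byte_pos, byte_pos + n))
            byte_pos += n
        # Skip over the tag itself, but keep counting its bytes.
        byte_pos += len(m.group().encode("utf-8"))
        char_pos = m.end()
    # Trailing text after the last tag.
    for ch in html[char_pos:]:
        n = len(ch.encode("utf-8"))
        out.append((ch, byte_pos, byte_pos + n))
        byte_pos += n
    return out


def word_byte_runs(html: str, words: List[str]) -> List[List[Tuple[int, int]]]:
    """Map each word to one or more (start, end) byte runs in the raw HTML.

    A word gets several runs when markup splits it.
    """
    chars = visible_chars(html)
    result = []
    i = 0
    for word in words:
        runs: List[Tuple[int, int]] = []
        for ch in word:
            if i >= len(chars) or chars[i][0] != ch:
                raise ValueError("words do not match the visible text")
            _, start, end = chars[i]
            if runs and runs[-1][1] == start:
                runs[-1] = (runs[-1][0], end)  # extend the contiguous run
            else:
                runs.append((start, end))      # markup intervened: new run
            i += 1
        result.append(runs)
    return result


if __name__ == "__main__":
    sample = "<p>食べ<b>る</b>人</p>"
    print(word_byte_runs(sample, ["食べる", "人"]))
    # → [[(3, 9), (12, 15)], [(19, 22)]]
```

In the sample, the word 食べる is split by the `<b>` tag, so it maps to two separate byte runs, which is the non-contiguous case described above.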
All times are GMT -4.