Thread: Welsh
06-25-2021, 06:07 PM   #4
Tex2002ans
Wizard
Quote:
Originally Posted by Notjohn
How should I handle the name of Llanfairpwllgwyngyllgogerychwyrndrobwllllantysilio gogogoch?
Llan-fair-pwll-gwyn-gyll-gog-er-ych-wyrn-dro-bwllll-ant-ysil-iog-ogogoch

Quote:
Originally Posted by Notjohn
I came upon this statement in a related thread:

Quote:
Originally Posted by Tex2002ans
If you mark your code properly, Multi-Language Hyphenation should work in a properly designed reader.
The HTML language code for Welsh is "cy".

So you'd mark this in your HTML:

Code:
<p>A Welsh word is: <span lang="cy" xml:lang="cy">Llanfairpwllgwyngyllgogerychwyrndrobwllllantysiliogogogoch</span>.</p>
then rely on Welsh Hyphenation Dictionaries.

Luckily, people have already created that for you.

See the list at Hyphenation.org: "Languages".
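
Going back to the markup step, here is a slightly fuller sketch, assuming an XHTML file whose main language is English. The document-level lang on <html> covers the surrounding text, and the span overrides it for the one Welsh word. The title and the "hyphens: auto" rule are my own assumptions for illustration (nothing above depends on CSS): some readers hyphenate by default, and some ignore the property.

```html
<!DOCTYPE html>
<html xmlns="http://www.w3.org/1999/xhtml" lang="en" xml:lang="en">
<head>
  <meta charset="utf-8" />
  <!-- Hypothetical title, for illustration only. -->
  <title>Language tagging sketch</title>
  <style>
    /* Assumption: the reader honours CSS hyphenation at all. */
    p {
      -webkit-hyphens: auto;
      hyphens: auto;
    }
  </style>
</head>
<body>
  <!-- The paragraph inherits lang="en" from <html>. -->
  <p>A Welsh word is:
    <!-- The span switches the language to Welsh for this word only. -->
    <span lang="cy" xml:lang="cy">Llanfairpwllgwyngyllgogerychwyrndrobwllllantysiliogogogoch</span>.</p>
</body>
</html>
```

Whether the Welsh word actually breaks correctly still depends on the reader having a Welsh hyphenation dictionary available.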

If you wanted proper Welsh hyphenation on your actual ereader (like Kobo), you'd do a few tweaks to those files so your device can understand them.

JSWolf explains how + goes into more detail in this thread:

Kobo Reader > "Better Hyphenation"

Kobo's hyphenation dictionaries use LibreOffice's hyphenation format + a few different lines at the beginning.

(And LibreOffice's format has a few minor differences from the raw TeX/Hyphenation.org ones.)

Quote:
Originally Posted by Quoth
Manually hyphenate it.
No. Bad idea. Rely on the hyphenation dictionaries.

Rare Exception: If you're creating this ebook for personal use... then PERHAPS, maybe, you could run the HyphenateThis! plugin on it.

This inserts tons of soft hyphens throughout your text.
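
To give a rough idea of what that looks like in the markup, here is a hand-written sketch of the same word with a soft hyphen (U+00AD, written as the numeric reference &#173;) at each of the Welsh break points listed in this post. It is an illustration only. I typed it by hand, and the plugin's real output may differ.

```html
<!-- Illustration only: soft hyphens (&#173;) typed in by hand at the Welsh break points. -->
<p>A Welsh word is: Llan&#173;fair&#173;pwll&#173;gwyn&#173;gyll&#173;gog&#173;er&#173;ych&#173;wyrn&#173;dro&#173;bwllll&#173;ant&#173;ysil&#173;iog&#173;ogogoch.</p>
```

The soft hyphens are invisible until a line actually breaks there, but they are now part of the text itself.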

But there are many disadvantages to this. I wrote about that in more detail in:

2018: "Auto-Hyphenation".

Never ever clog up an ebook for sale with soft hyphens.

Note: You do have to make sure you're using the proper dictionary. If you use the wrong one, you'd get this:

✓ Llan-fair-pwll-gwyn-gyll-gog-er-ych-wyrn-dro-bwllll-ant-ysil-iog-ogogoch (Welsh)
✗ Llan-fair-p-wll-gwyn-gyll-gogerych-wyrn-drob-wl-l-l-lan-tysil-i-o-gogogoch (English)

Quote:
Originally Posted by Notjohn
I will manually hyphenate it in the print edition, but what about the Sigil-generated epub?

(When I preview this, I see a space in it, though there is no space in the original.)
You shouldn't rely on manual hyphenation in print either. Mark the language properly, then rely on the Hyphenation Dictionaries.

That will make sure it breaks at valid locations + optimally distributes the spacing throughout the paragraph.

People have already done all the hard legwork for you.

Last edited by Tex2002ans; 06-25-2021 at 07:17 PM.