Old 06-30-2020, 07:28 AM   #16
KevinH
Wizard
Posts: 4,380
Karma: 2622176
Join Date: Nov 2009
Device: many
It should be rare on the same node of the DOM tree, since a new span will create a new node. So as long as spans (and other tags) use the proper lang attributes it should really not happen. Without the proper lang attributes added, Sigil will NOT be guessing language. That is a kettle of fish that Sigil will not be opening.
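To illustrate the per-node idea, here is a minimal Python sketch (not Sigil's actual code) that walks an XHTML fragment and reports the effective language of each text run, inheriting from the nearest ancestor that declares one. The function name `text_runs_with_lang` and the sample markup are made up for the example.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes xml:lang under the predefined XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

def text_runs_with_lang(elem, inherited=None):
    """Yield (language, text) for each text run, inheriting lang from ancestors."""
    # Prefer xml:lang, fall back to lang, then to whatever the parent had.
    lang = elem.get(XML_LANG) or elem.get("lang") or inherited
    if elem.text and elem.text.strip():
        yield lang, elem.text.strip()
    for child in elem:
        yield from text_runs_with_lang(child, lang)
        # Tail text comes after the child but belongs to the parent element.
        if child.tail and child.tail.strip():
            yield lang, child.tail.strip()

sample = '<p lang="en">He said <span lang="fr">bonjour</span> and left.</p>'
runs = list(text_runs_with_lang(ET.fromstring(sample)))
print(runs)  # [('en', 'He said'), ('fr', 'bonjour'), ('en', 'and left.')]
```

Text with no lang attribute anywhere up the tree comes back with `None`, which is exactly the case where no guessing is done.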
Old 06-30-2020, 07:58 AM   #17
BetterRed
null operator
Posts: 16,245
Karma: 18817784
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Quote:
Originally Posted by mcdummy View Post

I've read many ebooks that are written in a first language, but contain terms or citations from other languages. If the creator of the ebook does not mark all instances of a word in the proper language, you end up with this situation.
Not so bad if the markuperer italicises the foreign phrases; at least then you can search and eyeball them, provided they do it consistently, which of course they often don't.

Last edited by BetterRed; 06-30-2020 at 08:01 AM.
Old 06-30-2020, 08:23 AM   #18
DiapDealer
Grand Sorcerer
Posts: 22,546
Karma: 125997190
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
Quote:
Originally Posted by KevinH View Post
Without the proper lang attributes added, Sigil will NOT be guessing language. That is a kettle of fish that Sigil will not be opening.
Absolutely agree. The expectation will be that the proper language attributes have already been correctly added. If this is not the case, users will need to add/correct them themselves (or take it up with an ebook's creator) if they want multi-language spellcheck to be the most useful in Sigil. GIGO still very much applies.
Old 07-03-2020, 03:15 AM   #19
mcdummy
Connoisseur
Posts: 55
Karma: 7130
Join Date: Apr 2015
Device: PRS-T3
Quote:
Originally Posted by Tex2002ans View Post
And which ereader are you using that applies proper hyphenation?
I'm using a PRS-T3, which does not apply hyphenation to all languages.

Quote:
Originally Posted by Tex2002ans View Post
Does it work at the per-word level too? Or only works on a per-book's-language level?
I'm trying to figure this out.

My PRS-T3 seems to work at least at the HTML-file level, i.e. it can change the language when a new HTML file is processed.

So far, I haven't figured out which language instructions it processes and which it ignores (e.g., xml:lang="..." vs. lang="..." or en-US vs. en_US).

For instance, the PRS-T3 seems to ignore en_US/en_GB/de_DE/fr_..., while en-US/en-GB/de-DE/fr-... seem to work.
Old 07-03-2020, 04:55 AM   #20
Tex2002ans
Wizard
Posts: 1,592
Karma: 7401109
Join Date: Jul 2012
Device: Kobo Forma, Nook
Quote:
Originally Posted by mcdummy View Post
I'm using a PRS-T3, which does not apply hyphenation to all languages.
Thanks for the info. I'm very interested in multi-language hyphenation.

Even many browsers don't handle hyphenation properly yet, which is why I wanted to know whether you had found a reader that could do it at that level.

Quote:
Originally Posted by mcdummy View Post
My PRS-T3 seems to work at least at the HTML-file level, i.e. it can change the language when a new HTML file is processed.
Probably a good assumption.

Quote:
Originally Posted by mcdummy View Post
So far, I haven't figured out which language instructions it processes and which it ignores (e.g., xml:lang="..." vs. lang="..." or en-US vs. en_US).
Using _ is invalid. Only - is allowed as the separator between subtags.

See "Tags for Identifying Languages" (BCP 47) and the W3C's page on "Language tags in HTML and XML".
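If you want to catch underscore tags before they reach the reader, a rough Python sketch like the one below might help. The regex is a deliberately simplified shape check, not a full BCP 47 validator, and the helper names `looks_well_formed` and `fix_separator` are invented for this example.

```python
import re

# Simplified shape: a 2-3 letter primary subtag, then optional
# hyphen-separated alphanumeric subtags. NOT a complete BCP 47 check.
SIMPLE_TAG = re.compile(r"[A-Za-z]{2,3}(-[A-Za-z0-9]{1,8})*")

def looks_well_formed(tag):
    """Return True if the tag has the basic hyphen-separated shape."""
    return SIMPLE_TAG.fullmatch(tag) is not None

def fix_separator(tag):
    """Replace the invalid underscore separator with a hyphen."""
    return tag.replace("_", "-")

for tag in ["en_US", "en-US", "de_DE", "fr"]:
    print(tag, looks_well_formed(tag), fix_separator(tag))
```

This only checks the shape of the tag; it doesn't verify that the subtags are actually registered.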

Also, when XHTML is processed as XML, xml:lang takes priority:

Quote:
The xml:lang attribute is not actually useful for handling the file as HTML, but takes over from the lang attribute any time you process or serve the document as XML. The lang attribute is allowed by the syntax of XHTML, and may also be recognized by browsers. When using other XML parsers, however (such as the lang() function in XSLT) you can't rely on the lang attribute being recognized.
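Since the two attributes can disagree, it may be worth checking that they match wherever both are present. Below is a minimal Python sketch of such a check; the function name `conflicting_lang_attrs` and the sample markup are assumptions for illustration, not something from Sigil or the W3C page.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes xml:lang under the predefined XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

def conflicting_lang_attrs(root):
    """Return (tag, lang, xml_lang) for elements whose two attributes disagree."""
    conflicts = []
    for elem in root.iter():
        lang, xml_lang = elem.get("lang"), elem.get(XML_LANG)
        # Compare case-insensitively; only flag elements carrying both attributes.
        if lang is not None and xml_lang is not None and lang.lower() != xml_lang.lower():
            conflicts.append((elem.tag, lang, xml_lang))
    return conflicts

doc = ET.fromstring(
    '<div><p lang="en" xml:lang="en">ok</p>'
    '<p lang="en" xml:lang="de">nicht ok</p></div>'
)
found = conflicting_lang_attrs(doc)
print(found)  # [('p', 'en', 'de')]
```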
Quote:
Originally Posted by mcdummy View Post
For instance, the PRS-T3 seems to ignore en_US/en_GB/de_DE/fr_..., while en-US/en-GB/de-DE/fr-... seem to work.
Also, it's best to stick with the shortest tag that works. Better to specify the language broadly (en) than to over-specify it wrongly (en-US on an en-GB document) or redundantly.

See the W3C's "Choosing a Language Tag":

Quote:
Always bear in mind that the golden rule is to keep your language tag as short as possible. Only add further subtags to your language tag if they are needed to distinguish the language from something else in the context where your content is used.
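One possible reading of that rule, sketched in Python: drop the region subtag unless the same document uses more than one variant of that language. This is just my interpretation for illustration (the function name `shortest_distinct_tags` is made up), and whether a region is "needed" really depends on your context, e.g. if you want separate American and British spellcheck dictionaries.

```python
from collections import defaultdict

def shortest_distinct_tags(tags):
    """Map each tag to its primary subtag unless variants must be told apart."""
    variants = defaultdict(set)
    for tag in tags:
        variants[tag.split("-")[0].lower()].add(tag.lower())
    result = {}
    for tag in tags:
        primary = tag.split("-")[0].lower()
        # Keep the longer tag only when the document uses several variants.
        result[tag] = tag if len(variants[primary]) > 1 else primary
    return result

print(shortest_distinct_tags(["en-US", "de-DE", "fr"]))
print(shortest_distinct_tags(["en-US", "en-GB", "de-DE"]))
```

In the first call every tag collapses to its primary subtag; in the second, en-US and en-GB are kept because the document needs to tell them apart.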
* * *

Also, if you desperately need to handle multiple dictionaries in a single document, and you use Microsoft Word... you could convert your properly lang-marked EPUB to DOCX using Toxaris's EPUB Tools:

https://www.mobileread.com/forums/sh....php?p=2516490

I was pleasantly surprised to see it transferred over all lang information into DOCX, which made dealing with the red squigglies so much easier!

(I recently used it to mark all Spanish/French/German text, and even American/British, making the spellchecking passes so much faster.)
All times are GMT -4.