Quote:
Originally Posted by Tex2002ans
So another potential enhancement that I just ran into today.
1. (See attached EPUB) Create an EPUB that was given metadata:
"dc:language en-GB"
and this HTML:
Code:
<p>colour rumour programme</p>
<p><span lang="en-US" xml:lang="en-US">color rumor program</span></p>
2. Import EPUB using EPUB Tools.
3. The DOCX language is set to (I assume the user's default language?).
In my case, in Microsoft Word, all of the text is marked as "English (U.S.)", so words like "colour" + "rumour" + "programme" are marked as "misspelled"... even though it SHOULD be a British-spelled book.
Expected Functionality
Import EPUB should look at the EPUB Language Metadata, and make the DOCX Language = EPUB Language.
Potential Side Functionality
On Import it could transfer the HTML lang + xml:lang as well? Maybe this could be an option buried in the Preferences (default to Off).
That would be helpful for properly marked up multi-language EPUBs.
|
Of course it is more complex than that... From the top of my head the span tag with the language should be imported correctly. I do not think it gets scrubbed from the source. I will have to check in the source for that, but I am quite certain of that.
There are two situations where the language setting is changed:
1. When the preparation/preprocess procedure is run AND this is specified in the settings
2. Upon running the alternative spelling control. The language of the setting in step 1 is used. The alternative spelling control is not multilingual for several reasons (performance a very big one!)
Described above is the designed situation. Of course, if no language is specified in the imported HTML, Word will use its general setting.
A special mention for the settings for the ePUB export. The generic language for the ePUB can be specified there.
During the export to HTML (and thus ePUB) the language setting for specific words/sentences are ignored I think. I would have to check to be certain, but I am pretty sure.
There is hope though! With the new HTML conversion engine this would no longer be the case. It will be an option, but the language settings of specific words/sentences will be retained there during the export. It is one more reason why I am working on this.
Quote:
Originally Posted by Tex2002ans
Definitely link it when you complete it. I didn't have time to look much further since I made the post here, but thanks for saving me all that legwork. 
|
I will. I forgot to do it yesterday...