View Single Post
Old 06-10-2022, 02:54 PM   #12
Tex2002ans
Wizard
Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.
 
Posts: 2,306
Karma: 13057279
Join Date: Jul 2012
Device: Kobo Forma, Nook
Quote:
Originally Posted by philja View Post
I'm going to add TagMechanic plugin to my Sigil and have a look at the tutorials mentioned above by @Tex2002ans.


Quote:
Originally Posted by philja View Post
[...] systematically cutting off the last 9 characters of every paragraph regardless of word breaks. I think it must be some idea configured into the process by my friend's book designer. I doubt it was aimed at widows and orphans, though, because these had specific parameters defined in the contents paragraphs' CSS in the stylesheet.
It's trying to emulate a setting introduced in CSS4 called:

There's also the new:

This type of hyphenation functionality has existed in Print for a very long time, but is only just making its way into the latest standards for the web.

What must have happened is they were using some sort of program (InDesign?), or some plugin that tried to "hack" a "do not hyphenate in the last word".

* * *

Speculation: If the Book Designer was working within Word, then there was probably some plugin that ran, trying to hack this functionality into Word using Character Styles.

* * *

Side Note: The next major version of LibreOffice, 7.4, is going to be introducing this Hyphenation functionality:

It'll map to the equivalent CSS4 listed above.

Quote:
Originally Posted by philja View Post
The good news is that with the Zoom call and a couple of emails, I managed to teach my author friend how to remove those spans with a regex. (including explaining what a regex is and does).
Another trick I like to do is you can use a simple Search/Replace:

Search: <span class="no-break">
Replace: <span>

Then I use TagMechanic in order to disappear all empty <span>s!

Action Type: Delete
Tag name: span
Having the attribute: No attributes ("naked" tag)

That would take you from this:

Code:
This is a last <span class="no-break">word</span>.
Code:
This is a last <span>word</span>.
Code:
This is a last word.
Very helpful when working with lots of junk.

I just do lots of simple replaces to get them down to <span>, and then in one swoop, wipe away all the empty ones.

Quote:
Originally Posted by philja View Post
I use LibreOffice and the ODTimport plugin with my customised config.xml file which took me a while to sort out for style mapping. I suppose I'll have to go thro the whole thing again for a Word import plugin.
Yep. Best thing you can do is teach them how to use Styles in Word/LibreOffice.

Latest project I'm working on with an Editor, taught him Styles—and in less than 20 minutes, he was up and running.

2 weeks later, bam, 2nd book, the DOCX->EPUB conversion is infinitely easier. Now I'm just teaching him little tweaks here and there and tricks to look out for.

Last edited by Tex2002ans; 06-10-2022 at 02:56 PM.
Tex2002ans is offline   Reply With Quote