View Single Post
Old 12-10-2023, 01:48 PM   #12
Tex2002ans
Wizard
Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.
 
Posts: 2,306
Karma: 13057279
Join Date: Jul 2012
Device: Kobo Forma, Nook
Quote:
Originally Posted by KevinH View Post
This entire thread really belongs in the epub forum not the Sigil forum.
Yes, I'd say move it to the EPUB (or Workshop) section.

Quote:
Originally Posted by KevinH View Post
There are many tools including regex and the ability to create a table of before and after potential replacements that can be easily scrolled over and just the replacements you do not want removed. There are also regex python replacement functions (built-in or via plugin) that can be used as well.
Can you describe some of your ideas?

Quote:
Originally Posted by democrite View Post
A different language plus specialized scientific terms for which perhaps I couldn't find a dictionary.
Which language? Which words? It's like we're pulling teeth here! The more you can share, the easier it'll be to tackle your specific issues.

Quote:
Originally Posted by democrite View Post
Exported PDF to EPUB with one of the numerous cheapo apps.
And there's the root cause of a lot of this. Soft hyphen detection is key.

So many of the crappy PDF->something apps just treat all "line-ending hyphens" as "hard hyphens", so they'll appear in the EPUB. And as you can see, that produces THOUSANDS of them that you'll have to correct.

In the case of Finereader, it narrows it down to a handful.

Last edited by Tex2002ans; 12-10-2023 at 01:52 PM.
Tex2002ans is offline   Reply With Quote