View Single Post
Old 09-07-2021, 02:30 AM   #1
quinta@ebf.cz
Connoisseur
quinta@ebf.cz began at the beginning.
 
Posts: 62
Karma: 10
Join Date: Mar 2019
Device: Kindle 3 Paperwhite
soft hyphens in docx conversion output

Soft hyphens marks (characters U+00AD, or entitities #173 or shy), originally existing in html, are exported to docx (again) as shy characters (code 00AD).

Which is not quite desired behaviour, cause MS Word implements optional word breaks differently, and characters 00AD itself are simply displayed (visually simillary as standard hyphens).

Exported docx document containing shy characters can be repaired by searching shy characters (using symbol ^0173), and replacing them: either by Word "optional word break" (^-), or (mostly in my case) just deleting them by replacing by nothing...

Anyway: Is such export behaviour intentional? Or - mayby - is for some reason inevitable? Is there any way how to achieve replacing shy characters to MS Word "optional word break" as part of conversion?

Last edited by quinta@ebf.cz; 09-07-2021 at 02:40 AM.
quinta@ebf.cz is offline   Reply With Quote