View Single Post
Old 09-07-2021, 10:04 AM   #4
quinta@ebf.cz
Connoisseur
quinta@ebf.cz began at the beginning.
 
Posts: 62
Karma: 10
Join Date: Mar 2019
Device: Kindle 3 Paperwhite
Quote:
Originally Posted by kovidgoyal View Post
And note that in DOCX softhyphens are represented as a special tag not as a unicode character.
Yes, docx softhyphens (optional word brakes), seem to look more like objects then characters. They do not react to unicode-revealing shortcut Alt+X, their "ascii value" is 31 (?), their XML representation is element <w:softHyphen/> (not character)...

My attempts to create optional word brakes using suggested Calibre "search and replace" export feature was (yet) not succesfull. Well, all I tried was replacing using expression \u200C (which is unicode value of suggested "zero width non-joiner"), and using expression \u001F (hexa value of 31)... Excuse my naive approach. : )

Possible good reason for converting soft hyphens to OWB as default Calibre export behaviour: MS Word itself is behaving that way. OWB are converted to SHY when exported to HTML, and vice versa (just tested in Word 2010).

Quote:
Originally Posted by kovidgoyal View Post
From the next release calibre will convert soft hyphens to that tag automatically. https://github.com/kovidgoyal/calibr...c9948658be0db8
Ooh, I think thats better message than expected. Thank you.
quinta@ebf.cz is offline   Reply With Quote