06-22-2017, 01:47 PM | #1 |
Junior Member
Posts: 2
Karma: 10
Join Date: Jun 2017
Location: Huntingdon, England
Device: Calibre reader
|
Soft hyphens lost on conversion to EPUB
We are developing an input plugin for OSIS - an XML based format used for Bibles. We are finding that on conversion to EPUB, soft hyphens are being lost. These are actual UTF-8 soft hyphen characters generated by the input plugin as opposed to entities. It appears that these characters are being removed by the EPUB output plugin, since when using the --debug-pipeline option they are there in the html files in the processed directory, but when files are unzipped from the EPUB output they have gone. We cannot rely on HTML5 soft-hyphen dictionaries since we work with documents in many languages for which there is no hyphen dictionary (e.g. Uzbek). Is there a way of preventing these characters from being removed?
|
06-22-2017, 01:54 PM | #2 |
creator of calibre
Posts: 43,843
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
|
06-22-2017, 01:55 PM | #3 |
creator of calibre
Posts: 43,843
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Actually never mind, the EPUB output plugin removes them deliberately because adobe digital editions older versions have problems with them
|
06-23-2017, 06:13 AM | #4 |
Junior Member
Posts: 2
Karma: 10
Join Date: Jun 2017
Location: Huntingdon, England
Device: Calibre reader
|
I understand. It's a pity there is no way to disable this behaviour though. Might you consider introducing an EPUB output option to do this?
|
06-23-2017, 06:33 AM | #5 |
creator of calibre
Posts: 43,843
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
It's pretty trivial to run calibre from source and just comeent out the copule of lines that do this in the epub output plugin. See https://manual.calibre-ebook.com/develop.html
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
visibility of soft-hyphens | DrChiper | Editor | 9 | 09-02-2016 05:27 PM |
Soft hyphens on Windows | Styx | Calibre | 4 | 02-13-2015 04:26 AM |
ePub to pdf: Doesn't respect soft hyphens in ePub | EbokJunkie | Conversion | 4 | 11-18-2013 03:27 AM |
Soft Hyphens | wallcraft | Workshop | 29 | 06-12-2012 04:21 AM |
Calibre deletes soft Hyphens in Epub ? | NASCARaddicted | Calibre | 4 | 09-20-2009 06:31 PM |