Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Conversion

Notices

Reply
 
Thread Tools Search this Thread
Old 12-20-2020, 05:08 PM   #1
JerrySmile
Member
JerrySmile began at the beginning.
 
Posts: 14
Karma: 10
Join Date: Aug 2012
Device: Kindle
epub to docx conversion loses paragraph spaces

Windows 10, latest Calibre, Word 2019

Issue: epub -> docs conversion works, but loses paragraph spaces.

The div are there where they should be in the epub source, and the spaces appear on the epub reader (used calibre as reader).

However, as shown by Word 2019, the docx file resulted from conversion doesn't have any extra spacing, where the div were.

This is a book of poetry, so the spacing between stanzas is critical.

I tried various conversion options wrt heuristics (with and without), look & feel / layout, nothing works.

I have experience on calibre, but not converting to docx, thus any pointers would be appreciated. BTW, I've installed no plugins (should I?).

Thanks in advance.

Last edited by JerrySmile; 12-20-2020 at 05:15 PM.
JerrySmile is offline   Reply With Quote
Old 12-20-2020, 11:04 PM   #2
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 43,859
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
https://www.mobileread.com/forums/sh...d.php?t=186697
kovidgoyal is offline   Reply With Quote
Advert
Old 12-21-2020, 11:27 AM   #3
retiredbiker
Addict
retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.retiredbiker ought to be getting tired of karma fortunes by now.
 
retiredbiker's Avatar
 
Posts: 387
Karma: 1638210
Join Date: May 2013
Location: Ontario, Canada
Device: Kindle KB, Oasis, Pop_Os!, Jutoh, Kobo Forma
Quote:
This is a book of poetry,
I've recently worked with an author on 4 volumes containing mostly poetry and quotations...with all of the infinite styling challenges of poetry. It's not easy.

Your styling must be spot on, no cheating using <br/> for a new line, for example. Use the editor and do a solid style for every need. I have 21 styles to handle what is in those four books, for example.

IMHO, Calibre won't do a perfect job of converting this sort of thing for a word processor if you need precise output. In my case, we do the ebook first, then go to Writer for page size, final editing, and then a pdf for sending to Amazon for the printed book. The only thing I've found that will do the conversion "really well but not perfectly" is Jutoh.

I open the epub in Jutoh, and the styles from Calibre are nearly perfectly maintained, right down to the style name. So a style I called "Verse1_Top" in the editor stays that way in Jutoh. Most settings are also perfect. Some tiny adjustments are still needed, for some reason.

Then when I use Jutoh to compile an odt for Writer, it is still 99% good, again, right down to the style names. This is a huge leg up. Then in Writer I add the page styles needed for not only the page size, but headers, footers, page numbering, TOC generation, gutter margins, and all that. Then we do the pagination and final edits, export to pdf, and Amazon is happy.

Although I use Writer, Jutoh will work with Word as well, although I can't personally speak to the results.

Last edited by retiredbiker; 12-21-2020 at 11:32 AM.
retiredbiker is offline   Reply With Quote
Old 12-21-2020, 03:48 PM   #4
JSWolf
Resident Curmudgeon
JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.
 
JSWolf's Avatar
 
Posts: 73,983
Karma: 128903378
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
What does work is converting the eBook to HTML and loading that into Word.
JSWolf is offline   Reply With Quote
Old 12-21-2020, 04:34 PM   #5
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 20,572
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Quote:
Originally Posted by retiredbiker View Post
IMHO, Calibre won't do a perfect job of converting this sort of thing for a word processor if you need precise output. In my case, we do the ebook first, then go to Writer for page size, final editing, and then a pdf for sending to Amazon for the printed book. The only thing I've found that will do the conversion "really well but not perfectly" is Jutoh.
FWIW: the Mammoth DOCX converter uses a map of Word Styles to equivalent CSS entries - DiapDealer has written a Sigil wrapper plugin for it. The mapping of Word Styles to CSS entries must be provided by the book maker. Useful if you can re-use the mapping on multiple books, especially the ones yet to be written

BR
BetterRed is offline   Reply With Quote
Advert
Old 01-03-2021, 10:40 AM   #6
JerrySmile
Member
JerrySmile began at the beginning.
 
Posts: 14
Karma: 10
Join Date: Aug 2012
Device: Kindle
Hello, everyone,

I really appreciate your replies. Sorry for not answering for quite a while.

Now, let me give you some details I've found recently.

1. Calibre does an excellent job for the
epub -> docs
conversion for the epub version of Whitman's "Leaves of Grass," as found here, course in the public domain.

And this is important for Whitman is using, of course, a wide variety of stanzas. The important result was that the stanzas were separated via extra blank lines.

The options I used in Calibre were simple:
- look & feel: layout: insert blank line between paras
- no heuristics

This is the epub code for two stanzas at the beginning of a poem:

---
<h2 id="pgepubid00008">Eidolons</h2>
<div xml:space="preserve" class="pgmonospaced">****** I met a seer,<br/>* Passing the hues and objects of the world,<br/>* The fields of art and learning, pleasure, sense,<br/>****** To glean eidolons.<br/><br/>****** Put in thy chants said he,<br/>* No more the puzzling hour nor day, nor segments, parts, put in,<br/>* Put first before the rest as light for all and entrance-song of all,<br/>****** That of eidolons.<br/>
---

The fact that <br/> separators were used in the source seems to have been critical in the success of the calibre conversion to docs (please see further).

2. Calibre failed for me in terms of stanzas separation for the
epub -> docs
conversion for the epub code containing other separators such as:

---
<div class="page_top_padding" id="div3">
<p class="poetry" id="p1">Text of first verse of the stanza</p>
<p class="poetry" id="p2">Text of 2nd verse of the stanza< /p>
<p class="poetry" id="p3">Text of last verse of the stanza</p>
</div>
---

This was epub code generated by Calibre in a previous
azw3 -> epub conversion
in cases where a direct
azw3 -> docs conversion
was losing stanza separation.

I tried many options, with and without heuristics. Any suggestions for options?

The stanza separation was lost.

I tried many options.

3. Another case in which Calibre seems to do a good job in terms of stanzas separation for both of the conversions

azw3 -> docs
azw3 -> epub

is when the epub code looks like this:

---
<p class="vsb"> 1st verse in the stanza </p>
<p class="v"> next verse in the stanza </p>
<p class="v"> next verse in the stanza </p>
<p class="vsb">1st verse in the stanza</p>
<p class="vsb">1st verse in the stanza</p>
<p class="v"> next verse in the stanza </p>
<p class="v"> next verse in the stanza </p>
---

I guess class="vsb" forces a new stanza.

I'm not able to read any azw3 code, sorry.

Last edited by JerrySmile; 01-03-2021 at 04:43 PM.
JerrySmile is offline   Reply With Quote
Reply

Tags
calibre, docx, epub, paragraphs


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Is there a way to keep paragraph indents in Calibre epub>docx conversion? Gregg Bell Conversion 3 02-09-2017 07:40 PM
I lose paragraph spacing in epub>docx Calibre conversion Gregg Bell Conversion 9 02-09-2017 07:40 PM
Convert from PDF to ePub puts paragraph spaces mkelley Conversion 17 01-02-2012 07:48 AM
Paragraph spaces in ePub to Mobi conversion disrupts indent formatting markpearl Conversion 34 09-21-2011 02:42 PM
Huge Sentence and Paragraph Spaces EPub to Mobi Dasha Amazon Kindle 10 06-06-2011 06:43 PM


All times are GMT -4. The time now is 10:19 AM.


MobileRead.com is a privately owned, operated and funded community.