Quote:
Originally Posted by kovidgoyal
open a ticket and attach the RTF
|
i'd like to confirm that its actually a calibre problem & that I can reproduce it consistently before opening a ticket.
my hunch is that I've made so many edits/changes to the .rtf source that its internal structure has become suspect, or overcomplicated.
as an additional test, I asked calibre to convert the same file from rtf to txt, & I see some infrequent, spurious word splits also in the .txt output. ( but If I use word to save it as txt, then it looks OK )
what do calibre rtf to txt & rtf to mobi have in common, do they use a common front end ?
I don't know how the innards of rtf files work, after they are edited, but this could well be a microsoft problem. Will the rtf contain complex chains of pointers to changes - even after performing: select all - copy - paste into a new file ?
PS if I were to open a ticket ( link please ) his is a big file , over 1MB, so would I be able to attach it ?
PPS summarising why it may be a calibre issue:
I began this process with a pdf source which, after conversion, had the usual issue i.e. line breaks in annoying places, plus the odd typo. So after going pdf to epub ( not with calibre) then epub to mobi I made an rtf version from the epub and everytime i complete a chapter, I fix up the formatting for that chapter in word & then use calibre to make a new mobi file.
I've done this repeatedly for 20+ chapters, over 2 weeks, without ever encountering the split word bug - note that there were no split words in the initial pdf conversion -
so all that has changed recently is the version of calibre, and that fact that the total number of edits of the rtf has increased.
Now when I scroll though today's latest mobi conversion in the internal reader, I see a couple of split words in the early chapters which I'd previously "signed off " as done.
so fresh conversions are introducing word split errors in places where they had not used to be. ( but only very occasionally - about one per 50-100 pages )