View Single Post
Old 11-02-2010, 03:39 PM   #3
cybmole
Wizard
cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.
 
Posts: 3,720
Karma: 1759970
Join Date: Sep 2010
Device: none
Quote:
Originally Posted by kovidgoyal View Post
open a ticket and attach the RTF
i'd like to confirm that its actually a calibre problem & that I can reproduce it consistently before opening a ticket.

my hunch is that I've made so many edits/changes to the .rtf source that its internal structure has become suspect, or overcomplicated.

as an additional test, I asked calibre to convert the same file from rtf to txt, & I see some infrequent, spurious word splits also in the .txt output. ( but If I use word to save it as txt, then it looks OK )

what do calibre rtf to txt & rtf to mobi have in common, do they use a common front end ?

I don't know how the innards of rtf files work, after they are edited, but this could well be a microsoft problem. Will the rtf contain complex chains of pointers to changes - even after performing: select all - copy - paste into a new file ?

PS if I were to open a ticket ( link please ) his is a big file , over 1MB, so would I be able to attach it ?

PPS summarising why it may be a calibre issue:
I began this process with a pdf source which, after conversion, had the usual issue i.e. line breaks in annoying places, plus the odd typo. So after going pdf to epub ( not with calibre) then epub to mobi I made an rtf version from the epub and everytime i complete a chapter, I fix up the formatting for that chapter in word & then use calibre to make a new mobi file.

I've done this repeatedly for 20+ chapters, over 2 weeks, without ever encountering the split word bug - note that there were no split words in the initial pdf conversion -

so all that has changed recently is the version of calibre, and that fact that the total number of edits of the rtf has increased.

Now when I scroll though today's latest mobi conversion in the internal reader, I see a couple of split words in the early chapters which I'd previously "signed off " as done.
so fresh conversions are introducing word split errors in places where they had not used to be. ( but only very occasionally - about one per 50-100 pages )

Last edited by cybmole; 11-02-2010 at 03:59 PM.
cybmole is offline   Reply With Quote