Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Conversion

Notices

Reply
 
Thread Tools Search this Thread
Old 03-19-2016, 05:00 AM   #1
pokeba
Junior Member
pokeba began at the beginning.
 
Posts: 2
Karma: 10
Join Date: Nov 2015
Location: USA
Device: iPad
Convert a Chinese book, markers based on punctuation caused an erranour page break

Hi,

I have converted many Chinese books from Word files to epub with Calibre successfully. I recently encountered this issue that for a book, Calibre generates an extra page break.

The error happens in the only place where I have a short English phrase, as shown in the attached jpg file ( I have turned on "show all marks" in Word).

I checked the output of debug. In the input/index.html file, the line is correct:

<p class="block_33">*</p>
<p class="block_5 text_10">Mee Soto</p>
<p class="block_7">(词/曲:疏效平、李家欣)</p>

But in the parsed/index.html, it became:
<p class="block_33">*</p>
<p class="block_5 text_10" style="page-break-before:always">Mee Soto</p>
<p class="block_7">(词/曲:疏效平、李家欣)</p>

Note the extra "page-break-before:always" was added.

In the log file, it says:
...
Median line length is 135, calculated with html format
Looking for more split points based on punctuation, currently have 2
marked 3 section markers based on punctuation. - Mee Soto</p>
...

So somehow Calibre thinks there is a punctuation in "Mee Soto</p>
But I don't see it and I have spent quite a few days try to get rid of the extra page break.

I also found that if I change the "Mee Soto" to other English text, the page break will still be there. But if I change "Mee Soto" to some Chinese characters, then Calibre will not generate the extra page break.

I'd appreciate if anyone can help or point me why Calibre see a punctuation in "Mee Soto</p>.

Thanks
Attached Thumbnails
Click image for larger version

Name:	Mee_soto_in_Word_file.JPG
Views:	198
Size:	30.0 KB
ID:	147189  
pokeba is offline   Reply With Quote
Old 03-19-2016, 07:36 AM   #2
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 45,345
Karma: 27182818
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Dont turn on heuristics in the conversion.
kovidgoyal is offline   Reply With Quote
Advert
Old 03-21-2016, 04:46 AM   #3
pokeba
Junior Member
pokeba began at the beginning.
 
Posts: 2
Karma: 10
Join Date: Nov 2015
Location: USA
Device: iPad
Quote:
Originally Posted by kovidgoyal View Post
Dont turn on heuristics in the conversion.
This solved the issue. Thanks very much.
pokeba is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Problems caused by <a> markup used for index markers yucca Sigil 9 10-04-2013 06:34 AM
how to convert a scanned page from a book (looks like photo of page) to clean text? neuvivlio Workshop 34 11-29-2012 09:05 AM
inline page markers overtop of text?? Stinger Kobo Reader 9 05-28-2010 05:42 PM
Book Designer 5.0 - How to force a page break ebookfab Sony Reader 13 12-26-2008 03:48 AM
Book Designer - Page Break Line is not showing? pitolee Sony Reader 6 04-19-2007 09:26 PM


All times are GMT -4. The time now is 07:12 AM.


MobileRead.com is a privately owned, operated and funded community.