Old 07-03-2010, 12:28 AM   #1
prky
HTML Conversion - Multiline Headers

Hi all,

I've noticed some HTML -> EPUB conversion funkiness with Calibre since I upgraded from 0.6.54 to 0.7.5.

I had a book which converted fine in 0.6.54, but when I reconverted it in 0.7.5 (after changing the CSS for emphasis handling), the entire book came out in large text.

Looking at the HTML, each chapter had the following type of chapter header:

Code:
<h2>1
Chapter 1 Title</h2>
It appears that in 0.6.54, this was parsed as:

Code:
<h2>1</h2>
<h2>Chapter 1 Title</h2>
i.e., header 2 was applied to just those two lines. In 0.7.5, however, it was parsed as:

Code:
<h2>1
Chapter 1 Title
i.e., an unterminated header 2, so the entire chapter was rendered as header 2!

I edited the HTML manually so that it looked like:

Code:
<h2>1</h2>
<h2>Chapter 1 Title</h2>
Which fixed it.
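For a book with many chapters, the same manual edit could probably be scripted before conversion. Below is a minimal sketch in Python, assuming every header looks like the example above (a plain `<h1>`..`<h6>` tag with no attributes and no nested tags, whose text spans two or more lines). The name `split_multiline_headings` is made up for illustration and is not part of Calibre.

```python
import re

# Matches an <h1>..<h6> element whose content contains no nested tags.
# Assumption: headers are plain text only and carry no attributes,
# as in the example above. Anything else is left untouched.
HEADING_RE = re.compile(r"<(h[1-6])>([^<]*)</\1>", re.IGNORECASE)


def split_multiline_headings(html):
    """Rewrite each header whose text spans several lines as one header per line."""

    def repl(match):
        tag, text = match.group(1), match.group(2)
        # Keep only the non-empty lines of the header text.
        lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
        if len(lines) < 2:
            return match.group(0)  # single-line header: leave as-is
        return "\n".join("<{0}>{1}</{0}>".format(tag, ln) for ln in lines)

    return HEADING_RE.sub(repl, html)


if __name__ == "__main__":
    sample = "<h2>1\nChapter 1 Title</h2>\n<p>Body text.</p>"
    print(split_multiline_headings(sample))
```

This only does the workaround described above. It is not how Calibre parses HTML, and a regex like this would miss headers with attributes or inline markup.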

Question - is this an HTML parsing bug in 0.7.x, or was it never meant to work the way it did in 0.6.x?

Is there a way I can make it parse HTML tags across multiple lines?

Ta,

prk.
Old 07-03-2010, 09:24 AM   #2
kovidgoyal
creator of calibre
There was a bug in the HTML input plugin in 0.7.5. Upgrade to 0.7.7 and you should be fine.