|04-30-2012, 04:35 AM||#1|
Join Date: Mar 2012
html to epub CLI conversion / html input
I have searched a lot without luck, possibly because my queries were not relevant, so here is my question.
I use the calibre CLI to convert an HTML file to EPUB. The options I use are taken from the logs produced by the calibre GUI, so I expect to get the same result.
But I am missing something: the resulting EPUB sometimes has unresolved links to footnotes and/or missing page breaks. I tried to fine-tune the options, but that did not change anything.
My HTML source is a document saved from Word as filtered HTML.
If I convert it directly to EPUB with ebook-convert and "my" options, I get the problem described above.
If I first import the HTML file into calibre, take the resulting zip file, and then ebook-convert it with exactly the same options as above, I get the expected result, identical to what the calibre GUI conversion produces.
So my conclusion is that I am missing something on the HTML input side. When a file is imported into calibre, some processing is applied to it, but I don't know how to replicate that with the CLI since I cannot see any log related to the import.
I tried to ebook-convert from HTML to zip first, but the resulting zip is completely different from the one produced by importing into calibre.
Can someone provide a tip, some information, or a link to the relevant documentation section or an existing forum thread?
Last edited by m4mmon; 04-30-2012 at 05:27 AM. Reason: better problem description
|04-30-2012, 05:36 AM||#2|
Join Date: Mar 2012
I have found the missing step: before converting the HTML to EPUB, I need to convert the HTML to OEB. So this is a 2-step conversion, as I had suspected:
ebook-convert.exe book.htm oeb
ebook-convert.exe oeb\book.htm book.epub %opts%
ebook-convert.exe book.htm book.epub %opts%
Maybe someone will point out a mistake, but since I get exactly the same result as when using the calibre GUI for the conversion, I think my problem is solved.
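For anyone who wants to script this, here is a minimal Python sketch of the 2-step conversion. It only wraps the two ebook-convert calls described in this post. The function names are hypothetical, `opts` stands in for whatever is in %opts%, and it assumes (as in the commands above) that the intermediate HTML file ends up in the OEB folder under the same file name as the source. Treat it as a starting point, not a verified recipe.

```python
import subprocess
from pathlib import Path


def build_two_step_commands(source, epub, oeb_dir="oeb", opts=(), exe="ebook-convert"):
    """Build the two ebook-convert command lines (hypothetical helper).

    Step 1 converts the HTML source to an OEB folder; step 2 converts the
    HTML file inside that folder to EPUB, with the conversion options.
    """
    source = Path(source)
    oeb_dir = Path(oeb_dir)

    # Step 1: HTML -> OEB folder (output given without an extension, as in the post).
    step1 = [exe, str(source), str(oeb_dir)]

    # Assumption taken from the post: the intermediate file is <oeb_dir>/<source name>.
    intermediate = oeb_dir / source.name

    # Step 2: intermediate HTML -> EPUB, with the options applied only here.
    step2 = [exe, str(intermediate), str(epub), *opts]
    return [step1, step2]


def convert_two_step(source, epub, **kwargs):
    """Run both steps in order, stopping at the first failure."""
    for cmd in build_two_step_commands(source, epub, **kwargs):
        subprocess.run(cmd, check=True)
```

Calling `convert_two_step("book.htm", "book.epub", opts=[...])` with your own option list should run the same two commands as above. Building the commands as lists rather than one string avoids quoting problems with paths that contain spaces.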
Last edited by m4mmon; 05-05-2012 at 02:08 AM.