07-07-2009, 06:44 PM | #1 |
Addict
Posts: 260
Karma: 274
Join Date: Apr 2006
Location: Gig Harbor, Washington
Device: BeBook One, PocketBook 360, Kindle Paperwhite, Kobo Aura One
|
ebooks.adelaide Mobi Conversion Failures
The University of Adelaide offers free ebooks, formatted mostly in separate html chapters. http://ebooks.adelaide.edu.au/
When attempting to use Calibre to convert these html segments to Mobibook, I get the following error details: (To get around this difficulty I have followed the fairly convoluted alternative method of downloading the same titles from Gutenberg in txt format, converting to html using GUItenMark, then having Calibre convert that html file to Mobibook, which works fine.) Has anyone an explanation why one html conversion works and the other doesn't? Convert book 1 of 1 (u'Decline and Fall of the Roman Empire (Chapter1)') InputFormatPlugin: HTML Input running on C:\Documents and Settings\Compaq_Owner\My Documents\My Books\eBook Library\Edward Gibbon\Decline and Fall of the Roman Empire (Ch (143)\Decline and Fall of the Roman Empire (Ch - Edward Gibbon.html Language not specified Creator not specified Building file list... IgnoreFile(u"Could not read from file: C:\\Documents and Settings\\Compaq_Owner\\My Documents\\My Books\\eBook Library\\Edward Gibbon\\Decline and Fall of the Roman Empire (Ch (143)\\index.html with error: (2, 'No such file or directory')",) IgnoreFile(u"Could not read from file: C:\\Documents and Settings\\Compaq_Owner\\My Documents\\My Books\\eBook Library\\Edward Gibbon\\Decline and Fall of the Roman Empire (Ch (143)\\chapter2.html with error: (2, 'No such file or directory')",) Found files... HTMLFile:0:a:C:\Documents and Settings\Compaq_Owner\My Documents\My Books\eBook Library\Edward Gibbon\Decline and Fall of the Roman Empire (Ch (143)\Decline and Fall of the Roman Empire (Ch - Edward Gibbon.html Parsing Decline%20and%20Fall%20of%20the%20Roman%20Empire%2 0%28Ch%20-%20Edward%20Gibbon.html ... Stripping comments and meta tags from Decline%20and%20Fall%20of%20the%20Roman%20Empire%2 0%28Ch%20-%20Edward%20Gibbon.html Traceback (most recent call last): File "worker.py", line 103, in <module> File "worker.py", line 90, in main File "calibre\gui2\convert\gui_conversion.pyo", line 17, in gui_convert File "calibre\ebooks\conversion\plumber.pyo", line 599, in run File "calibre\customize\conversion.pyo", line 213, in __call__ File "calibre\ebooks\html\input.pyo", line 284, in convert File "calibre\ebooks\html\input.pyo", line 356, in create_oebbook File "calibre\ebooks\oeb\base.pyo", line 947, in fget File "calibre\ebooks\oeb\base.pyo", line 812, in _parse_xhtml File "lxml.etree.pyx", line 2440, in lxml.etree.fromstring (src/lxml/lxml.etree.c:23985) File "parser.pxi", line 1510, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:63925) File "parser.pxi", line 1382, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:62795) File "parser.pxi", line 891, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:59726) File "parser.pxi", line 542, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:56659) File "parser.pxi", line 628, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:57504) File "parser.pxi", line 568, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:56902) lxml.etree.XMLSyntaxError: Attribute xm |
07-07-2009, 07:12 PM | #2 |
creator of calibre
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
As the error message indicates the adelaide HTMl contains some invalid syntax that calibre's parser cannot compensate for. The output of GutenMark doesn't. If you post the html that causes the problem, I'lls ee if I can cook up a workaround
|
07-07-2009, 07:27 PM | #3 | |
Addict
Posts: 260
Karma: 274
Join Date: Apr 2006
Location: Gig Harbor, Washington
Device: BeBook One, PocketBook 360, Kindle Paperwhite, Kobo Aura One
|
Quote:
Anyway, my failed experience with the Adelaide titles seems to ALWAYS be the case, so I suggest that you just download from their site any title you want and see what the problem is and hopefully rectify it. Again, many thanks, and regards, Jim |
|
07-10-2009, 12:12 PM | #4 | |
Addict
Posts: 260
Karma: 274
Join Date: Apr 2006
Location: Gig Harbor, Washington
Device: BeBook One, PocketBook 360, Kindle Paperwhite, Kobo Aura One
|
Quote:
|
|
07-10-2009, 12:37 PM | #5 | |
Wizard
Posts: 1,101
Karma: 4388403
Join Date: Oct 2007
Device: Palm>Ebookman>IPaq>Axim>Cybook>Kndl2>IPAD>Kndl3SO>Voyager>Oasis
|
Quote:
If I may offer a hint for you. You have asked Kovidgoyal to download from their site any book and see what happens. Unfortunately, we all have him running around like crazy trying to add special requests and fixes for each of us - all out of his own time. From your description, I can see that you know how much of a hassle it is to go through 'a few extra steps' of a work around of downloading from Gutenberg and editing the book to what you want. Likewise, asking him to find adelaide, figure out their search system and download a book in order to do you a favor raises the bar on his task. Imagine the horror if he downloads the one book that doesn't have your problem [grin]. And all of this for free since Calibre is free software. Now, I'm probably guilty of mind-reading and putting words in his mouth. However, you could probably get better response if you would do the work for him to find a book and post it directly here for him. Then he could concentrate in the programming that he does so well instead of the fetch and carry work! |
|
07-10-2009, 01:02 PM | #6 | |
Addict
Posts: 260
Karma: 274
Join Date: Apr 2006
Location: Gig Harbor, Washington
Device: BeBook One, PocketBook 360, Kindle Paperwhite, Kobo Aura One
|
Quote:
http://ebooks.adelaide.edu.au/j/joyce/james/j8d/ Please be assured that I am sensitive to the burden many of us are placing on Kovidgoyal, and I don't wish to be an unreasonable further burden. I just didn't know exactly how to provide him with the information he initially requested of me -- and I still don't. But I am trying. Thanks for your understandable concern. Regards, Jim |
|
07-10-2009, 04:06 PM | #7 |
creator of calibre
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
That file converts with my current install of calibre, so it should work for you with calibre 0.6b12 and higher
|
07-10-2009, 05:26 PM | #8 |
Addict
Posts: 260
Karma: 274
Join Date: Apr 2006
Location: Gig Harbor, Washington
Device: BeBook One, PocketBook 360, Kindle Paperwhite, Kobo Aura One
|
|
07-11-2009, 02:09 AM | #9 | |
Addict
Posts: 260
Karma: 274
Join Date: Apr 2006
Location: Gig Harbor, Washington
Device: BeBook One, PocketBook 360, Kindle Paperwhite, Kobo Aura One
|
Quote:
Convert book 1 of 1 (u'Decline and Fall of the Roman Empire (Chapter1)') InputFormatPlugin: HTML Input running on C:\Documents and Settings\Compaq_Owner\My Documents\My Books\eBook Library\Edward Gibbon\Decline and Fall of the Roman Empire (Ch (147)\Decline and Fall of the Roman Empire (Ch - Edward Gibbon.html Language not specified Creator not specified Building file list... IgnoreFile(u"Could not read from file: C:\\Documents and Settings\\Compaq_Owner\\My Documents\\My Books\\eBook Library\\Edward Gibbon\\Decline and Fall of the Roman Empire (Ch (147)\\index.html with error: (2, 'No such file or directory')",) IgnoreFile(u"Could not read from file: C:\\Documents and Settings\\Compaq_Owner\\My Documents\\My Books\\eBook Library\\Edward Gibbon\\Decline and Fall of the Roman Empire (Ch (147)\\chapter2.html with error: (2, 'No such file or directory')",) Found files... HTMLFile:0:a:C:\Documents and Settings\Compaq_Owner\My Documents\My Books\eBook Library\Edward Gibbon\Decline and Fall of the Roman Empire (Ch (147)\Decline and Fall of the Roman Empire (Ch - Edward Gibbon.html Parsing Decline%20and%20Fall%20of%20the%20Roman%20Empire%2 0%28Ch%20-%20Edward%20Gibbon.html ... Stripping comments and meta tags from Decline%20and%20Fall%20of%20the%20Roman%20Empire%2 0%28Ch%20-%20Edward%20Gibbon.html Traceback (most recent call last): File "worker.py", line 103, in <module> File "worker.py", line 90, in main File "calibre\gui2\convert\gui_conversion.pyo", line 17, in gui_convert File "calibre\ebooks\conversion\plumber.pyo", line 638, in run File "calibre\customize\conversion.pyo", line 213, in __call__ File "calibre\ebooks\html\input.pyo", line 284, in convert File "calibre\ebooks\html\input.pyo", line 358, in create_oebbook File "calibre\ebooks\oeb\base.pyo", line 935, in fget File "calibre\ebooks\oeb\base.pyo", line 814, in _parse_xhtml File "lxml.etree.pyx", line 2440, in lxml.etree.fromstring (src/lxml/lxml.etree.c:23985) File "parser.pxi", line 1510, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:63925) File "parser.pxi", line 1382, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:62795) File "parser.pxi", line 891, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:59726) File "parser.pxi", line 542, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:56659) File "parser.pxi", line 628, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:57504) File "parser.pxi", line 568, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:56902) lxml.etree.XMLSyntaxError: Attribute xmlns redefined, line 1, column 23 |
|
07-11-2009, 01:15 PM | #10 |
creator of calibre
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
zip up your downloaded HTML files into a single zip file and attach them here or to a ticket on the calibre website and I'll take a look. What program are you using to download them?
|
07-11-2009, 03:12 PM | #11 | |
Addict
Posts: 260
Karma: 274
Join Date: Apr 2006
Location: Gig Harbor, Washington
Device: BeBook One, PocketBook 360, Kindle Paperwhite, Kobo Aura One
|
Quote:
Kovid: IGNORE THIS MESSAGE. I NEED TO RESOLVE SOMETHING. Sorry, Jim Last edited by ascherjim; 07-11-2009 at 03:19 PM. Reason: Initially sent wrong zip file |
|
07-11-2009, 03:16 PM | #12 |
creator of calibre
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
In the rror message you posted, the error occurs in parsing the file
Decline and Fall of the Roman Empire 28Ch - Edward Gibbon.html Just zip up and attach that file |
07-11-2009, 03:33 PM | #13 | |
Addict
Posts: 260
Karma: 274
Join Date: Apr 2006
Location: Gig Harbor, Washington
Device: BeBook One, PocketBook 360, Kindle Paperwhite, Kobo Aura One
|
Quote:
g43d.zip Chapter One from this is the html file that I've failed at converting to Mobipocket with Calibre. |
|
07-11-2009, 04:25 PM | #14 |
creator of calibre
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Will be fixed in next beta
|
07-11-2009, 04:28 PM | #15 |
Addict
Posts: 260
Karma: 274
Join Date: Apr 2006
Location: Gig Harbor, Washington
Device: BeBook One, PocketBook 360, Kindle Paperwhite, Kobo Aura One
|
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
PDF to Mobi Conversion | rayh | Calibre | 2 | 09-24-2010 02:33 AM |
eBooks@Adelaide ePub problems? | dhume01 | ePub | 22 | 09-02-2010 06:37 AM |
Epub to Mobi conversion | MichaelGray | Calibre | 2 | 08-12-2010 01:08 PM |
Mobigen Mass Batch conversion of HTML-Single-File ebooks to .mobi ebooks | cklammer | Kindle Formats | 9 | 11-20-2009 03:00 AM |
what the ’ ??? (mobi conversion woes) | zelda_pinwheel | Workshop | 14 | 04-02-2008 02:27 AM |