View Full Version : Some problems with conversions


babychaos
02-06-2011, 12:50 PM
Am a little new to converting epub files to mobi. Most of my 60 or so books have worked fine, but there are a couple of them failing for some reason. The error message i get below. Can anyone make sense of it?


calibre, version 0.7.43
ERROR: Conversion Error: <b>Failed</b>: Convert book 1 of 1 (Discworld 01 - The Colour of Magic)

Convert book 1 of 1 (Discworld 01 - The Colour of Magic)
Resolved conversion options
calibre version: 0.7.43
{'asciiize': False,
'author_sort': None,
'authors': None,
'base_font_size': 0.0,
'book_producer': None,
'change_justification': u'original',
'chapter': u"//*[((name()='h1' or name()='h2') and re:test(., 'chapter|book|section|part|prologue|epilogue\\s+', 'i')) or @class = 'chapter']",
'chapter_mark': u'pagebreak',
'comments': None,
'cover': 'c:\\users\\babych~1\\appdata\\local\\temp\\calibr e_0.7.43_tmp_7te48d\\calibre_0.7.43_jls6zg.jpeg',
'debug_pipeline': None,
'dehyphenate': True,
'delete_blank_paragraphs': True,
'disable_font_rescaling': False,
'dont_compress': True,
'enable_heuristics': False,
'extra_css': None,
'fix_indents': True,
'font_size_mapping': None,
'format_scene_breaks': True,
'html_unwrap_factor': 0.4,
'input_encoding': None,
'input_profile': <calibre.customize.profiles.KindleInput object at 0x057608D0>,
'insert_blank_line': False,
'insert_metadata': False,
'isbn': None,
'italicize_common_cases': True,
'keep_ligatures': False,
'language': None,
'level1_toc': None,
'level2_toc': None,
'level3_toc': None,
'line_height': 0.0,
'linearize_tables': False,
'margin_bottom': 5.0,
'margin_left': 5.0,
'margin_right': 5.0,
'margin_top': 5.0,
'markup_chapter_headings': True,
'max_toc_links': 50,
'minimum_line_height': 120.0,
'mobi_ignore_margins': True,
'no_chapters_in_toc': False,
'no_inline_navbars': True,
'no_inline_toc': False,
'output_profile': <calibre.customize.profiles.KindleOutput object at 0x05760B30>,
'page_breaks_before': u'/',
'personal_doc': u'[PDOC]',
'prefer_author_sort': False,
'prefer_metadata_cover': False,
'pretty_print': False,
'pubdate': None,
'publisher': None,
'rating': None,
'read_metadata_from_opf': 'c:\\users\\babych~1\\appdata\\local\\temp\\calibr e_0.7.43_tmp_7te48d\\calibre_0.7.43_wehsxd.opf',
'remove_first_image': False,
'remove_paragraph_spacing': False,
'remove_paragraph_spacing_indent_size': 1.5,
'renumber_headings': True,
'rescale_images': False,
'series': None,
'series_index': None,
'smarten_punctuation': False,
'sr1_replace': None,
'sr1_search': None,
'sr2_replace': None,
'sr2_search': None,
'sr3_replace': None,
'sr3_search': None,
'tags': None,
'timestamp': None,
'title': None,
'title_sort': None,
'toc_filter': None,
'toc_threshold': 6,
'toc_title': None,
'unwrap_lines': True,
'use_auto_toc': False,
'verbose': 2}
InputFormatPlugin: EPUB Input running
on C:\Users\Babychaos\Calibre Library\Terry Pratchett\Discworld 01 - The Colour of Magic (43)\Discworld 01 - The Colour of Magic - Terry Pratchett.epub
Found HTML cover cover.html
Parsing all content...
Parsing text/part9.xhtml ...
Parsing text/part2.xhtml ...
Parsing text/part4.xhtml ...
Parsing text/part5.xhtml ...
Parsing text/part6.xhtml ...
Parsing text/part7.xhtml ...
Parsing text/part8.xhtml ...
Parsing text/part3.xhtml ...
Parsing text/part10.xhtml ...
Parsing text/part1.xhtml ...
Initial parse failed:
Traceback (most recent call last):
File "site-packages\calibre\ebooks\oeb\base.py", line 863, in first_pass
File "lxml.etree.pyx", line 2532, in lxml.etree.fromstring (src/lxml/lxml.etree.c:48634)
File "parser.pxi", line 1545, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:72245)
File "parser.pxi", line 1417, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:71041)
File "parser.pxi", line 898, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:67581)
File "parser.pxi", line 539, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:64257)
File "parser.pxi", line 625, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:65178)
File "parser.pxi", line 565, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:64521)
XMLSyntaxError: Start tag expected, '<' not found, line 2, column 1

Parsing file 'text/part1.xhtml' as HTML
Failed to parse content in text/part1.xhtml
Traceback (most recent call last):
File "site-packages\calibre\ebooks\oeb\reader.py", line 164, in _manifest_prune_invalid
File "site-packages\calibre\ebooks\oeb\base.py", line 1109, in fget
File "site-packages\calibre\ebooks\oeb\base.py", line 887, in _parse_xhtml
File "site-packages\calibre\ebooks\oeb\base.py", line 876, in first_pass
File "site-packages\lxml\html\__init__.py", line 603, in fromstring
File "site-packages\lxml\html\__init__.py", line 514, in document_fromstring
ParserError: Document is empty

Parsing text/part11.xhtml ...
Parsing text/part12.xhtml ...
Parsing text/part13.xhtml ...
Parsing text/part14.xhtml ...
Parsing text/part15.xhtml ...
Parsing text/part16.xhtml ...
Parsing cover.html ...
Referenced file 'text/part1.xhtml' not in manifest
Parsing text/part1.xhtml ...
Initial parse failed:
Traceback (most recent call last):
File "site-packages\calibre\ebooks\oeb\base.py", line 863, in first_pass
File "lxml.etree.pyx", line 2532, in lxml.etree.fromstring (src/lxml/lxml.etree.c:48634)
File "parser.pxi", line 1545, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:72245)
File "parser.pxi", line 1417, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:71041)
File "parser.pxi", line 898, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:67581)
File "parser.pxi", line 539, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:64257)
File "parser.pxi", line 625, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:65178)
File "parser.pxi", line 565, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:64521)
XMLSyntaxError: Start tag expected, '<' not found, line 2, column 1

Parsing file 'text/part1.xhtml' as HTML
Python function terminated unexpectedly
Document is empty (Error Code: 1)
Traceback (most recent call last):
File "site.py", line 103, in main
File "site.py", line 85, in run_entry_point
File "site-packages\calibre\utils\ipc\worker.py", line 110, in main
File "site-packages\calibre\gui2\convert\gui_conversion.py", line 31, in gui_convert_override
File "site-packages\calibre\gui2\convert\gui_conversion.py", line 25, in gui_convert
File "site-packages\calibre\ebooks\conversion\plumber.py", line 908, in run
File "site-packages\calibre\ebooks\conversion\plumber.py", line 1037, in create_oebbook
File "site-packages\calibre\ebooks\oeb\reader.py", line 72, in __call__
File "site-packages\calibre\ebooks\oeb\reader.py", line 598, in _all_from_opf
File "site-packages\calibre\ebooks\oeb\reader.py", line 248, in _manifest_from_opf
File "site-packages\calibre\ebooks\oeb\reader.py", line 181, in _manifest_add_missing
File "site-packages\calibre\ebooks\oeb\base.py", line 1109, in fget
File "site-packages\calibre\ebooks\oeb\base.py", line 887, in _parse_xhtml
File "site-packages\calibre\ebooks\oeb\base.py", line 876, in first_pass
File "site-packages\lxml\html\__init__.py", line 603, in fromstring
File "site-packages\lxml\html\__init__.py", line 514, in document_fromstring
lxml.etree.ParserError: Document is empty

theducks
02-06-2011, 01:09 PM
Parsing text/part1.xhtml ...
Initial parse failed:

XMLSyntaxError: Start tag expected, '<' not found, line 2, column 1

If I am getting this figured out :D
You have a malformed part1 file.
Use Tweak (tap T with the book selected and expand. Open part1 with notepad..) Line 2 :chinscratch:

BTW, it would not hurt to validate EPUB's with Flightcrew :bulb2:

kovidgoyal
02-06-2011, 01:09 PM
It's impossible to help without access to the books. Open a ticket and attach the files that are failing to convert.