08-06-2010, 08:37 PM   #1
mburgoa
Device: Kindle 2
Initial parse failed:

I'm trying to fetch news from a newspaper, but it doesn't seem to be working for me. The first error message I see in the log is "Initial parse failed:". Could anyone help me here?

Here's the log:

ERROR: Conversion Error: <b>Failed</b>: Fetch news from Los Tiempos

Fetch news from Los Tiempos
Resolved conversion options
calibre version: 0.7.12
{'asciiize': False,
'author_sort': None,
'authors': None,
'base_font_size': 0,
'book_producer': None,
'change_justification': 'original',
'chapter': None,
'chapter_mark': 'pagebreak',
'comments': None,
'cover': None,
'debug_pipeline': None,
'disable_font_rescaling': False,
'dont_compress': False,
'dont_download_recipe': False,
'extra_css': None,
'font_size_mapping': None,
'footer_regex': '(?i)(?<=<hr>)((\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?\\d+<br>\\s*.*?\\s*)|(\\s* <a name=\\d+></a>((<img.+?>)*<br>\\s*)?.*?<br>\\s*\\d+))(?=<br>)' ,
'header_regex': '(?i)(?<=<hr>)((\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?\\d+<br>\\s*.*?\\s*)|(\\s* <a name=\\d+></a>((<img.+?>)*<br>\\s*)?.*?<br>\\s*\\d+))(?=<br>)' ,
'input_encoding': None,
'input_profile': <calibre.customize.profiles.InputProfile object at 0x911b9ec>,
'insert_blank_line': False,
'insert_metadata': False,
'isbn': None,
'keep_ligatures': False,
'language': None,
'level1_toc': None,
'level2_toc': None,
'level3_toc': None,
'line_height': 0,
'linearize_tables': False,
'lrf': False,
'margin_bottom': 5.0,
'margin_left': 5.0,
'margin_right': 5.0,
'margin_top': 5.0,
'max_toc_links': 50,
'no_chapters_in_toc': False,
'no_inline_navbars': True,
'no_inline_toc': False,
'output_profile': <calibre.customize.profiles.KindleOutput object at 0x91232ac>,
'page_breaks_before': None,
'password': None,
'personal_doc': '[PDOC]',
'prefer_author_sort': False,
'prefer_metadata_cover': False,
'preprocess_html': False,
'pretty_print': False,
'pubdate': None,
'publisher': None,
'rating': None,
'read_metadata_from_opf': None,
'remove_first_image': False,
'remove_footer': False,
'remove_header': False,
'remove_paragraph_spacing': False,
'remove_paragraph_spacing_indent_size': 1.5,
'rescale_images': False,
'series': None,
'series_index': None,
'tags': None,
'test': False,
'timestamp': None,
'title': None,
'title_sort': None,
'toc_filter': None,
'toc_threshold': 6,
'toc_title': None,
'use_auto_toc': False,
'username': None,
'verbose': 2}
InputFormatPlugin: Recipe Input running
Synthesizing mastheadImage
Downloading
Fetching http://www.lostiempos.com/imprimir_a...fecha=20100806
Downloading
Fetching http://www.lostiempos.com/imprimir_a...fecha=20100806
Downloading
Fetching http://www.lostiempos.com/imprimir_a...fecha=20100806
Downloading
Fetching http://www.lostiempos.com/imprimir_a...fecha=20100806
Downloading
Fetching http://www.lostiempos.com/imprimir_a...fecha=20100806
Processing images...
Processing images...
Processing images...
Processing images...
Processing links...
Processing links...
Processing images...
Processing links...
Processing links...
Processing links...
http://www.lostiempos.com/imprimir_a...fecha=20100806 saved to /tmp/calibre_0.7.12_J5Qwo9_plumber/feed_0/article_4/imprimir_art.xhtml
Downloading
http://www.lostiempos.com/imprimir_a...fecha=20100806 saved to /tmp/calibre_0.7.12_J5Qwo9_plumber/feed_0/article_1/imprimir_art.xhtml
Downloading
http://www.lostiempos.com/imprimir_a...fecha=20100806 saved to /tmp/calibre_0.7.12_J5Qwo9_plumber/feed_0/article_2/imprimir_art.xhtml
http://www.lostiempos.com/imprimir_a...fecha=20100806 saved to /tmp/calibre_0.7.12_J5Qwo9_plumber/feed_0/article_0/imprimir_art.xhtml
http://www.lostiempos.com/imprimir_a...fecha=20100806 saved to /tmp/calibre_0.7.12_J5Qwo9_plumber/feed_0/article_3/imprimir_art.xhtml
Fetching http://www.lostiempos.com/imprimir_a...fecha=20100806
Fetching http://www.lostiempos.com/imprimir_a...fecha=20100806
Downloading
Fetching http://www.lostiempos.com/imprimir_a...fecha=20100806
Downloaded article: Presidente Morales promulga la Ley Siderúrgica del Mutún from http://www.lostiempos.com/diario/act...47_159879.html
Downloaded article: Secuestran a alcalde de Uyuni que retornaba de Sucre tras el intento de diálogo con el Gobierno from http://www.lostiempos.com/diario/act...71_159959.html
Downloaded article: Potosí conmemora el Día de la Patria con una masiva marcha de reivindicación y rebeldía from http://www.lostiempos.com/diario/act...68_159952.html
Downloaded article: Evo pide conciliación en la sesión congresal en Santa Cruz from http://www.lostiempos.com/diario/act...56_159922.html
Downloaded article: Histórico: Bolivia celebró su 185 aniversario en Santa Cruz from http://www.lostiempos.com/diario/act...45_159867.html
Processing images...
Processing links...
http://www.lostiempos.com/imprimir_a...fecha=20100806 saved to /tmp/calibre_0.7.12_J5Qwo9_plumber/feed_0/article_5/imprimir_art.xhtml
Processing images...
Processing images...
Processing links...
Processing links...
http://www.lostiempos.com/imprimir_a...fecha=20100806 saved to /tmp/calibre_0.7.12_J5Qwo9_plumber/feed_0/article_7/imprimir_art.xhtml
http://www.lostiempos.com/imprimir_a...fecha=20100806 saved to /tmp/calibre_0.7.12_J5Qwo9_plumber/feed_0/article_6/imprimir_art.xhtml
Downloaded article: Presentan en Cochabamba la Chicha y el Ajayu de coca from http://www.lostiempos.com/diario/act...61_159931.html
Downloaded article: Piloto del Enola Gay no se arrepiente de haber lanzado la bomba de Hiroshima from http://www.lostiempos.com/diario/act...67_159950.html
Downloaded article: Ecuador y Chile reconocen vigencia de tratados marítimos from http://www.lostiempos.com/diario/act...60_159929.html
Parsing all content...
Parsing index.html ...
Forcing index.html into XHTML namespace
Parsing feed_0/article_6/index.html ...
Initial parse failed:
Traceback (most recent call last):
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 816, in first_pass
File "lxml.etree.pyx", line 2538, in lxml.etree.fromstring (src/lxml/lxml.etree.c:48266)
File "parser.pxi", line 1536, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:71653)
File "parser.pxi", line 1408, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:70449)
File "parser.pxi", line 898, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:67144)
File "parser.pxi", line 539, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:63820)
File "parser.pxi", line 625, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:64741)
File "parser.pxi", line 565, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:64084)
XMLSyntaxError: Start tag expected, '<' not found, line 2, column 1

Parsing file 'feed_0/article_6/index.html' as HTML
Failed to parse content in feed_0/article_6/index.html
Traceback (most recent call last):
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/reader.py", line 159, in _manifest_prune_invalid
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 1060, in fget
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 840, in _parse_xhtml
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 829, in first_pass
File "/usr/lib/python2.6/site-packages/lxml/html/__init__.py", line 603, in fromstring
File "/usr/lib/python2.6/site-packages/lxml/html/__init__.py", line 514, in document_fromstring
ParserError: Document is empty

Parsing feed_0/article_1/index.html ...
Initial parse failed:
Traceback (most recent call last):
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 816, in first_pass
File "lxml.etree.pyx", line 2538, in lxml.etree.fromstring (src/lxml/lxml.etree.c:48266)
File "parser.pxi", line 1536, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:71653)
File "parser.pxi", line 1408, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:70449)
File "parser.pxi", line 898, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:67144)
File "parser.pxi", line 539, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:63820)
File "parser.pxi", line 625, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:64741)
File "parser.pxi", line 565, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:64084)
XMLSyntaxError: Start tag expected, '<' not found, line 2, column 1

Parsing file 'feed_0/article_1/index.html' as HTML
Failed to parse content in feed_0/article_1/index.html
Traceback (most recent call last):
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/reader.py", line 159, in _manifest_prune_invalid
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 1060, in fget
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 840, in _parse_xhtml
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 829, in first_pass
File "/usr/lib/python2.6/site-packages/lxml/html/__init__.py", line 603, in fromstring
File "/usr/lib/python2.6/site-packages/lxml/html/__init__.py", line 514, in document_fromstring
ParserError: Document is empty

Parsing feed_0/article_0/index.html ...
Initial parse failed:
Traceback (most recent call last):
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 816, in first_pass
File "lxml.etree.pyx", line 2538, in lxml.etree.fromstring (src/lxml/lxml.etree.c:48266)
File "parser.pxi", line 1536, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:71653)
File "parser.pxi", line 1408, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:70449)
File "parser.pxi", line 898, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:67144)
File "parser.pxi", line 539, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:63820)
File "parser.pxi", line 625, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:64741)
File "parser.pxi", line 565, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:64084)
XMLSyntaxError: Start tag expected, '<' not found, line 2, column 1

Parsing file 'feed_0/article_0/index.html' as HTML
Failed to parse content in feed_0/article_0/index.html
Traceback (most recent call last):
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/reader.py", line 159, in _manifest_prune_invalid
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 1060, in fget
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 840, in _parse_xhtml
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 829, in first_pass
File "/usr/lib/python2.6/site-packages/lxml/html/__init__.py", line 603, in fromstring
File "/usr/lib/python2.6/site-packages/lxml/html/__init__.py", line 514, in document_fromstring
ParserError: Document is empty

Parsing feed_0/article_2/index.html ...
Initial parse failed:
Traceback (most recent call last):
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 816, in first_pass
File "lxml.etree.pyx", line 2538, in lxml.etree.fromstring (src/lxml/lxml.etree.c:48266)
File "parser.pxi", line 1536, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:71653)
File "parser.pxi", line 1408, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:70449)
File "parser.pxi", line 898, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:67144)
File "parser.pxi", line 539, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:63820)
File "parser.pxi", line 625, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:64741)
File "parser.pxi", line 565, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:64084)
XMLSyntaxError: Start tag expected, '<' not found, line 2, column 1

Parsing file 'feed_0/article_2/index.html' as HTML
Failed to parse content in feed_0/article_2/index.html
Traceback (most recent call last):
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/reader.py", line 159, in _manifest_prune_invalid
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 1060, in fget
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 840, in _parse_xhtml
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 829, in first_pass
File "/usr/lib/python2.6/site-packages/lxml/html/__init__.py", line 603, in fromstring
File "/usr/lib/python2.6/site-packages/lxml/html/__init__.py", line 514, in document_fromstring
ParserError: Document is empty

Parsing feed_0/article_5/index.html ...
Initial parse failed:
Traceback (most recent call last):
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 816, in first_pass
File "lxml.etree.pyx", line 2538, in lxml.etree.fromstring (src/lxml/lxml.etree.c:48266)
File "parser.pxi", line 1536, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:71653)
File "parser.pxi", line 1408, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:70449)
File "parser.pxi", line 898, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:67144)
File "parser.pxi", line 539, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:63820)
File "parser.pxi", line 625, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:64741)
File "parser.pxi", line 565, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:64084)
XMLSyntaxError: Start tag expected, '<' not found, line 2, column 1

Parsing file 'feed_0/article_5/index.html' as HTML
Failed to parse content in feed_0/article_5/index.html
Traceback (most recent call last):
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/reader.py", line 159, in _manifest_prune_invalid
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 1060, in fget
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 840, in _parse_xhtml
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 829, in first_pass
File "/usr/lib/python2.6/site-packages/lxml/html/__init__.py", line 603, in fromstring
File "/usr/lib/python2.6/site-packages/lxml/html/__init__.py", line 514, in document_fromstring
ParserError: Document is empty

Parsing feed_0/article_4/index.html ...
Initial parse failed:
Traceback (most recent call last):
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 816, in first_pass
File "lxml.etree.pyx", line 2538, in lxml.etree.fromstring (src/lxml/lxml.etree.c:48266)
File "parser.pxi", line 1536, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:71653)
File "parser.pxi", line 1408, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:70449)
File "parser.pxi", line 898, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:67144)
File "parser.pxi", line 539, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:63820)
File "parser.pxi", line 625, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:64741)
File "parser.pxi", line 565, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:64084)
XMLSyntaxError: Start tag expected, '<' not found, line 2, column 1

Parsing file 'feed_0/article_4/index.html' as HTML
Failed to parse content in feed_0/article_4/index.html
Traceback (most recent call last):
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/reader.py", line 159, in _manifest_prune_invalid
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 1060, in fget
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 840, in _parse_xhtml
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 829, in first_pass
File "/usr/lib/python2.6/site-packages/lxml/html/__init__.py", line 603, in fromstring
File "/usr/lib/python2.6/site-packages/lxml/html/__init__.py", line 514, in document_fromstring
ParserError: Document is empty

Parsing feed_0/index.html ...
Initial parse failed:
Traceback (most recent call last):
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 816, in first_pass
File "lxml.etree.pyx", line 2538, in lxml.etree.fromstring (src/lxml/lxml.etree.c:48266)
File "parser.pxi", line 1536, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:71653)
File "parser.pxi", line 1408, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:70449)
File "parser.pxi", line 898, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:67144)
File "parser.pxi", line 539, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:63820)
File "parser.pxi", line 625, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:64741)
File "parser.pxi", line 565, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:64084)
XMLSyntaxError: Opening and ending tag mismatch: br line 32 and div, line 33, column 7

Parsing file 'feed_0/index.html' as HTML
Forcing feed_0/index.html into XHTML namespace
Parsing feed_0/article_7/index.html ...
Initial parse failed:
Traceback (most recent call last):
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 816, in first_pass
File "lxml.etree.pyx", line 2538, in lxml.etree.fromstring (src/lxml/lxml.etree.c:48266)
File "parser.pxi", line 1536, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:71653)
File "parser.pxi", line 1408, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:70449)
File "parser.pxi", line 898, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:67144)
File "parser.pxi", line 539, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:63820)
File "parser.pxi", line 625, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:64741)
File "parser.pxi", line 565, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:64084)
XMLSyntaxError: Start tag expected, '<' not found, line 2, column 1

Parsing file 'feed_0/article_7/index.html' as HTML
Failed to parse content in feed_0/article_7/index.html
Traceback (most recent call last):
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/reader.py", line 159, in _manifest_prune_invalid
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 1060, in fget
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 840, in _parse_xhtml
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 829, in first_pass
File "/usr/lib/python2.6/site-packages/lxml/html/__init__.py", line 603, in fromstring
File "/usr/lib/python2.6/site-packages/lxml/html/__init__.py", line 514, in document_fromstring
ParserError: Document is empty

Parsing feed_0/article_3/index.html ...
Initial parse failed:
Traceback (most recent call last):
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 816, in first_pass
File "lxml.etree.pyx", line 2538, in lxml.etree.fromstring (src/lxml/lxml.etree.c:48266)
File "parser.pxi", line 1536, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:71653)
File "parser.pxi", line 1408, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:70449)
File "parser.pxi", line 898, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:67144)
File "parser.pxi", line 539, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:63820)
File "parser.pxi", line 625, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:64741)
File "parser.pxi", line 565, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:64084)
XMLSyntaxError: Start tag expected, '<' not found, line 2, column 1

Parsing file 'feed_0/article_3/index.html' as HTML
Failed to parse content in feed_0/article_3/index.html
Traceback (most recent call last):
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/reader.py", line 159, in _manifest_prune_invalid
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 1060, in fget
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 840, in _parse_xhtml
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 829, in first_pass
File "/usr/lib/python2.6/site-packages/lxml/html/__init__.py", line 603, in fromstring
File "/usr/lib/python2.6/site-packages/lxml/html/__init__.py", line 514, in document_fromstring
ParserError: Document is empty

Referenced file 'feed_0/article_7/index.html' not in manifest
Referenced file 'feed_0/article_6/index.html' not in manifest
Referenced file 'feed_0/article_1/index.html' not in manifest
Referenced file 'feed_0/article_3/index.html' not in manifest
Referenced file 'feed_0/article_2/index.html' not in manifest
Referenced file 'feed_0/article_5/index.html' not in manifest
Referenced file 'feed_0/article_4/index.html' not in manifest
Referenced file 'feed_0/article_0/index.html' not in manifest
Parsing feed_0/article_0/index.html ...
Initial parse failed:
Traceback (most recent call last):
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 816, in first_pass
File "lxml.etree.pyx", line 2538, in lxml.etree.fromstring (src/lxml/lxml.etree.c:48266)
File "parser.pxi", line 1536, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:71653)
File "parser.pxi", line 1408, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:70449)
File "parser.pxi", line 898, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:67144)
File "parser.pxi", line 539, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:63820)
File "parser.pxi", line 625, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:64741)
File "parser.pxi", line 565, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:64084)
XMLSyntaxError: Start tag expected, '<' not found, line 2, column 1

Parsing file 'feed_0/article_0/index.html' as HTML
Traceback (most recent call last):
File "/tmp/init.py", line 48, in <module>
File "/home/kovid/build/calibre/src/calibre/utils/ipc/worker.py", line 99, in main
File "/home/kovid/build/calibre/src/calibre/gui2/convert/gui_conversion.py", line 24, in gui_convert
File "/home/kovid/build/calibre/src/calibre/ebooks/conversion/plumber.py", line 824, in run
File "/home/kovid/build/calibre/src/calibre/ebooks/conversion/plumber.py", line 951, in create_oebbook
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/reader.py", line 72, in __call__
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/reader.py", line 593, in _all_from_opf
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/reader.py", line 243, in _manifest_from_opf
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/reader.py", line 176, in _manifest_add_missing
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 1060, in fget
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 840, in _parse_xhtml
File "/home/kovid/build/calibre/src/calibre/ebooks/oeb/base.py", line 829, in first_pass
File "/usr/lib/python2.6/site-packages/lxml/html/__init__.py", line 603, in fromstring
File "/usr/lib/python2.6/site-packages/lxml/html/__init__.py", line 514, in document_fromstring
lxml.etree.ParserError: Document is empty
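
Since the log is long and repetitive, here is a minimal sketch for boiling it down to one entry per file, in case that makes it easier to see which articles fail and how. It only uses the Python standard library and keys off the "Parsing ..." and "...Error: ..." lines as they appear in the log above. The names `summarize` and `empty_documents` are my own hypothetical helpers, not part of calibre or lxml.

```python
import re
from collections import defaultdict

# Both patterns are assumptions based only on the log lines shown above.
# Matches "Parsing feed_0/article_6/index.html ..." and
# "Parsing file 'feed_0/article_6/index.html' as HTML".
PARSING = re.compile(r"^Parsing (?:file ')?(?P<path>\S+?\.html)'?")
# Matches "XMLSyntaxError: ...", "ParserError: ..." and
# "lxml.etree.ParserError: ...".
ERROR = re.compile(
    r"^(?:lxml\.etree\.)?(?P<kind>XMLSyntaxError|ParserError): (?P<msg>.+)$"
)


def summarize(log_text):
    """Map each parsed file to the distinct parser errors logged after it."""
    errors = defaultdict(list)
    current = None  # file named by the most recent "Parsing ..." line
    for raw in log_text.splitlines():
        line = raw.strip()
        match = PARSING.match(line)
        if match:
            current = match.group("path")
            continue
        match = ERROR.match(line)
        if match and current is not None:
            entry = (match.group("kind"), match.group("msg"))
            if entry not in errors[current]:  # skip repeated tracebacks
                errors[current].append(entry)
    return dict(errors)


def empty_documents(summary):
    """Return the files for which the HTML fallback saw an empty document."""
    return sorted(
        path
        for path, errs in summary.items()
        if any(kind == "ParserError" and msg == "Document is empty"
               for kind, msg in errs)
    )
```

To use it, the log would be saved to a text file and its contents passed to `summarize`; `empty_documents` then lists the files where the fallback HTML parse reported "Document is empty", as opposed to files that only failed the strict XML parse.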