View Single Post
Old 03-29-2011, 01:06 AM   #1
Bikeridr
Information Acquirer
Bikeridr ought to be getting tired of karma fortunes by now.Bikeridr ought to be getting tired of karma fortunes by now.Bikeridr ought to be getting tired of karma fortunes by now.Bikeridr ought to be getting tired of karma fortunes by now.Bikeridr ought to be getting tired of karma fortunes by now.Bikeridr ought to be getting tired of karma fortunes by now.Bikeridr ought to be getting tired of karma fortunes by now.Bikeridr ought to be getting tired of karma fortunes by now.Bikeridr ought to be getting tired of karma fortunes by now.Bikeridr ought to be getting tired of karma fortunes by now.Bikeridr ought to be getting tired of karma fortunes by now.
 
Bikeridr's Avatar
 
Posts: 436
Karma: 4265156
Join Date: Sep 2010
Location: Latvia, Rigas Rajons
Device: Kindle 3 International, Pocketbook Color
Another HTML conversion error

Hello,

I've read through a few (HTML) conversion threads, but can't find why I get all the errors.

I'm trying to convert a web document fetched here to .mobi. My FireFox stores the html as a .htm with a folder containing all "extra" files.
I've imported the .htm and manually copied the "additional files" folder to where Calibre stores the imported .htm.
I've tried both with and without heuristics, and below is the error message without the heuristics:

calibre, version 0.7.52
ERROR: Feil ved konverteringen: <b>Feilet</b>: Convert book 1 of 1 (Speaker Wire - A History)

Convert book 1 of 1 (Speaker Wire - A History)
Resolved conversion options
calibre version: 0.7.52
{'asciiize': True,
'author_sort': None,
'authors': None,
'base_font_size': 0.0,
'book_producer': None,
'breadth_first': False,
'change_justification': u'original',
'chapter': u"//*[((name()='h1' or name()='h2') and re:test(., 'chapter|book|section|part\\s+', 'i')) or @class = 'chapter']",
'chapter_mark': u'pagebreak',
'comments': None,
'cover': None,
'debug_pipeline': None,
'dehyphenate': True,
'delete_blank_paragraphs': True,
'disable_font_rescaling': True,
'dont_compress': False,
'dont_package': False,
'enable_heuristics': False,
'extra_css': None,
'fix_indents': True,
'font_size_mapping': None,
'format_scene_breaks': True,
'html_unwrap_factor': 0.4,
'input_encoding': u'cp1252',
'input_profile': <calibre.customize.profiles.InputProfile object at 0x04FF6FB0>,
'insert_blank_line': False,
'insert_metadata': False,
'isbn': None,
'italicize_common_cases': True,
'keep_ligatures': False,
'language': None,
'level1_toc': None,
'level2_toc': None,
'level3_toc': None,
'line_height': 0.0,
'linearize_tables': False,
'margin_bottom': 5.0,
'margin_left': 5.0,
'margin_right': 5.0,
'margin_top': 5.0,
'markup_chapter_headings': True,
'max_levels': 5,
'max_toc_links': 50,
'minimum_line_height': 120.0,
'mobi_ignore_margins': False,
'no_chapters_in_toc': False,
'no_inline_navbars': True,
'no_inline_toc': False,
'output_profile': <calibre.customize.profiles.KindleOutput object at 0x050272F0>,
'page_breaks_before': u"//*[name()='h1' or name()='h2']",
'personal_doc': u'[PDOC]',
'prefer_author_sort': False,
'prefer_metadata_cover': False,
'pretty_print': False,
'pubdate': None,
'publisher': None,
'rating': None,
'read_metadata_from_opf': 'c:\\users\\esben\\appdata\\local\\temp\\calibre_0 .7.52_tmp_xae9af\\calibre_0.7.52_irqbce.opf',
'remove_fake_margins': True,
'remove_first_image': False,
'remove_paragraph_spacing': False,
'remove_paragraph_spacing_indent_size': 1.5,
'renumber_headings': True,
'replace_scene_breaks': u'',
'rescale_images': False,
'series': None,
'series_index': None,
'smarten_punctuation': False,
'sr1_replace': None,
'sr1_search': None,
'sr2_replace': None,
'sr2_search': None,
'sr3_replace': None,
'sr3_search': None,
'tags': None,
'timestamp': None,
'title': None,
'title_sort': None,
'toc_filter': None,
'toc_threshold': 6,
'toc_title': None,
'unwrap_lines': True,
'use_auto_toc': False,
'verbose': 2}
InputFormatPlugin: HTML Input running
on D:\Dokumenter\Calibre_Bibliotek\Russell_ Roger\Speaker Wire - A History (40)\Speaker Wire - A History - Russell_ Roger.htm
Language not specified
Building file list...
Found files...
HTMLFile:0:a:\Dokumenter\Calibre_Bibliotek\Russe ll_ Roger\Speaker Wire - A History (40)\Speaker Wire - A History - Russell_ Roger.htm
Normalizing filename cases
Rewriting HTML links
Parsing Speaker%20Wire%20-%20A%20History%20-%20Russell_%20Roger.htm ...
Initial parse failed:
Traceback (most recent call last):
File "site-packages\calibre\ebooks\oeb\base.py", line 886, in first_pass
File "lxml.etree.pyx", line 2532, in lxml.etree.fromstring (src/lxml/lxml.etree.c:48634)
File "parser.pxi", line 1545, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:72245)
File "parser.pxi", line 1417, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:71041)
File "parser.pxi", line 898, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:67581)
File "parser.pxi", line 539, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:64257)
File "parser.pxi", line 625, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:65178)
File "parser.pxi", line 565, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:64521)
XMLSyntaxError: Opening and ending tag mismatch: meta line 57 and head, line 58, column 8

Parsing file 'Speaker%20Wire%20-%20A%20History%20-%20Russell_%20Roger.htm' as HTML
Forcing Speaker%20Wire%20-%20A%20History%20-%20Russell_%20Roger.htm into XHTML namespace
Added d:\dokumenter\calibre_bibliotek\russell_ roger\speaker wire - a history (40)\filer_for_wire\image19.jpg
Added d:\dokumenter\calibre_bibliotek\russell_ roger\speaker wire - a history (40)\filer_for_wire\colorbar.gif
Added d:\dokumenter\calibre_bibliotek\russell_ roger\speaker wire - a history (40)\filer_for_wire\wirebusters3.jpg
Added d:\dokumenter\calibre_bibliotek\russell_ roger\speaker wire - a history (40)\filer_for_wire\impedance.jpg
Added d:\dokumenter\calibre_bibliotek\russell_ roger\speaker wire - a history (40)\filer_for_wire\response6.jpg
Added d:\dokumenter\calibre_bibliotek\russell_ roger\speaker wire - a history (40)\filer_for_wire\hearing3.jpg
Added d:\dokumenter\calibre_bibliotek\russell_ roger\speaker wire - a history (40)\filer_for_wire\wire8.jpg
Added d:\dokumenter\calibre_bibliotek\russell_ roger\speaker wire - a history (40)\filer_for_wire\wire9.jpg
Added d:\dokumenter\calibre_bibliotek\russell_ roger\speaker wire - a history (40)\filer_for_wire\wire4.jpg
Added d:\dokumenter\calibre_bibliotek\russell_ roger\speaker wire - a history (40)\filer_for_wire\wire5.jpg
Added d:\dokumenter\calibre_bibliotek\russell_ roger\speaker wire - a history (40)\filer_for_wire\monsterb.jpg
Added d:\dokumenter\calibre_bibliotek\russell_ roger\speaker wire - a history (40)\filer_for_wire\monsterc.jpg
Added d:\dokumenter\calibre_bibliotek\russell_ roger\speaker wire - a history (40)\filer_for_wire\thin%20wire.jpg
Python function terminated unexpectedly
[Errno 2] No such file or directory: u'd:\\dokumenter\\calibre_bibliotek\\russell_ roger\\speaker wire - a history (40)\\filer_for_wire\\thin wire.jpg' (Error Code: 1)
Traceback (most recent call last):
File "site.py", line 103, in main
File "site.py", line 85, in run_entry_point
File "site-packages\calibre\utils\ipc\worker.py", line 110, in main
File "site-packages\calibre\gui2\convert\gui_conversion.py", line 31, in gui_convert_override
File "site-packages\calibre\gui2\convert\gui_conversion.py", line 25, in gui_convert
File "site-packages\calibre\ebooks\conversion\plumber.py", line 915, in run
File "site-packages\calibre\customize\conversion.py", line 204, in __call__
File "site-packages\calibre\ebooks\html\input.py", line 294, in convert
File "site-packages\calibre\ebooks\html\input.py", line 373, in create_oebbook
File "site-packages\calibre\ebooks\oeb\base.py", line 185, in rewrite_links
File "site-packages\calibre\ebooks\html\input.py", line 468, in resource_adder
File "site-packages\calibre\ebooks\oeb\base.py", line 1148, in fget
File "site-packages\calibre\ebooks\oeb\base.py", line 472, in read
IOError: [Errno 2] No such file or directory: u'd:\\dokumenter\\calibre_bibliotek\\russell_ roger\\speaker wire - a history (40)\\filer_for_wire\\thin wire.jpg'

The converted .pdf (from file -> print menu in browser) does however convert to .mobi, but the result is not 100%
Bikeridr is offline   Reply With Quote