| 
			
			 | 
		#1 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 2 
				Karma: 10 
				Join Date: Mar 2011 
				
				
				
				Device: Nook 
				
				
				 | 
	
	
	
		
		
			
			 
				
				HTML to EPUB conversion crashes
			 
			
			
			I'm running Calibre 0.7.38 on Fedora 14, and whenever I try and convert a html book to epub it crashes. I've captured the crash output here: 
		
	
		
		
		
		
		
		
		
		
		
		
	
	ERROR: Conversion Error: <b>Failed</b>: Convert book 1 of 1 (The drowned world) Convert book 1 of 1 (The drowned world) Processing archive... Resolved conversion options calibre version: 0.7.38 {'asciiize': True, 'author_sort': None, 'authors': None, 'base_font_size': 0.0, 'book_producer': None, 'breadth_first': False, 'change_justification': u'original', 'chapter': u"//*[((name()='h1' or name()='h2') and re:test(., 'chapter|book|section|part|prologue|epilogue\\s+', 'i')) or @class = 'chapter']", 'chapter_mark': u'pagebreak', 'comments': None, 'cover': None, 'debug_pipeline': None, 'disable_font_rescaling': False, 'dont_package': False, 'dont_split_on_page_breaks': False, 'epub_flatten': False, 'extra_css': None, 'extract_to': None, 'flow_size': 260, 'font_size_mapping': None, 'footer_regex': u'(?i)(?<=<hr>)((\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?\\d+<br>\\s*.*?\\s*)|(\\s* <a name=\\d+></a>((<img.+?>)*<br>\\s*)?.*?<br>\\s*\\d+))(?=<br>)' , 'header_regex': u'(?i)(?<=<hr>)((\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?\\d+<br>\\s*.*?\\s*)|(\\s* <a name=\\d+></a>((<img.+?>)*<br>\\s*)?.*?<br>\\s*\\d+))(?=<br>)' , 'html_unwrap_factor': 0.4, 'input_encoding': '', 'input_profile': <calibre.customize.profiles.InputProfile object at 0x378d6d0>, 'insert_blank_line': False, 'insert_metadata': False, 'isbn': None, 'keep_ligatures': False, 'language': None, 'level1_toc': None, 'level2_toc': None, 'level3_toc': None, 'line_height': 0.0, 'linearize_tables': False, 'margin_bottom': 5.0, 'margin_left': 5.0, 'margin_right': 5.0, 'margin_top': 5.0, 'max_levels': 5, 'max_toc_links': 50, 'minimum_line_height': 120.0, 'no_chapters_in_toc': False, 'no_default_epub_cover': False, 'no_inline_navbars': False, 'no_svg_cover': False, 'output_profile': <calibre.customize.profiles.NookOutput object at 0x378ddd0>, 'page_breaks_before': u"//*[name()='h1' or name()='h2']", 'prefer_metadata_cover': False, 'preprocess_html': False, 'preserve_cover_aspect_ratio': False, 'pretty_print': True, 'pubdate': None, 'publisher': None, 'rating': None, 'read_metadata_from_opf': '/tmp/calibre_0.7.38_tmp_ArlGun/calibre_0.7.38_5evi0W.opf', 'remove_first_image': False, 'remove_footer': False, 'remove_header': False, 'remove_paragraph_spacing': False, 'remove_paragraph_spacing_indent_size': 1.5, 'series': None, 'series_index': None, 'smarten_punctuation': False, 'tags': None, 'timestamp': None, 'title': None, 'title_sort': None, 'toc_filter': None, 'toc_threshold': 6, 'use_auto_toc': False, 'verbose': 2} InputFormatPlugin: HTML Input running on /tmp/calibre_0.7.38_tmp_ArlGun/calibre_0.7.38_hdYewr_plumber/content.opf Parsing all content... Traceback (most recent call last): File "/usr/bin/calibre-parallel", line 19, in <module> sys.exit(main()) File "/usr/lib64/calibre/calibre/utils/ipc/worker.py", line 106, in main result = func(*args, **kwargs) File "/usr/lib64/calibre/calibre/gui2/convert/gui_conversion.py", line 24, in gui_convert plumber.run() File "/usr/lib64/calibre/calibre/ebooks/conversion/plumber.py", line 853, in run accelerators, tdir) File "/usr/lib64/calibre/calibre/customize/conversion.py", line 216, in __call__ log, accelerators) File "/usr/lib64/calibre/calibre/ebooks/html/input.py", line 299, in convert encoding=opts.input_encoding) File "/usr/lib64/calibre/calibre/ebooks/conversion/plumber.py", line 990, in create_oebbook reader()(oeb, path_or_stream) File "/usr/lib64/calibre/calibre/ebooks/oeb/reader.py", line 71, in __call__ opf = self._read_opf() File "/usr/lib64/calibre/calibre/ebooks/oeb/reader.py", line 104, in _read_opf data = self.oeb.decode(data) File "/usr/lib64/calibre/calibre/ebooks/oeb/base.py", line 1897, in decode return fix_data(data.decode(self.input_encoding, 'replace')) LookupError: unknown encoding: I'm not sure what it means by unknown encoding. It successfully imports the HTML to ZIP, and only crashes when I convert to EPUB. I've been saving them as text and then adding them back into my library and converting that way, but it's not ideal. Anyone have any ideas, or know of a bulk HTML to txt/PDF (I know, gasp!) converter that would make this easier on me? Thanks!  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#2 | 
| 
			
			
			
			 creator of calibre 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,609 
				Karma: 28549044 
				Join Date: Oct 2006 
				Location: Mumbai, India 
				
				
				Device: Various 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			You  have an invalid encoding specified in conversion->look & Feel
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| Advert | |
| 
         | 
    
| 
			
			 | 
		#3 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 2 
				Karma: 10 
				Join Date: Mar 2011 
				
				
				
				Device: Nook 
				
				
				 | 
	
	|
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
![]()  | 
            
        
    
            
  | 
    
			 
			Similar Threads
		 | 
	||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| Classic Nook crashes with epub from html with arrows. | Archon | Barnes & Noble NOOK | 4 | 02-02-2011 09:33 AM | 
| Conversion Help Please - HTML to ePub | PocketGoddess | Calibre | 1 | 11-22-2010 03:01 PM | 
| Help with HTML to ePub conversion...? | Nethfel | Calibre | 4 | 05-10-2010 03:26 PM | 
| ePub conversion crashes reader | shmish | Calibre | 1 | 03-07-2010 01:16 PM | 
| Calibre 0.51 Crashes on EPUB conversion | fogus | Calibre | 2 | 03-14-2009 10:47 PM |