View Single Post
Old 06-07-2010, 06:57 AM   #1
sglinert
Enthusiast
sglinert ought to be getting tired of karma fortunes by now.sglinert ought to be getting tired of karma fortunes by now.sglinert ought to be getting tired of karma fortunes by now.sglinert ought to be getting tired of karma fortunes by now.sglinert ought to be getting tired of karma fortunes by now.sglinert ought to be getting tired of karma fortunes by now.sglinert ought to be getting tired of karma fortunes by now.sglinert ought to be getting tired of karma fortunes by now.sglinert ought to be getting tired of karma fortunes by now.sglinert ought to be getting tired of karma fortunes by now.sglinert ought to be getting tired of karma fortunes by now.
 
Posts: 49
Karma: 1024532
Join Date: May 2008
Device: iPad Air, iPod Touch 5, Illudium Q-42 Explosive Space Modulator
Can't convert RTF files

I have about 4000 rtf files that were saved from Word 2003. Calibre won't convert them. Here's the error:

__________________

ERROR: Conversion Error: <b>Failed</b>: Convert book 1 of 1 (Acceptance)

Convert book 1 of 1 (Acceptance)
Resolved conversion options
calibre version: 0.7.1
{'asciiize': False,
'author_sort': None,
'authors': None,
'base_font_size': 0.0,
'book_producer': None,
'change_justification': u'original',
'chapter': u"//*[((name()='h1' or name()='h2') and re:test(., 'chapter|book|section|part\\s+', 'i')) or @class = 'chapter']",
'chapter_mark': u'pagebreak',
'comments': None,
'cover': None,
'debug_pipeline': None,
'disable_font_rescaling': False,
'dont_split_on_page_breaks': False,
'extra_css': None,
'extract_to': None,
'flow_size': 1640,
'font_size_mapping': None,
'footer_regex': u'(?i)(?<=<hr>)((\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?\\d+<br>\\s*.*?\\s*)|(\\s* <a name=\\d+></a>((<img.+?>)*<br>\\s*)?.*?<br>\\s*\\d+))(?=<br>)' ,
'header_regex': u'(?i)(?<=<hr>)((\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?\\d+<br>\\s*.*?\\s*)|(\\s* <a name=\\d+></a>((<img.+?>)*<br>\\s*)?.*?<br>\\s*\\d+))(?=<br>)' ,
'input_encoding': None,
'input_profile': <calibre.customize.profiles.InputProfile object at 0x039EAA90>,
'insert_blank_line': False,
'insert_metadata': False,
'isbn': None,
'keep_ligatures': False,
'language': None,
'level1_toc': None,
'level2_toc': None,
'level3_toc': None,
'line_height': 0.0,
'linearize_tables': False,
'margin_bottom': 5.0,
'margin_left': 5.0,
'margin_right': 5.0,
'margin_top': 5.0,
'max_toc_links': 50,
'no_chapters_in_toc': True,
'no_default_epub_cover': False,
'no_inline_navbars': False,
'no_svg_cover': False,
'output_profile': <calibre.customize.profiles.OutputProfile object at 0x039EAC70>,
'page_breaks_before': u'/',
'prefer_metadata_cover': False,
'preprocess_html': False,
'preserve_cover_aspect_ratio': False,
'pretty_print': True,
'pubdate': None,
'publisher': None,
'rating': None,
'read_metadata_from_opf': 'c:\\docume~1\\susans~1\\locals~1\\temp\\calibre_0 .7.1_iqiitk.opf',
'remove_first_image': False,
'remove_footer': False,
'remove_header': False,
'remove_paragraph_spacing': False,
'remove_paragraph_spacing_indent_size': 1.5,
'series': None,
'series_index': None,
'tags': None,
'timestamp': None,
'title': None,
'title_sort': None,
'toc_filter': None,
'toc_threshold': 6,
'use_auto_toc': False,
'verbose': 2}
InputFormatPlugin: RTF Input running
on E:\E-Books\EBook Library\Harry Potter Fanfiction\Acceptance (4306)\Acceptance - Harry Potter Fanfiction.rtf
Converting RTF to XML...
Preprocessing to convert unicode characters
line: 1 has bad encoding
File "preprocessed.rtf" does not appear to be RTF.
Python function terminated unexpectedly
'ascii' codec can't decode byte 0xbe in position 53: ordinal not in range(128)Invalid RTF: File not ascii encoded.
(Error Code: 1)
Traceback (most recent call last):
File "site.py", line 103, in main
File "site.py", line 85, in run_entry_point
File "site-packages\calibre\utils\ipc\worker.py", line 99, in main
File "site-packages\calibre\gui2\convert\gui_conversion.py", line 24, in gui_convert
File "site-packages\calibre\ebooks\conversion\plumber.py", line 808, in run
File "site-packages\calibre\customize\conversion.py", line 211, in __call__
File "site-packages\calibre\ebooks\rtf\input.py", line 197, in convert
File "site-packages\calibre\ebooks\rtf\input.py", line 88, in generate_xml
File "site-packages\calibre\ebooks\rtf2xml\ParseRtf.py", line 238, in parse_rtf
calibre.ebooks.rtf2xml.ParseRtf.InvalidRtfExceptio n: 'ascii' codec can't decode byte 0xbe in position 53: ordinal not in range(128)Invalid RTF: File not ascii encoded.

________________

There's nothing odd about them, as far as I can see. Line 1 of the file in question seems to be a carriage return.

Saving the file as HTML and converting worked fine. But, needless to say, I am not up to converting 4000 files to html, one by one. Happy to send files if that would help figure out the problem.
sglinert is offline   Reply With Quote