Old 12-21-2010, 08:48 PM   #1
kdd6
Junior Member
Posts: 8
Karma: 10
Join Date: Oct 2010
Device: none
can't open e-book with e-book reader and other stuff

I waited at least 30 minutes before terminating the e-book reader. I was trying to open a 41 MB .prc file. I then tried to open the same file in Book Designer: a message containing the words "palm file" flashed by before it choked and would not open the book.

For the past several days I have tried to convert this book to a text file. The conversion gets through extracting text, extracting images and parsing HTML, reports malformed markup and falls back to parsing with BeautifulSoup, and then Python terminates unexpectedly; a traceback is logged, ending in a MemoryError. I have also tried converting to MOBI, EPUB, PDB, etc. DRM was not an issue.

The following is the resulting log, to give you an idea of what has been happening. Any ideas would be appreciated.

ERROR: Conversion Error: Failed: Convert book 1 of 1 (My Detective Library - including 300 ''cases'' (works) by 33 greatest authors of mystery and crime fiction)

Convert book 1 of 1 (My Detective Library - including 300 ''cases'' (works) by 33 greatest authors of mystery and crime fiction)
Resolved conversion options
calibre version: 0.7.24
{'asciiize': False,
'author_sort': None,
'authors': None,
'base_font_size': 0.0,
'book_producer': None,
'change_justification': u'original',
'chapter': u"//*[((name()='h1' or name()='h2') and re:test(., 'chapter|book|section|part|prologue|epilogue\\s+', 'i')) or @class = 'chapter']",
'chapter_mark': u'pagebreak',
'comments': None,
'cover': None,
'debug_pipeline': u'C:/Documents and Settings/KlausDieterDill/My Documents/My eBooks',
'disable_font_rescaling': False,
'extra_css': None,
'font_size_mapping': None,
'footer_regex': u'(?i)(?<=<hr>)((\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?\\d+<br>\\s*.*?\\s*)|(\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?.*?<br>\\s*\\d+))(?=<br>)',
'force_max_line_length': False,
'header_regex': u'(?i)(?<=<hr>)((\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?\\d+<br>\\s*.*?\\s*)|(\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?.*?<br>\\s*\\d+))(?=<br>)',
'html_unwrap_factor': 0.40000000000000002,
'inline_toc': False,
'input_encoding': None,
'input_profile': <calibre.customize.profiles.MobipocketInput object at 0x03BF9A90>,
'insert_blank_line': False,
'insert_metadata': False,
'isbn': None,
'keep_ligatures': False,
'language': None,
'level1_toc': None,
'level2_toc': None,
'level3_toc': None,
'line_height': 0.0,
'linearize_tables': False,
'margin_bottom': 5.0,
'margin_left': 5.0,
'margin_right': 5.0,
'margin_top': 5.0,
'max_line_length': 0,
'max_toc_links': 50,
'newline': u'system',
'no_chapters_in_toc': False,
'no_inline_navbars': False,
'output_encoding': 'utf-8',
'output_profile': <calibre.customize.profiles.JetBook5Output object at 0x03BF9C50>,
'page_breaks_before': u"//*[name()='h1' or name()='h2']",
'prefer_metadata_cover': False,
'preprocess_html': False,
'pretty_print': False,
'pubdate': None,
'publisher': None,
'rating': None,
'read_metadata_from_opf': 'c:\\docume~1\\klausd~1\\locals~1\\temp\\calibre_0.7.24_tmp_k0jxcl\\calibre_0.7.24_kwis1p.opf',
'remove_first_image': False,
'remove_footer': False,
'remove_header': False,
'remove_paragraph_spacing': False,
'remove_paragraph_spacing_indent_size': 1.5,
'series': None,
'series_index': None,
'smarten_punctuation': False,
'tags': None,
'timestamp': None,
'title': None,
'title_sort': None,
'toc_filter': None,
'toc_threshold': 6,
'use_auto_toc': False,
'verbose': 2}
InputFormatPlugin: MOBI Input running
on C:\Documents and Settings\KlausDieterDill\Calibre Library\Doyle_ Sir Arthur Conan\My Detective Library - including 300 ''c (2)\My Detective Library - including 300 ''c - Doyle_ Sir Arthur Conan.prc
Extracting text...
Adding anchors...
Extracting images...
Cleaning up HTML...
Parsing HTML...
Malformed markup, parsing using BeautifulSoup
MOBI markup appears to contain random bytes. Stripping.
Extracting text...
Python function terminated unexpectedly
(Error Code: 1)
Traceback (most recent call last):
  File "site.py", line 103, in main
  File "site.py", line 85, in run_entry_point
  File "site-packages\calibre\utils\ipc\worker.py", line 107, in main
  File "site-packages\calibre\gui2\convert\gui_conversion.py", line 24, in gui_convert
  File "site-packages\calibre\ebooks\conversion\plumber.py", line 832, in run
  File "site-packages\calibre\customize\conversion.py", line 216, in __call__
  File "site-packages\calibre\ebooks\mobi\input.py", line 28, in convert
  File "site-packages\calibre\ebooks\mobi\reader.py", line 297, in extract_content
  File "site-packages\calibre\ebooks\mobi\reader.py", line 732, in extract_text
MemoryError
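
In case it helps anyone narrow this down, here is a minimal sketch (not part of calibre) for checking whether the .prc really is a Mobipocket book and how many records it claims to hold. Since Book Designer mentioned a "palm file", it assumes the file starts with the standard 78-byte Palm database header. The helper name `read_palm_header` and the path in the usage comment are made up for illustration.

```python
import struct

PALM_HEADER_SIZE = 78  # assumed: standard fixed-size Palm database header


def read_palm_header(data):
    """Parse the fixed Palm database header from the first bytes of a file.

    Hypothetical helper for illustration only; this is not a calibre API.
    """
    if len(data) < PALM_HEADER_SIZE:
        raise ValueError("too short to be a Palm database")
    # Bytes 0-31: NUL-padded database name.
    name = data[:32].split(b"\x00", 1)[0].decode("latin-1")
    # Bytes 60-63 and 64-67: four-character type and creator codes.
    db_type = data[60:64].decode("latin-1")
    creator = data[64:68].decode("latin-1")
    # Bytes 76-77: number of records, big-endian unsigned short.
    (num_records,) = struct.unpack(">H", data[76:78])
    return {
        "name": name,
        "type": db_type,
        "creator": creator,
        "num_records": num_records,
        # Mobipocket files carry type "BOOK" and creator "MOBI".
        "is_mobipocket": db_type + creator == "BOOKMOBI",
    }

# Usage (the path is a placeholder):
# with open("book.prc", "rb") as f:
#     print(read_palm_header(f.read(PALM_HEADER_SIZE)))
```

If `is_mobipocket` comes back false, or the record count looks implausible for a 41 MB file, the file itself may be damaged, which would fit the "random bytes" message in the log. That is only a guess on my part.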