#1 |
Junior Member
Posts: 8
Karma: 10
Join Date: Oct 2010
Device: none
|
can't open e-book with e-book reader and other stuff
I waited at least 30 minutes before terminating the e-reader. I was trying to open a 41 MB .prc file. I then tried to open the same file in Book Designer: a message containing the words "palm file" flew by before it choked and would not, could not, open the book.
For the past several days I have tried to convert this book to a text file. Each attempt goes through the same stages (extracting text, extracting images, extracting text again, parsing HTML, malformed markup, parsing with BeautifulSoup), then Python stops unexpectedly and a traceback is logged, ending in MemoryError. I also tried converting to MOBI, EPUB, PDB, etc. DRM was not an issue. The following result log should give you an idea of what's been happening. Any ideas would be appreciated.

ERROR: Conversion Error: Failed: Convert book 1 of 1 (My Detective Library - including 300 ''cases'' (works) by 33 greatest authors of mystery and crime fiction)

Convert book 1 of 1 (My Detective Library - including 300 ''cases'' (works) by 33 greatest authors of mystery and crime fiction)
Resolved conversion options
calibre version: 0.7.24
{'asciiize': False, 'author_sort': None, 'authors': None, 'base_font_size': 0.0, 'book_producer': None, 'change_justification': u'original', 'chapter': u"//*[((name()='h1' or name()='h2') and re:test(., 'chapter|book|section|part|prologue|epilogue\\s+', 'i')) or @class = 'chapter']", 'chapter_mark': u'pagebreak', 'comments': None, 'cover': None, 'debug_pipeline': u'C:/Documents and Settings/KlausDieterDill/My Documents/My eBooks', 'disable_font_rescaling': False, 'extra_css': None, 'font_size_mapping': None, 'footer_regex': u'(?i)(?<=<hr>)((\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?\\d+<br>\\s*.*?\\s*)|(\\s* <a name=\\d+></a>((<img.+?>)*<br>\\s*)?.*?<br>\\s*\\d+))(?=<br>)' , 'force_max_line_length': False, 'header_regex': u'(?i)(?<=<hr>)((\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?\\d+<br>\\s*.*?\\s*)|(\\s* <a name=\\d+></a>((<img.+?>)*<br>\\s*)?.*?<br>\\s*\\d+))(?=<br>)' , 'html_unwrap_factor': 0.40000000000000002, 'inline_toc': False, 'input_encoding': None, 'input_profile': <calibre.customize.profiles.MobipocketInput object at 0x03BF9A90>, 'insert_blank_line': False, 'insert_metadata': False, 'isbn': None, 'keep_ligatures': False, 'language': None, 'level1_toc': None, 'level2_toc': None, 'level3_toc': None, 'line_height': 0.0, 'linearize_tables': False, 'margin_bottom': 5.0, 'margin_left': 5.0, 'margin_right': 5.0, 'margin_top': 5.0, 'max_line_length': 0, 'max_toc_links': 50, 'newline': u'system', 'no_chapters_in_toc': False, 'no_inline_navbars': False, 'output_encoding': 'utf-8', 'output_profile': <calibre.customize.profiles.JetBook5Output object at 0x03BF9C50>, 'page_breaks_before': u"//*[name()='h1' or name()='h2']", 'prefer_metadata_cover': False, 'preprocess_html': False, 'pretty_print': False, 'pubdate': None, 'publisher': None, 'rating': None, 'read_metadata_from_opf': 'c:\\docume~1\\klausd~1\\locals~1\\temp\\calibre_0.7.24_tmp_k0jxcl\\calibre_0.7.24_kwis1p.opf', 'remove_first_image': False, 'remove_footer': False, 'remove_header': False, 'remove_paragraph_spacing': False, 'remove_paragraph_spacing_indent_size': 1.5, 'series': None, 'series_index': None, 'smarten_punctuation': False, 'tags': None, 'timestamp': None, 'title': None, 'title_sort': None, 'toc_filter': None, 'toc_threshold': 6, 'use_auto_toc': False, 'verbose': 2}
InputFormatPlugin: MOBI Input running on C:\Documents and Settings\KlausDieterDill\Calibre Library\Doyle_ Sir Arthur Conan\My Detective Library - including 300 ''c (2)\My Detective Library - including 300 ''c - Doyle_ Sir Arthur Conan.prc
Extracting text...
Adding anchors...
Extracting images...
Cleaning up HTML...
Parsing HTML...
Malformed markup, parsing using BeautifulSoup
MOBI markup appears to contain random bytes. Stripping.
Extracting text...
Python function terminated unexpectedly (Error Code: 1)
Traceback (most recent call last):
  File "site.py", line 103, in main
  File "site.py", line 85, in run_entry_point
  File "site-packages\calibre\utils\ipc\worker.py", line 107, in main
  File "site-packages\calibre\gui2\convert\gui_conversion.py", line 24, in gui_convert
  File "site-packages\calibre\ebooks\conversion\plumber.py", line 832, in run
  File "site-packages\calibre\customize\conversion.py", line 216, in __call__
  File "site-packages\calibre\ebooks\mobi\input.py", line 28, in convert
  File "site-packages\calibre\ebooks\mobi\reader.py", line 297, in extract_content
  File "site-packages\calibre\ebooks\mobi\reader.py", line 732, in extract_text
MemoryError
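The "palm file" message and the "MOBI Input" line in the log both suggest the .prc is a Palm Database container. Before retrying a conversion, it can help to check that the file header is intact. The following is a minimal sketch, assuming the standard 78-byte Palm Database header layout (32-byte name, type and creator codes at bytes 60 to 67, record count at bytes 76 to 77). `read_palmdb_header` is a hypothetical helper written for illustration; it is not part of calibre.

```python
import struct

PALMDB_HEADER_SIZE = 78  # fixed-size header at the start of a Palm Database file


def read_palmdb_header(data: bytes) -> dict:
    """Parse the fixed Palm Database header (hypothetical helper, not a calibre API)."""
    if len(data) < PALMDB_HEADER_SIZE:
        raise ValueError("too short to be a Palm Database file")
    # Bytes 0-31: NUL-padded database name.
    name = data[:32].split(b"\x00", 1)[0].decode("latin-1")
    # Bytes 60-67: 4-byte type followed by 4-byte creator, e.g. "BOOKMOBI".
    type_creator = data[60:68].decode("latin-1")
    # Bytes 76-77: number of records, big-endian unsigned short.
    (num_records,) = struct.unpack(">H", data[76:78])
    return {
        "name": name,
        "type_creator": type_creator,
        "num_records": num_records,
        "is_mobi": type_creator == "BOOKMOBI",
    }


def describe_file(path: str) -> dict:
    """Read only the header bytes from disk, so even a very large file is cheap to check."""
    with open(path, "rb") as handle:
        return read_palmdb_header(handle.read(PALMDB_HEADER_SIZE))
```

Calling `describe_file` on the .prc reads only 78 bytes, so it should not hit the memory limit the conversion ran into. A type/creator other than `BOOKMOBI` would hint that the file is not a Mobipocket book at all.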
#2 |
US Navy, Retired
Posts: 9,889
Karma: 13806776
Join Date: Feb 2009
Location: North Carolina
Device: Icarus Illumina XL HD, Kindle PaperWhite SE 11th Gen
|
To view a book, calibre's viewer first converts it to HTML, which we know you weren't able to do.
I would install Kindle for PC and try to read it with that application, or read it with Mobipocket Reader Desktop.
#3 | |
Wizard
Posts: 3,130
Karma: 91256
Join Date: Feb 2008
Location: Germany
Device: Cybook Gen3
|
Quote:
|
|
#4 |
Guru
Posts: 657
Karma: 64171
Join Date: Sep 2010
Location: Kent, England, Sol 3, ZZ9 plural Z Alpha
Device: Sony PRS-300, Kobo Aura HD, iPad (Marvin)
|
How I'd try to get round the problem (no idea if this will work, as I haven't tried it myself):
If you do a convert with the Debug option, does the first stage (input) get created? If so, you may be able to create, say, two copies of the whole directory, suffixed with, say, p1 and p2. Then go into p1, try to identify the main text, and edit it to remove the second half (edit: of the text) at a reasonable point. Then zip the directory, import it into calibre and convert to your preferred format. Then repeat with p2, removing the first half of the text at the same point you used in p1, and again import into calibre and convert. If you leave all images, CSS etc. alone and remove only the HTML text, all embedded images etc. should still work. Any links which jump from the first half to the second and vice versa will no longer work, but with a merge of the converted text and some editing they could be put right as well. Any ideas whether this would work for the OP?
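The split described above could be scripted rather than done by hand. This is only a sketch under stated assumptions: the debug output contains a single HTML file with one `<body>`, and its paragraphs are plain `<p>...</p>` elements that are not nested inside other block elements. `split_html_text` is a hypothetical helper, not a calibre function, and like the suggestion itself it is untested against the actual book.

```python
import re
from typing import Tuple


def split_html_text(html: str) -> Tuple[str, str]:
    """Split one HTML document into two at the paragraph end nearest the middle of <body>.

    Both halves keep everything outside the body text (the <head>, CSS links, closing
    tags), so only the text itself is divided between the p1 and p2 copies.
    """
    body_open = re.search(r"<body[^>]*>", html, re.IGNORECASE)
    body_close = re.search(r"</body\s*>", html, re.IGNORECASE)
    if not body_open or not body_close:
        raise ValueError("no <body> element found")
    start, end = body_open.end(), body_close.start()
    body = html[start:end]

    # Candidate cut points: the position just after each closing </p> tag.
    cuts = [match.end() for match in re.finditer(r"</p\s*>", body, re.IGNORECASE)]
    if not cuts:
        raise ValueError("no paragraph boundary to split at")

    # Pick the paragraph boundary closest to the midpoint of the body.
    middle = len(body) // 2
    cut = min(cuts, key=lambda pos: abs(pos - middle))

    prefix, suffix = html[:start], html[end:]
    return prefix + body[:cut] + suffix, prefix + body[cut:] + suffix
```

The first returned string would replace the HTML file in the p1 directory and the second the one in p2, with images and CSS left untouched in both. Links that cross the cut point would still break, as noted above.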
#5 | |
Junior Member
Posts: 8
Karma: 10
Join Date: Oct 2010
Device: none
|
Quote:
|
|
#6 | |
Junior Member
Posts: 8
Karma: 10
Join Date: Oct 2010
Device: none
|
Quote:
I have also noticed that there are two calibre-parallel.exe processes running alongside calibre.exe. One calibre-parallel uses around 30 MB of memory, and the other uses over 200 MB and climbs during the conversion process. Any ideas why? Maybe a calibre developer will step in and answer this question, but I would like to hear your take on the issue. Cheers.
|
#7 |
Junior Member
Posts: 8
Karma: 10
Join Date: Oct 2010
Device: none
|
<quote>If you do a convert with Debug option, does the first stage (input) get created.</quote>
Yes, the debug file gets created, but there is nothing in it except for the explanations of the different processes(?) and what they do. Thanks for your input.
#8 | ||
Wizard
Posts: 3,130
Karma: 91256
Join Date: Feb 2008
Location: Germany
Device: Cybook Gen3
|
Quote:
RAM usage would of course increase over the course of the conversion, since the conversion process generates new data while running; otherwise it would be quite useless...
Quote:
|
||