|
|
#1 |
|
Junior Member
![]() Posts: 8
Karma: 10
Join Date: Jul 2021
Device: none
|
Trying to convert a book from Amazon. I'm not sure what file format my Kindle 1.117 app downloads.
At first when converting the file to EPUB Calibre gave an error message that reads: The splits were too big for some EPUB readers, so I tried other file types, but the only thing that comes out right is the cover page. Other than that, instead of the text from the book, it's just pages upon pages of random characters, like when you try to open an image file with notepad. Then I tried heuristics enabled but that didn't work, then I set the split size to the size of the whole file and I didn't get an error message but I did get the random characters. Code:
Convert book 1 of 1 (Volume 1)
DeDRM v7.2.1: Trying to decrypt i9t3knr0.mobi
Using Library AlfCrypto DLL/DYLIB/SO
Using Library AlfCrypto DLL/DYLIB/SO
MobiDeDrm v1.0.
Copyright © 2008-2020 The Dark Reverser, Apprentice Harper et al.
Decrypting Mobipocket 4 ebook: Volume 1
Got DSN key from database default_key
Found 4 keys to try after 0.3 seconds
Crypto Type is: 0
This book is not encrypted.
Decryption succeeded after 0.3 seconds
DeDRM v7.2.1: Finished after 0.5 seconds
Conversion options changed from defaults:
verbose: 2
read_metadata_from_opf: 'C:\\...\\calibre_vycg3n56\\m738vugq.opf'
output_profile: 'generic_eink'
cover: 'C:\\...\\calibre_vycg3n56\\wvk9dzg4.jpeg'
Resolved conversion options
calibre version: 5.23.0
{'asciiize': False,
'author_sort': None,
'authors': None,
'base_font_size': 0.0,
'book_producer': None,
'change_justification': 'original',
'chapter': "//*[((name()='h1' or name()='h2') and re:test(., "
"'\\s*((chapter|book|section|part)\\s+)|((prolog|prologue|epilogue)(\\s+|$))', "
"'i')) or @class = 'chapter']",
'chapter_mark': 'pagebreak',
'comments': None,
'cover': 'C:\\...\\calibre_vycg3n56\\wvk9dzg4.jpeg',
'debug_pipeline': None,
'dehyphenate': True,
'delete_blank_paragraphs': True,
'disable_font_rescaling': False,
'dont_split_on_page_breaks': False,
'duplicate_links_in_toc': False,
'embed_all_fonts': False,
'embed_font_family': None,
'enable_heuristics': False,
'epub_flatten': False,
'epub_inline_toc': False,
'epub_toc_at_end': False,
'epub_version': '2',
'expand_css': False,
'extra_css': None,
'extract_to': None,
'filter_css': '',
'fix_indents': True,
'flow_size': 260,
'font_size_mapping': None,
'format_scene_breaks': True,
'html_unwrap_factor': 0.4,
'input_encoding': None,
'input_profile': <calibre.customize.profiles.InputProfile object at 0x07BFD730>,
'insert_blank_line': False,
'insert_blank_line_size': 0.5,
'insert_metadata': False,
'isbn': None,
'italicize_common_cases': True,
'keep_ligatures': False,
'language': None,
'level1_toc': None,
'level2_toc': None,
'level3_toc': None,
'line_height': 0.0,
'linearize_tables': False,
'margin_bottom': 5.0,
'margin_left': 5.0,
'margin_right': 5.0,
'margin_top': 5.0,
'markup_chapter_headings': True,
'max_toc_links': 50,
'minimum_line_height': 120.0,
'no_chapters_in_toc': False,
'no_default_epub_cover': False,
'no_inline_navbars': False,
'no_svg_cover': False,
'output_profile': <calibre.customize.profiles.GenericEink object at 0x07BFD898>,
'page_breaks_before': "//*[name()='h1' or name()='h2']",
'prefer_metadata_cover': False,
'preserve_cover_aspect_ratio': False,
'pretty_print': True,
'pubdate': None,
'publisher': None,
'rating': None,
'read_metadata_from_opf': 'C:\\...\\calibre_vycg3n56\\m738vugq.opf',
'remove_fake_margins': True,
'remove_first_image': False,
'remove_paragraph_spacing': False,
'remove_paragraph_spacing_indent_size': 1.5,
'renumber_headings': True,
'replace_scene_breaks': '',
'search_replace': '[]',
'series': None,
'series_index': None,
'smarten_punctuation': False,
'sr1_replace': None,
'sr1_search': None,
'sr2_replace': None,
'sr2_search': None,
'sr3_replace': None,
'sr3_search': None,
'start_reading_at': None,
'subset_embedded_fonts': False,
'tags': None,
'timestamp': None,
'title': None,
'title_sort': None,
'toc_filter': None,
'toc_threshold': 6,
'toc_title': None,
'transform_css_rules': '[]',
'unsmarten_punctuation': False,
'unwrap_lines': True,
'use_auto_toc': False,
'verbose': 2}
DeDRM v7.2.1: Trying to decrypt pe9u4_op.mobi
MobiDeDrm v1.0.
Copyright © 2008-2020 The Dark Reverser, Apprentice Harper et al.
Decrypting Mobipocket 4 ebook: Second Language Acquisition: Volume 1
Got DSN key from database default_key
Found 4 keys to try after 0.2 seconds
Crypto Type is: 0
This book is not encrypted.
Decryption succeeded after 0.2 seconds
DeDRM v7.2.1: Finished after 0.5 seconds
InputFormatPlugin: MOBI Input running
on C:\...\calibre_vycg3n56\7ognbxx0.mobi
Extracting text...
Adding anchors...
Extracting images...
Cleaning up HTML...
Parsing HTML...
Malformed markup, parsing using html5-parser
Converting style information to CSS...
Creating OPF...
Parsing all content...
Parsing index.html ...
Initial parse failed, using more forgiving parsers
Parsing index.html as HTML
HTML 5 parsing failed, falling back to older parsers
Traceback (most recent call last):
File "calibre\ebooks\oeb\parse_utils.py", line 211, in parse_html
File "calibre\utils\xml_parse.py", line 27, in safe_xml_fromstring
File "src/lxml/etree.pyx", line 3237, in lxml.etree.fromstring
File "src/lxml/parser.pxi", line 1896, in lxml.etree._parseMemoryDocument
File "src/lxml/parser.pxi", line 1777, in lxml.etree._parseDoc
File "src/lxml/parser.pxi", line 1082, in lxml.etree._BaseParser._parseUnicodeDoc
File "src/lxml/parser.pxi", line 615, in lxml.etree._ParserContext._handleParseResultDoc
File "src/lxml/parser.pxi", line 725, in lxml.etree._handleParseResult
File "src/lxml/parser.pxi", line 654, in lxml.etree._raiseParseError
File "<string>", line 724
lxml.etree.XMLSyntaxError: Attribute _ redefined, line 724, column 675
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "calibre\ebooks\oeb\parse_utils.py", line 218, in parse_html
File "calibre\utils\xml_parse.py", line 27, in safe_xml_fromstring
File "src/lxml/etree.pyx", line 3237, in lxml.etree.fromstring
File "src/lxml/parser.pxi", line 1896, in lxml.etree._parseMemoryDocument
File "src/lxml/parser.pxi", line 1777, in lxml.etree._parseDoc
File "src/lxml/parser.pxi", line 1082, in lxml.etree._BaseParser._parseUnicodeDoc
File "src/lxml/parser.pxi", line 615, in lxml.etree._ParserContext._handleParseResultDoc
File "src/lxml/parser.pxi", line 725, in lxml.etree._handleParseResult
File "src/lxml/parser.pxi", line 654, in lxml.etree._raiseParseError
File "<string>", line 724
lxml.etree.XMLSyntaxError: Attribute _ redefined, line 724, column 675
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "calibre\ebooks\oeb\parse_utils.py", line 224, in parse_html
File "calibre\ebooks\oeb\parse_utils.py", line 105, in html5_parse
ValueError: HTML 5 parsing resulted in a tree with nesting depth > 100
Forcing index.html into XHTML namespace
Parsing styles.css ...
Generating default TOC from spine...
Merging user specified metadata...
Detecting structure...
Auto generated TOC with 0 entries.
Flattening CSS and remapping font sizes...
Source base font size is 12.00000pt
Removing fake margins...
Found 1 items of level: p_75
Ignoring level p_75
Cleaning up manifest...
Trimming unused files from manifest...
Trimming 'images/00002.jpg' from manifest
Trimming 'images/00001.jpg' from manifest
Creating EPUB Output...
Rescaling image from 600x857 to 526x751 cover.jpeg
Splitting markup on page breaks and flow limits, if any...
Looking for large trees in index.html...
Found large tree #0
Splitting...
Split point: {http://www.w3.org/1999/xhtml}wc__________________________v______w__gk___________w__w__y________h4__e__i____________n /*/*[2]/*[54]/*/*/*/*/*/*/*[2]/*/*/*/*/*/*[2]/*/*/*/*
Split tree still too large: 8925 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}u____h5__ /*/*[2]/*[24]/*
Split tree still too large: 565 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}m____xj___r________r_______i_________wd____________sh3____oxu____xp__1w____fq________o___x_________ /*/*[2]/*[16]
Committed sub-tree #1 (231 KB)
Split tree still too large: 334 KB
Splitting...
Traceback (most recent call last):
File "runpy.py", line 194, in _run_module_as_main
File "runpy.py", line 87, in _run_code
File "site.py", line 82, in <module>
File "site.py", line 77, in main
File "site.py", line 49, in run_entry_point
File "calibre\utils\ipc\worker.py", line 216, in main
File "calibre\gui2\convert\gui_conversion.py", line 41, in gui_convert_override
File "calibre\gui2\convert\gui_conversion.py", line 28, in gui_convert
File "calibre\ebooks\conversion\plumber.py", line 1271, in run
File "calibre\ebooks\conversion\plugins\epub_output.py", line 207, in convert
File "calibre\ebooks\oeb\transforms\split.py", line 66, in __call__
File "calibre\ebooks\oeb\transforms\split.py", line 75, in split_item
File "calibre\ebooks\oeb\transforms\split.py", line 224, in __init__
File "calibre\ebooks\oeb\transforms\split.py", line 372, in split_to_size
File "calibre\ebooks\oeb\transforms\split.py", line 372, in split_to_size
File "calibre\ebooks\oeb\transforms\split.py", line 372, in split_to_size
File "calibre\ebooks\oeb\transforms\split.py", line 350, in split_to_size
calibre.ebooks.oeb.transforms.split.SplitError: Could not find reasonable point at which to split: index.html Sub-tree size: 334 KB
Last edited by Manhattan; 07-24-2021 at 10:40 AM. |
|
|
|
|
|
#2 |
|
Junior Member
![]() Posts: 8
Karma: 10
Join Date: Jul 2021
Device: none
|
Tried converting to TXT
Here's an excerpt from when I tried converting it to TXT:
Spoiler:
Last edited by theducks; 07-24-2021 at 11:10 AM. Reason: spoilered. |
|
|
|
| Advert | |
|
|
|
|
#3 |
|
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 13,702
Karma: 79983758
Join Date: Nov 2007
Location: Toronto
Device: Libra H2O, Libra Colour
|
Maybe share the book's title?
Sent from my Pixel 4a using Tapatalk |
|
|
|
|
|
#4 | |
|
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,163
Karma: 92500001
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
|
Quote:
You should be able to use the KindleUnpack plugin to extract the PDF from it. |
|
|
|
|
|
|
#5 | |
|
Junior Member
![]() Posts: 8
Karma: 10
Join Date: Jul 2021
Device: none
|
How to use the KindleUnpack Plugin?
I installed the KUP but that didn't change anything. I also tried using the standalone Kindle Unpack on the files in my Calibre Library folder, but I got an error message
Quote:
|
|
|
|
|
| Advert | |
|
|
|
|
#6 |
|
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,163
Karma: 92500001
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
|
|
|
|
|
|
|
#7 |
|
Junior Member
![]() Posts: 8
Karma: 10
Join Date: Jul 2021
Device: none
|
It's a textbook purchased from amazon. When viewed through the kindle app, it navigates a little weirdly. It doesn't flow smoothly like one long document. Instead it lists each page discretely. The inability to scroll and move throughout the book freely and the inability to adjust any of the viewing settings (e.g. text reflow, page and text color, etc.) is why I wanted to change it to another format. But now I have a feeling that's just not possible.
|
|
|
|
|
|
#8 |
|
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,163
Karma: 92500001
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
|
Even if you manage to extract the PDF from the Kindle format you will find that conversion from that to a reflowable format is problematic. See the thread Read this before Posting PDF Questions.
|
|
|
|
|
|
#9 |
|
Junior Member
![]() Posts: 8
Karma: 10
Join Date: Jul 2021
Device: none
|
Are Amazon eBooks PDF?
I see!
Thanks. The issue of PDF reflow aside, I would still like to find a way convert the Amazon eBook to something I can use outside the Kindle. Last edited by Manhattan; 07-26-2021 at 12:15 AM. |
|
|
|
|
|
#10 |
|
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 31,271
Karma: 61916422
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
NO!
Not ALL Amazon books are the same internals. Visit our Wiki here at MR. https://wiki.mobileread.com/wiki/E-book_formats and you will see many were used over time and need. |
|
|
|
|
|
#11 | |
|
Still reading
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 15,032
Karma: 111111255
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
|
Quote:
The default behaviour for PDFs and all ebooks on anything is a page at a time. PDFs are a portable way of proofing or reading material designed for a particular size of paper. Real ebooks (most Amazon and epub) are designed that the virtual "paper page" is exactly the size of the device screen or an application window on a desktop/laptop OS. |
|
|
|
|
|
|
#12 |
|
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 5,732
Karma: 20469902
Join Date: Oct 2014
Location: Lockport, IL
Device: Kindle PW4, Kindle Paperwhite Signature Edition
|
I believe that at least the Kindle app on Android in theory* supports continuous scrolling, though presumably not for fixed layout books.
*I don't know if it works, it's not my cup of tea for books. For websites, it's kind of a necessary evil since they made that design decision some time in the very very early days of web browsers and changing it would potentially break stuff, at the very least the general user interface people expect. |
|
|
|
|
|
#13 | |
|
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,163
Karma: 92500001
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
|
Quote:
The failure of KindleUnpack to extract the PDF suggests that something is wrong with the copy of the book you are working with. I suggest you try downloading and copying it again. Perhaps the original copy was corrupted in some way. If that does not help then sharing the ASIN of the book here would give us the ability to check it out. Without more information it is hard to make other suggestions. |
|
|
|
|
|
|
#14 |
|
Resident Curmudgeon
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 80,825
Karma: 150249619
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
My guess is maybe the DRM was not removed.
|
|
|
|
|
|
#15 |
|
Bibliophagist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 48,276
Karma: 174315444
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
Are you suspecting that this is one of Amazon's new KFX DRMmed textbooks? I would disagree since the OP's post shows the DeDRM plugin as reporting that the book was not DRMmed.
Edit: I finally located the message which seemed similar to the OPs from a few months back. See After converting a mobi file to pdf, the pdf is gibberish. . I suspect the OP might be better off asking at Apprentice Alf's blog. Last edited by DNSB; 07-26-2021 at 04:00 PM. |
|
|
|
![]() |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Could not find reasonable point at which to split - revisited | stoneguy | Conversion | 1 | 12-23-2019 04:42 PM |
| Can't split at chosen point | dkplayaclub | Editor | 14 | 05-11-2015 02:18 PM |
| Find, replace, split chapter? | laelaps | Sigil | 8 | 01-20-2011 05:01 PM |
| Error: Could not find reasonable point at which to split | MTR | Calibre | 7 | 10-09-2009 12:03 PM |
| Could not find reasonable point at which to split | scgf | Calibre | 20 | 09-14-2009 02:19 PM |