|
|
#1 |
|
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 793
Karma: 1538394
Join Date: Sep 2013
Device: Kobo Sage
|
Calibre 6.11: Conversion from AZW3 to EPUB3 Removing Used Images
The conversion process in Calibre 6.11 seems to be removing an image that is actually in use in a book. This is the third book in a row, now that it's happened. I wasn't sure about the previous two times, but I paid more attention this time. This particular book is Alexander McCall Smith's "Morality for Beautiful Girls" (the third in his No. 1 Ladies' Detective Agency series):
https://www.amazon.com/gp/product/B0...w_myk_ro_title The previous two times were for the first and second books in the series. I downloaded a fresh copy of the AZW3 from my library at Amazon, Added it to my working Calibre library and converted it individually to an EPUB3. Immediately afterward, I edited the result and noted broken image links (specifically, those used in the Chapter titles). Comparing the list of images in the AZW3 to those present in the EPUB3 I get: 00013.jpeg (in use in chapter titles, but missing from EPUB) 00014.jpeg (in use in title page, present in EPUB) 00015.jpeg (in use as the cover page, renamed to cover.jpeg in EPUB) 00016.jpeg (in use in About the Author file, present in EPUB) 00019.jpeg (in use in end of book ads, present in EPUB) 00020.jpeg (this and all the rest not used and correctly removed) 00021.jpeg 00022.jpeg 00023.jpeg 00025.jpeg So, it's just the image used in the chapter titles that incorrectly removed (ditto for the previous two books). The lines in the source chapters referencing that image look fine to me: Code:
<p class="cn"><span id="c02"></span><b>CHAPTER TWO</b><br/><img src="../images/00013.jpeg" alt="image"/></p> The Job Details for the conversion follows. But, it looks to me like it's just correctly handling the cover and trimming the unused images. I don't see any reference to 00013.jpeg in there (but, I might have missed it): Code:
Convert book 1 of 1 (Morality for Beautiful Girls)
DeDRM v10.0.3: Trying to decrypt rjbj_mx1.azw3
Using Library AlfCrypto DLL/DYLIB/SO
MobiDeDrm v1.1.
Copyright © 2008-2020 The Dark Reverser, Apprentice Harper et al.
Decrypting Kindle Format 8 ebook: Morality for Beautiful Girls (No 1. Ladies' Detective Agency Book 3)
Found 6 keys to try after 0.0 seconds
Crypto Type is: 0
This book is not encrypted.
Decryption succeeded after 0.0 seconds
DeDRM v10.0.3: Finished after 0.0 seconds
Conversion options changed from defaults:
replace_scene_breaks: '<hr />'
margin_bottom: 0.0
enable_heuristics: True
margin_top: 0.0
markup_chapter_headings: False
verbose: 2
cover: 'C:\\Users\\Dave\\AppData\\Local\\Temp\\calibre_0q52gfog\\ugbrj2s9.jpeg'
output_profile: 'tablet'
smarten_punctuation: True
remove_first_image: True
expand_css: True
minimum_line_height: 0.0
insert_metadata: True
filter_css: 'line-height'
read_metadata_from_opf: 'C:\\Users\\Dave\\AppData\\Local\\Temp\\calibre_0q52gfog\\7mq1buiz.opf'
disable_font_rescaling: True
epub_version: '3'
margin_right: 0.0
preserve_cover_aspect_ratio: True
margin_left: 0.0
Resolved conversion options
calibre version: 6.11.0
{'asciiize': False,
'author_sort': None,
'authors': None,
'base_font_size': 0.0,
'book_producer': None,
'change_justification': 'original',
'chapter': "//*[((name()='h1' or name()='h2') and re:test(., "
"'\\s*((chapter|book|section|part)\\s+)|((prolog|prologue|epilogue)(\\s+|$))', "
"'i')) or @class = 'chapter']",
'chapter_mark': 'pagebreak',
'comments': None,
'cover': 'C:\\Users\\Dave\\AppData\\Local\\Temp\\calibre_0q52gfog\\ugbrj2s9.jpeg',
'debug_pipeline': None,
'dehyphenate': True,
'delete_blank_paragraphs': True,
'disable_font_rescaling': True,
'dont_split_on_page_breaks': False,
'duplicate_links_in_toc': False,
'embed_all_fonts': False,
'embed_font_family': None,
'enable_heuristics': True,
'epub_flatten': False,
'epub_inline_toc': False,
'epub_toc_at_end': False,
'epub_version': '3',
'expand_css': True,
'extra_css': None,
'extract_to': None,
'filter_css': 'line-height',
'fix_indents': True,
'flow_size': 260,
'font_size_mapping': None,
'format_scene_breaks': True,
'html_unwrap_factor': 0.4,
'input_encoding': None,
'input_profile': <calibre.customize.profiles.InputProfile object at 0x000001212ADDDED0>,
'insert_blank_line': False,
'insert_blank_line_size': 0.5,
'insert_metadata': True,
'isbn': None,
'italicize_common_cases': True,
'keep_ligatures': False,
'language': None,
'level1_toc': None,
'level2_toc': None,
'level3_toc': None,
'line_height': 0.0,
'linearize_tables': False,
'margin_bottom': 0.0,
'margin_left': 0.0,
'margin_right': 0.0,
'margin_top': 0.0,
'markup_chapter_headings': False,
'max_toc_links': 50,
'minimum_line_height': 0.0,
'no_chapters_in_toc': False,
'no_default_epub_cover': False,
'no_inline_navbars': False,
'no_svg_cover': False,
'output_profile': <calibre.customize.profiles.TabletOutput object at 0x000001212ADDC9D0>,
'page_breaks_before': "//*[name()='h1' or name()='h2']",
'prefer_metadata_cover': False,
'preserve_cover_aspect_ratio': True,
'pretty_print': True,
'pubdate': None,
'publisher': None,
'rating': None,
'read_metadata_from_opf': 'C:\\Users\\Dave\\AppData\\Local\\Temp\\calibre_0q52gfog\\7mq1buiz.opf',
'remove_fake_margins': True,
'remove_first_image': True,
'remove_paragraph_spacing': False,
'remove_paragraph_spacing_indent_size': 1.5,
'renumber_headings': True,
'replace_scene_breaks': '<hr />',
'search_replace': '[]',
'series': None,
'series_index': None,
'smarten_punctuation': True,
'sr1_replace': None,
'sr1_search': None,
'sr2_replace': None,
'sr2_search': None,
'sr3_replace': None,
'sr3_search': None,
'start_reading_at': None,
'subset_embedded_fonts': False,
'tags': None,
'timestamp': None,
'title': None,
'title_sort': None,
'toc_filter': None,
'toc_threshold': 6,
'toc_title': None,
'transform_css_rules': '[]',
'transform_html_rules': '[]',
'unsmarten_punctuation': False,
'unwrap_lines': True,
'use_auto_toc': False,
'verbose': 2}
DeDRM v10.0.3: Trying to decrypt jc2h9k1y.azw3
MobiDeDrm v1.1.
Copyright © 2008-2020 The Dark Reverser, Apprentice Harper et al.
Decrypting Kindle Format 8 ebook: Morality for Beautiful Girls (No 1. Ladies' Detective Agency Book 3)
Found 6 keys to try after 0.0 seconds
Crypto Type is: 0
This book is not encrypted.
Decryption succeeded after 0.0 seconds
DeDRM v10.0.3: Finished after 0.0 seconds
InputFormatPlugin: MOBI Input running
on C:\Users\Dave\AppData\Local\Temp\calibre_0q52gfog\dmzu44me.azw3
Found KF8 MOBI of type 'standalone'
Extracting text...
Parsing all content...
Parsing text/part0000.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0018.html ...
********* Heuristic processing HTML *********
There are 4 blank lines. 0.042105263157894736 percent blank
Hard line breaks check returned False
Median line length is 100, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing text/part0025.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing styles/0002.css ...
Parsing text/part0013.html ...
********* Heuristic processing HTML *********
There are 2 blank lines. 0.022988505747126436 percent blank
Hard line breaks check returned False
Median line length is 141, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing text/part0022.html ...
********* Heuristic processing HTML *********
There are 1 blank lines. 0.0196078431372549 percent blank
Hard line breaks check returned False
Median line length is 100, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing text/part0005.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned False
Median line length is 94, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing text/part0009.html ...
********* Heuristic processing HTML *********
There are 1 blank lines. 0.025 percent blank
Hard line breaks check returned False
Median line length is 123, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing text/part0015.html ...
********* Heuristic processing HTML *********
There are 2 blank lines. 0.025974025974025976 percent blank
Hard line breaks check returned False
Median line length is 161, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing text/part0019.html ...
********* Heuristic processing HTML *********
There are 3 blank lines. 0.03614457831325301 percent blank
Hard line breaks check returned False
Median line length is 114, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing text/part0027.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0014.html ...
********* Heuristic processing HTML *********
There are 3 blank lines. 0.046153846153846156 percent blank
Hard line breaks check returned False
Median line length is 107, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing styles/0005.css ...
Parsing text/part0001.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned True
Median line length is 48, calculated with html format
Unwrapping required, unwrapping Lines
Fixing hyphenated content
Formatting scene breaks
Parsing text/part0002.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned True
Median line length is 91, calculated with html format
Unwrapping required, unwrapping Lines
Fixing hyphenated content
Formatting scene breaks
Parsing text/part0010.html ...
********* Heuristic processing HTML *********
There are 4 blank lines. 0.07142857142857142 percent blank
Hard line breaks check returned False
Median line length is 190, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing text/part0016.html ...
********* Heuristic processing HTML *********
There are 1 blank lines. 0.015384615384615385 percent blank
Hard line breaks check returned False
Median line length is 116, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing text/part0003.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0007.html ...
********* Heuristic processing HTML *********
There are 1 blank lines. 0.020833333333333332 percent blank
Hard line breaks check returned False
Median line length is 100, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing text/part0021.html ...
********* Heuristic processing HTML *********
There are 3 blank lines. 0.045454545454545456 percent blank
Hard line breaks check returned False
Median line length is 124, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing styles/0003.css ...
Parsing styles/0004.css ...
Parsing text/part0006.html ...
********* Heuristic processing HTML *********
There are 1 blank lines. 0.012195121951219513 percent blank
Hard line breaks check returned False
Median line length is 137, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing text/part0008.html ...
********* Heuristic processing HTML *********
There are 3 blank lines. 0.023076923076923078 percent blank
Hard line breaks check returned False
Median line length is 159, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing text/part0011.html ...
********* Heuristic processing HTML *********
There are 2 blank lines. 0.021052631578947368 percent blank
Hard line breaks check returned False
Median line length is 126, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing text/part0024.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned True
Median line length is 43, calculated with html format
Unwrapping required, unwrapping Lines
Fixing hyphenated content
Formatting scene breaks
Parsing text/part0017.html ...
********* Heuristic processing HTML *********
There are 2 blank lines. 0.028985507246376812 percent blank
Hard line breaks check returned False
Median line length is 108, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing styles/0006.css ...
Parsing text/part0023.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0028.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0026.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned False
Median line length is 164, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing styles/0001.css ...
Parsing text/part0004.html ...
********* Heuristic processing HTML *********
There are 2 blank lines. 0.024096385542168676 percent blank
Hard line breaks check returned False
Median line length is 155, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing text/part0012.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned False
Median line length is 118, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Parsing text/part0020.html ...
********* Heuristic processing HTML *********
There are 1 blank lines. 0.017241379310344827 percent blank
Hard line breaks check returned False
Median line length is 104, calculated with html format
Fixing hyphenated content
Formatting scene breaks
Reading TOC from NCX...
Removed first image
Merging user specified metadata...
Detecting structure...
Inserting metadata into book...
Flattening CSS and remapping font sizes...
Filtering CSS properties: line-height
Source base font size is 11.99880pt
Removing fake margins...
Found 30 items of level: div_1
Found 39 items of level: div_2
Found 2 items of level: div_3
Found 4 items of level: p_4
Found 1496 items of level: p_2
Found 45 items of level: p_5
Found 2 items of level: p_1
Ignoring level p_1
Ignoring level p_4
Ignoring level div_3
div_1 left margin stats: Counter({'': 25, '0': 1})
div_1 right margin stats: Counter({'': 25, '0': 1})
div_2 left margin stats: Counter({'': 1})
div_2 right margin stats: Counter({'': 1})
Negative text indent detected at level p_2, ignoring this level
p_5 left margin stats: Counter({'0': 45})
p_5 right margin stats: Counter({'0': 45})
Cleaning up manifest...
Trimming unused files from manifest...
Trimming 'images/00022.jpeg' from manifest
Trimming 'images/00020.jpeg' from manifest
Trimming 'images/00025.jpeg' from manifest
Trimming 'images/00021.jpeg' from manifest
Trimming 'images/00023.jpeg' from manifest
Creating EPUB Output...
Splitting markup on page breaks and flow limits, if any...
Looking for large trees in text/part0000.html...
No large trees found
Looking for large trees in text/part0018.html...
No large trees found
Looking for large trees in text/part0025.html...
No large trees found
Looking for large trees in text/part0013.html...
No large trees found
Looking for large trees in text/part0022.html...
No large trees found
Looking for large trees in text/part0005.html...
No large trees found
Looking for large trees in text/part0009.html...
No large trees found
Looking for large trees in text/part0015.html...
No large trees found
Looking for large trees in text/part0019.html...
No large trees found
Looking for large trees in text/part0027.html...
No large trees found
Looking for large trees in jacket.xhtml...
No large trees found
Looking for large trees in text/part0014.html...
No large trees found
Looking for large trees in text/part0001.html...
No large trees found
Looking for large trees in text/part0002.html...
No large trees found
Looking for large trees in text/part0010.html...
No large trees found
Looking for large trees in text/part0016.html...
No large trees found
Looking for large trees in text/part0003.html...
No large trees found
Looking for large trees in text/part0007.html...
No large trees found
Looking for large trees in text/part0021.html...
No large trees found
Looking for large trees in text/part0006.html...
No large trees found
Looking for large trees in text/part0008.html...
No large trees found
Looking for large trees in text/part0011.html...
No large trees found
Splitting on page-break at id=calibre_pb_0
Adjusted split point to ancestor
Splitting on page-break at id=calibre_pb_1
Splitting on page-break at id=calibre_pb_2
Splitting on page-break at id=calibre_pb_3
Splitting on page-break at id=calibre_pb_4
Splitting on page-break at id=calibre_pb_5
Splitting on page-break at id=calibre_pb_6
Splitting on page-break at id=calibre_pb_7
Splitting on page-break at id=calibre_pb_8
Looking for large trees in text/part0024.html...
No large trees found
Split into 9 parts
Looking for large trees in text/part0017.html...
No large trees found
Looking for large trees in text/part0023.html...
No large trees found
Splitting on page-break at id=calibre_pb_0
Looking for large trees in text/part0028.html...
No large trees found
Split into 2 parts
Looking for large trees in text/part0026.html...
No large trees found
Looking for large trees in text/part0004.html...
No large trees found
Looking for large trees in text/part0012.html...
No large trees found
Looking for large trees in text/part0020.html...
No large trees found
Encrypting font: fonts/00001.otf
Encrypting font: fonts/00002.otf
Encrypting font: fonts/00003.otf
Encrypting font: fonts/00004.otf
Encrypting font: fonts/00005.otf
Encrypting font: fonts/00006.otf
Encrypting font: fonts/00007.otf
Encrypting font: fonts/00008.otf
Encrypting font: fonts/00009.otf
Encrypting font: fonts/00010.otf
Encrypting font: fonts/00011.otf
Encrypting font: fonts/00012.otf
Encrypting font: fonts/00017.otf
Encrypting font: fonts/00018.otf
Upgrading to EPUB 3...
EPUB output written to C:\Users\Dave\AppData\Local\Temp\calibre_0q52gfog\i1aiot12.epub
CPU: AMD Ryzen 5 5600X (Vermeer, VMR-B0) 3700 MHz (37.00x100.0) @ 3593 MHz (36.00x99.8) Motherboard: GIGABYTE B550 VISION D-P BIOS: F15d, 07/20/2022 Chipset: AMD B550 (Promontory PROM19 C) Memory: 16384 MBytes @ 1597 MHz, 14-14-14-34 - 8192 MB PC25600 DDR4 SDRAM - G.Skill F4-3200C14-8GFX - 8192 MB PC25600 DDR4 SDRAM - G.Skill F4-3200C14-8GFX Graphics: AMD Radeon RX 6800 XT AMD Radeon RX 6800 XT, 16384 MB GDDR6 SDRAM Drive: KINGSTON SKC3000D2048G, 2000.4 GB, NVMe Sound: ATI/AMD Navi2x - High Definition Audio Controller Sound: AMD Family 17h/19h - HD Audio Controller Network: RealTek Semiconductor RTL8125 Gaming 2.5GbE Family Ethernet Controller Network: Intel Wi-Fi 6 AX200 160MHz OS: Microsoft Windows 10 Professional (x64) Build 19045.2364 Last edited by enuddleyarbl; 01-08-2023 at 09:05 PM. |
|
|
|
|
|
#2 |
|
Bibliophagist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 48,676
Karma: 174510110
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
Could you try using the KindleUnpack plugin to convert the azw3 to epub and see if the images survive?
|
|
|
|
| Advert | |
|
|
|
|
#3 | |
|
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 793
Karma: 1538394
Join Date: Sep 2013
Device: Kobo Sage
|
Quote:
Surprisingly, I ran the AZW3 through ScrambleEbook and the resulting AZW3 also loses the corresponding image in the Calibre conversion process. And, like the real book, that images comes across just fine in KindleUpack. https://drive.google.com/file/d/1s9f...ew?usp=sharing |
|
|
|
|
|
|
#4 |
|
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,664
Karma: 28549046
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Open a bug report and attach the file there. Mark it private.
|
|
|
|
|
|
#5 |
|
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,171
Karma: 92500001
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
|
Your log shows that you have the option "Remove first image" set under the Structure Detection tab. That appears to be the source of the problem.
Unrelated to this, you also have Heuristic Processing enabled which can cause a variety of problems if used indiscriminately. |
|
|
|
| Advert | |
|
|
|
|
#6 | |
|
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 793
Karma: 1538394
Join Date: Sep 2013
Device: Kobo Sage
|
Quote:
I'm going to consider this to be "works as designed" and not bother opening any kind of bug report. I'll have to consider whether I want to turn that option off or keep it on and just keep an eye out for unexpectedly disappearing images. Thanks. BTW: what is considered to be the "First Image?" The first image that shows up in the book (usually the cover) or the first image that's listed in the Images section? |
|
|
|
|
|
|
#7 |
|
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,664
Karma: 28549046
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
the first image that shows up in the book.
|
|
|
|
![]() |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Conversion epub to AZW3 removes images | FelixKrull | Conversion | 1 | 10-17-2022 06:38 AM |
| epub3 to azw3 conversion | lumpynose | Conversion | 4 | 05-18-2019 10:55 PM |
| AZW3 to EPUB conversion: images cropped | dr_Fell | Conversion | 9 | 07-01-2018 06:17 PM |
| Calibre Epub to AZW3: Scale images to fit Voyage/PW? | cw30000 | Conversion | 3 | 07-08-2017 01:10 AM |
| AZW3 Images dissapear when the book is added to the Calibre Library | alansplace | Calibre | 3 | 11-08-2015 06:48 PM |