I have an ebook in AZW3 format.
When I convert it to PDF, Calibre adds an unnecessary page break inside a chapter title, which increases the number of pages in the book.
Convert book 1 of 1 (GMAT Integrated Reasoning and Essay (Manhattan Prep GMAT Strategy Guides))
Conversion options changed from defaults:
debug_pipeline: u'D:\\data\\programmi\\Calibre Portable\\conv_debug'
pdf_default_font_size: 14
markup_chapter_headings: False
paper_size: u'a4'
italicize_common_cases: False
fix_indents: False
pdf_serif_family: u'Constantia'
output_profile: 'generic_eink'
verbose: 2
unwrap_lines: False
read_metadata_from_opf: u'C:\\Users\\raffa\\AppData\\Local\\Temp\\calibre_c3jbvj\\4j9uwf.opf'
delete_blank_paragraphs: False
dehyphenate: False
cover: u'C:\\Users\\raffa\\AppData\\Local\\Temp\\calibre_c3jbvj\\p4pa8h.jpeg'
format_scene_breaks: False
enable_heuristics: True
Resolved conversion options
calibre version: 3.16.0
{'asciiize': False,
'author_sort': None,
'authors': None,
'base_font_size': 0.0,
'book_producer': None,
'change_justification': u'original',
'chapter': u"//*[((name()='h1' or name()='h2') and re:test(., '\\s*((chapter|book|section|part)\\s+)|((prolog|prologue|epilogue)(\\s+|$))', 'i')) or @class = 'chapter']",
'chapter_mark': u'pagebreak',
'comments': None,
'cover': u'C:\\Users\\raffa\\AppData\\Local\\Temp\\calibre_c3jbvj\\p4pa8h.jpeg',
'custom_size': None,
'debug_pipeline': u'D:\\data\\programmi\\Calibre Portable\\conv_debug',
'dehyphenate': False,
'delete_blank_paragraphs': False,
'disable_font_rescaling': False,
'duplicate_links_in_toc': False,
'embed_all_fonts': False,
'embed_font_family': None,
'enable_heuristics': True,
'expand_css': False,
'extra_css': None,
'filter_css': u'',
'fix_indents': False,
'font_size_mapping': None,
'format_scene_breaks': False,
'html_unwrap_factor': 0.4,
'input_encoding': None,
'input_profile': <calibre.customize.profiles.InputProfile object at 0x04CD7C90>,
'insert_blank_line': False,
'insert_blank_line_size': 0.5,
'insert_metadata': False,
'isbn': None,
'italicize_common_cases': False,
'keep_ligatures': False,
'language': None,
'level1_toc': None,
'level2_toc': None,
'level3_toc': None,
'line_height': 0.0,
'linearize_tables': False,
'margin_bottom': 5.0,
'margin_left': 5.0,
'margin_right': 5.0,
'margin_top': 5.0,
'markup_chapter_headings': False,
'max_toc_links': 50,
'minimum_line_height': 120.0,
'no_chapters_in_toc': False,
'no_inline_navbars': False,
'output_profile': <calibre.customize.profiles.GenericEink object at 0x04CD7E90>,
'page_breaks_before': u"//*[name()='h1' or name()='h2']",
'paper_size': u'a4',
'pdf_add_toc': False,
'pdf_default_font_size': 14,
'pdf_footer_template': None,
'pdf_header_template': None,
'pdf_mark_links': False,
'pdf_mono_family': u'Liberation Mono',
'pdf_mono_font_size': 16,
'pdf_page_margin_bottom': 72.0,
'pdf_page_margin_left': 72.0,
'pdf_page_margin_right': 72.0,
'pdf_page_margin_top': 72.0,
'pdf_page_numbers': False,
'pdf_sans_family': u'Liberation Sans',
'pdf_serif_family': u'Constantia',
'pdf_standard_font': u'serif',
'prefer_metadata_cover': False,
'preserve_cover_aspect_ratio': False,
'pretty_print': False,
'pubdate': None,
'publisher': None,
'rating': None,
'read_metadata_from_opf': u'C:\\Users\\raffa\\AppData\\Local\\Temp\\calibre_c3jbvj\\4j9uwf.opf',
'remove_fake_margins': True,
'remove_first_image': False,
'remove_paragraph_spacing': False,
'remove_paragraph_spacing_indent_size': 1.5,
'renumber_headings': True,
'replace_scene_breaks': u'',
'search_replace': '[]',
'series': None,
'series_index': None,
'smarten_punctuation': False,
'sr1_replace': None,
'sr1_search': None,
'sr2_replace': None,
'sr2_search': None,
'sr3_replace': None,
'sr3_search': None,
'start_reading_at': None,
'subset_embedded_fonts': False,
'tags': None,
'timestamp': None,
'title': None,
'title_sort': None,
'toc_filter': None,
'toc_threshold': 6,
'toc_title': None,
'transform_css_rules': '[]',
'uncompressed_pdf': False,
'unit': u'inch',
'unsmarten_punctuation': False,
'unwrap_lines': False,
'use_auto_toc': False,
'use_profile_size': False,
'verbose': 2}
InputFormatPlugin: MOBI Input running
on C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\27mm_p.azw3
Found KF8 MOBI of type 'standalone'
Extracting text...
Input debug saved to: D:\data\programmi\Calibre Portable\conv_debug\input
Parsing all content...
Parsing text/part0029.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned False
Median line length is 143, calculated with html format
Parsing text/part0030.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0028.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0022.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0023.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned False
Median line length is 149, calculated with html format
Parsing text/part0005.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0032.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned False
Median line length is 100, calculated with html format
Parsing text/part0031.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0024.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0000.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0002.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0003.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned True
Median line length is 37, calculated with html format
Parsing text/part0007.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0017.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned False
Median line length is 95, calculated with html format
Parsing text/part0025.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0011.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned False
Median line length is 104, calculated with html format
Parsing text/part0013.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0015.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0021.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0019.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing styles/0001.css ...
Parsing text/part0026.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned False
Median line length is 128, calculated with html format
Parsing text/part0001.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned False
Median line length is 88, calculated with html format
Parsing text/part0004.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned False
Median line length is 59, calculated with html format
Parsing text/part0006.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0008.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned False
Median line length is 164, calculated with html format
Parsing text/part0010.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned True
Median line length is 79, calculated with html format
Parsing text/part0012.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0014.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned False
Median line length is 87, calculated with html format
Parsing text/part0016.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0018.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0020.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned False
Median line length is 132, calculated with html format
Parsing text/part0009.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0033.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0035.html ...
********* Heuristic processing HTML *********
There are 0 blank lines. 0.0 percent blank
Hard line breaks check returned False
Median line length is 110, calculated with html format
Parsing text/part0027.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Parsing text/part0034.html ...
********* Heuristic processing HTML *********
flow is too short, not running heuristics
Reading TOC from NCX...
Parsed HTML written to: D:\data\programmi\Calibre Portable\conv_debug\parsed
Merging user specified metadata...
Detecting structure...
Detected chapter: Chapter 1
Detected chapter: Chapter 1
Detected chapter: Chapter 2
Detected chapter: Chapter 2
Detected chapter: Chapter 3
Detected chapter: Chapter 3
Detected chapter: Chapter 4
Detected chapter: Chapter 4
Detected chapter: Chapter 5
Detected chapter: Chapter 5
Detected chapter: Chapter 6
Detected chapter: Chapter 6
Detected chapter: Chapter 7
Detected chapter: Two-Part Analysis
Detected chapter: Chapter 7
Detected chapter: Chapter 8
Detected chapter: Chapter 8
Structured HTML written to: D:\data\programmi\Calibre Portable\conv_debug\structure
Flattening CSS and remapping font sizes...
Source base font size is 12.00000pt
Removing fake margins...
Found 9 items of level: p_3
Found 1145 items of level: p_2
Found 11 items of level: p_5
Found 36 items of level: div_1
Found 7 items of level: div_2
Ignoring level p_3
Ignoring level p_5
Negative text indent detected at level p_2, ignoring this level
div_1 left margin stats: Counter({u'': 18})
div_1 right margin stats: Counter({u'': 18})
div_2 left margin stats: Counter()
div_2 right margin stats: Counter()
Cleaning up manifest...
Trimming unused files from manifest...
Trimming u'images/00070.jpeg' from manifest
Processed HTML written to: D:\data\programmi\Calibre Portable\conv_debug\processed
Creating PDF Output...
Splitting markup on page breaks and flow limits, if any...
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_1
Split into 2 parts
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_1
Splitting on page-break at id=calibre_pb_2
Splitting on page-break at id=calibre_pb_3
Adjusted split point to ancestor
Split into 3 parts
Splitting on page-break at id=calibre_pb_0
Split into 2 parts
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_1
Split into 2 parts
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_1
Split into 2 parts
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_1
Splitting on page-break at id=calibre_pb_2
Splitting on page-break at id=calibre_pb_3
Split into 5 parts
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_1
Split into 2 parts
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_1
Split into 2 parts
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_1
Split into 2 parts
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_1
Split into 2 parts
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_1
Split into 2 parts
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_0
Splitting on page-break at id=calibre_pb_1
Split into 2 parts
Splitting on page-break at id=calibre_pb_0
Converting input as a text based book...
Serializing oeb input to disk for processing...
Rendering pages to PDF...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0000_split_000.html...
Loading font: family_name=Constantia full_name=Constantia Bold postscript_name=Constantia-Bold subfamily_name=Bold
Loading font: family_name=Constantia full_name=Constantia postscript_name=Constantia subfamily_name=Regular
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0000_split_001.html...
Loading font: family_name=Constantia full_name=Constantia Bold Italic postscript_name=Constantia-BoldItalic subfamily_name=Bold Italic
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0001.html...
Loading font: family_name=Constantia full_name=Constantia Italic postscript_name=Constantia-Italic subfamily_name=Italic
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0002.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0003.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0004_split_000.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0004_split_001.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0004_split_002.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0004_split_003.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0004_split_004.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0005.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0006_split_000.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0006_split_001.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0007.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0008.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0009_split_000.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0009_split_001.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0010.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0011.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0012_split_000.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0012_split_001.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0013.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0014.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0015_split_000.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0015_split_001.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0016.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0017.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0018_split_000.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0018_split_001.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0019.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0020.html...
Could not find page 88 for link destination, using page 87 instead
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0021_split_000.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0021_split_001.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0022.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0023.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0024_split_000.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0024_split_001.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0024_split_002.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0025.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0026.html...
Loading font: family_name=Tahoma full_name=Tahoma postscript_name=Tahoma subfamily_name=Regular
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0027_split_000.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0027_split_001.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0028.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0029.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0030_split_000.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0030_split_001.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0031.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0032.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0033_split_000.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0033_split_001.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0034.html...
Processing C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\shbcvw_pdf_out\text\part0035.html...
Rendered PDF in 3.052 seconds:
PDF output written to C:\Users\raffa\AppData\Local\Temp\calibre_c3jbvj\swpzhz.pdf
Here is a screenshot of the end result
As you can see, "Chapter 1 of" and "Integrated Reasoning - How to use this guide" end up on two separate pages, which is wrong: they belong to the same chapter title.
With the Prince PDF plugin, by contrast, the result is correct, as you can see from this screenshot:
Checking "Renumber sequences of <h1> and <h2> tags to prevent splitting" in "Heuristic Processing" tab of the conversion dialog changes nothing.
Can you help me?
I would like to use the stock PDF conversion option because Prince does not scale the images well.
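
One detail in the log above may be relevant: the resolved option 'page_breaks_before' is u"//*[name()='h1' or name()='h2']", and every chapter is listed twice under "Detecting structure". My guess, which I have not verified against the book's HTML, is that the chapter title is split across an <h1> and an <h2>, so each half gets its own page break. The sketch below only emulates that expression with the Python standard library; it is not Calibre's code, and the sample markup is hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical markup: an assumption about how the chapter title might be
# split across two headings. The book's real HTML is not shown in this post.
SAMPLE = """
<body>
  <h1>Chapter 1 of</h1>
  <h2>Integrated Reasoning - How to use this guide</h2>
  <p>First paragraph of the chapter.</p>
</body>
"""

# Emulates the tag test in //*[name()='h1' or name()='h2'] from the log.
BREAK_TAGS = {"h1", "h2"}


def headings_with_page_break(xhtml):
    """Return the text of every element that expression would select."""
    root = ET.fromstring(xhtml)
    return [el.text for el in root.iter() if el.tag in BREAK_TAGS]


breaks = headings_with_page_break(SAMPLE)
for text in breaks:
    print("page break before:", text)
```

If the real markup looks like this, both headings match and the title would be split exactly as in my first screenshot.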