|  11-08-2021, 06:00 AM | #1 | 
| Member  Posts: 11 Karma: 10 Join Date: Oct 2021 Device: none | 
				
				PDF removes chapter breaks
			 
			
			Hello !  I have an issue with my chapter's page breaks. I use this command line for epub conversion : PHP Code: 
			PHP Code: 
			Using conversion through Calibre software works, but I need it to work through command line. I am searching through options guidelines without success  Edit : I am using Calibre 5.30 Last edited by ydonse; 11-08-2021 at 06:02 AM. | 
|   |   | 
|  11-08-2021, 07:15 AM | #2 | 
| creator of calibre            Posts: 45,595 Karma: 28548962 Join Date: Oct 2006 Location: Mumbai, India Device: Various | 
			
			The command line and the GUI use exactly the same conversion code if the versions of calibre are the same. just set the same options and you will get the same result. You can check the options the GUI uses in the job log.
		 | 
|   |   | 
|  11-08-2021, 08:46 AM | #3 | 
| Member  Posts: 11 Karma: 10 Join Date: Oct 2021 Device: none | 
			
			Thank you for your response Kovid, and thank you for your great work ! In my command line response I have these logs : " 1 => 'Conversion options changed from defaults:', 2 => ' margin_top: 99.0', 3 => ' cover: mypath.png', 4 => ' verbose: 1', 5 => ' margin_bottom: 85.0', 6 => ' preserve_cover_aspect_ratio: True'," And in the soft I have these : "Conversion options changed from defaults: pdf_serif_family: 'Liberation Serif' cover: '/tmp/calibre_5.31.1_tmp_xxrib_kp/lfbxxtgd.jpeg' pdf_mono_family: 'Liberation Mono' output_profile: 'generic_eink' read_metadata_from_opf: '/tmp/calibre_5.31.1_tmp_xxrib_kp/qy1i8fel.opf' pdf_sans_family: 'Liberation Sans' verbose: 2 debug_pipeline: '/home/librinova/Documents/debug'" While they are not exactly the same I don't see any options that could lead to a difference in the chapter page breaks. Moreover, in both logs Chapter is set on pagebreak. I'll continue searching, thanks again for your time. | 
|   |   | 
|  11-08-2021, 11:47 AM | #4 | 
| Still reading            Posts: 14,896 Karma: 110507267 Join Date: Jun 2017 Location: Ireland Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper | 
			
			Other alternatives: "Print" on the Calibre Viewer works well to create a PDF file. It only creates a PDF and can add page numbers. Best option is format and page size, header, footer, page numbers, contents with page numbers etc in LO Writer and Export a PDF. Better than most versions of Word for PDF creation and free. Gives publisher quality PDF assuming the page format, headers, footers, styles, image resolutions etc are OK. There is even an option on export to resample all images to 300 dpi (or other desired resolution). Since you HAVE a docx, I'd only use Calibre to make an epub, and then convert epub to mobi, azw3, kfx, kepub, lrf, pdb (both sorts), not a PDF. | 
|   |   | 
|  09-20-2022, 11:43 AM | #5 | 
| Member  Posts: 11 Karma: 10 Join Date: Oct 2021 Device: none | 
			
			Hello again !  I still have the issue but now directly in the GUI. When I convert a docx to epub it keeps the page breaks. When I convert a docx to pdf, the page breaks are gone. However, when I convert the epub to pdf it works. I don't understand why. edit : i'm on version 5.44 Last edited by ydonse; 09-20-2022 at 11:44 AM. Reason: precise calibre version | 
|   |   | 
|  09-20-2022, 07:13 PM | #6 | 
| creator of calibre            Posts: 45,595 Karma: 28548962 Join Date: Oct 2006 Location: Mumbai, India Device: Various | |
|   |   | 
|  09-20-2022, 07:48 PM | #7 | |
| null operator (he/him)            Posts: 22,001 Karma: 30277294 Join Date: Mar 2012 Location: Sydney Australia Device: none | Quote: 
 BR | |
|   |   | 
|  09-21-2022, 05:54 AM | #8 | |
| Member  Posts: 11 Karma: 10 Join Date: Oct 2021 Device: none | Quote: 
 @Kovid : I attached the document.docx (an example) I am trying to convert, the defective output.pdf and the epub which is fine. The options are the default ones. Here is the conversion log to pdf : Code: Conversion du livre 1 sur 1 (extract(1))
Conversion options changed from defaults:
  pdf_serif_family: 'Liberation Serif'
  pdf_mono_family: 'Liberation Mono'
  output_profile: 'generic_eink'
  verbose: 2
  read_metadata_from_opf: '/tmp/calibre_5.44.0_tmp_jylpjcwp/s42l7phw.opf'
  pdf_sans_family: 'Liberation Sans'
Resolved conversion options
calibre version: 5.44.0
{'asciiize': False,
 'author_sort': None,
 'authors': None,
 'base_font_size': 0.0,
 'book_producer': None,
 'change_justification': 'original',
 'chapter': "//*[((name()='h1' or name()='h2') and re:test(., "
            "'\\s*((chapter|book|section|part)\\s+)|((prolog|prologue|epilogue)(\\s+|$))', "
            "'i')) or @class = 'chapter']",
 'chapter_mark': 'pagebreak',
 'comments': None,
 'cover': None,
 'custom_size': None,
 'debug_pipeline': None,
 'dehyphenate': True,
 'delete_blank_paragraphs': True,
 'disable_font_rescaling': False,
 'docx_inline_subsup': False,
 'docx_no_cover': False,
 'docx_no_pagebreaks_between_notes': False,
 'duplicate_links_in_toc': False,
 'embed_all_fonts': False,
 'embed_font_family': None,
 'enable_heuristics': False,
 'expand_css': False,
 'extra_css': None,
 'filter_css': '',
 'fix_indents': True,
 'font_size_mapping': None,
 'format_scene_breaks': True,
 'html_unwrap_factor': 0.4,
 'input_encoding': None,
 'input_profile': <calibre.customize.profiles.InputProfile object at 0x7f94d8c39070>,
 'insert_blank_line': False,
 'insert_blank_line_size': 0.5,
 'insert_metadata': False,
 'isbn': None,
 'italicize_common_cases': True,
 'keep_ligatures': False,
 'language': None,
 'level1_toc': None,
 'level2_toc': None,
 'level3_toc': None,
 'line_height': 0.0,
 'linearize_tables': False,
 'margin_bottom': 5.0,
 'margin_left': 5.0,
 'margin_right': 5.0,
 'margin_top': 5.0,
 'markup_chapter_headings': True,
 'max_toc_links': 50,
 'minimum_line_height': 120.0,
 'no_chapters_in_toc': False,
 'no_inline_navbars': False,
 'output_profile': <calibre.customize.profiles.GenericEink object at 0x7f94d8c39370>,
 'page_breaks_before': '/',
 'paper_size': 'letter',
 'pdf_add_toc': False,
 'pdf_default_font_size': 20,
 'pdf_footer_template': None,
 'pdf_header_template': None,
 'pdf_hyphenate': False,
 'pdf_mark_links': False,
 'pdf_mono_family': 'Liberation Mono',
 'pdf_mono_font_size': 16,
 'pdf_odd_even_offset': 0.0,
 'pdf_page_margin_bottom': 72.0,
 'pdf_page_margin_left': 72.0,
 'pdf_page_margin_right': 72.0,
 'pdf_page_margin_top': 72.0,
 'pdf_page_number_map': None,
 'pdf_page_numbers': False,
 'pdf_sans_family': 'Liberation Sans',
 'pdf_serif_family': 'Liberation Serif',
 'pdf_standard_font': 'serif',
 'pdf_use_document_margins': False,
 'prefer_metadata_cover': False,
 'preserve_cover_aspect_ratio': False,
 'pretty_print': False,
 'pubdate': None,
 'publisher': None,
 'rating': None,
 'read_metadata_from_opf': '/tmp/calibre_5.44.0_tmp_jylpjcwp/s42l7phw.opf',
 'remove_fake_margins': True,
 'remove_first_image': False,
 'remove_paragraph_spacing': False,
 'remove_paragraph_spacing_indent_size': 1.5,
 'renumber_headings': True,
 'replace_scene_breaks': '',
 'search_replace': '[]',
 'series': None,
 'series_index': None,
 'smarten_punctuation': False,
 'sr1_replace': None,
 'sr1_search': None,
 'sr2_replace': None,
 'sr2_search': None,
 'sr3_replace': None,
 'sr3_search': None,
 'start_reading_at': None,
 'subset_embedded_fonts': False,
 'tags': None,
 'timestamp': None,
 'title': None,
 'title_sort': None,
 'toc_filter': None,
 'toc_threshold': 6,
 'toc_title': None,
 'transform_css_rules': '[]',
 'transform_html_rules': '[]',
 'uncompressed_pdf': False,
 'unit': 'inch',
 'unsmarten_punctuation': False,
 'unwrap_lines': True,
 'use_auto_toc': False,
 'use_profile_size': False,
 'verbose': 2}
InputFormatPlugin: DOCX Input running
on /tmp/calibre_5.44.0_tmp_jylpjcwp/t_j8ms92.docx
Converting Word markup to HTML
Converting styles to CSS
Cleaning up redundant markup generated by Word
Generating Table of Contents from headings
Parsing all content...
Parsing index.html ...
Initial parse failed, using more forgiving parsers
Parsing index.html as HTML
Parsing docx.css ...
Reading TOC from NCX...
Merging user specified metadata...
Detecting structure...
Flattening CSS and remapping font sizes...
Source base font size is 12.00000pt
Removing fake margins...
Found 60 items of level: p_1
p_1  left margin stats: Counter({'0': 60})
p_1  right margin stats: Counter({'0': 60})
Cleaning up manifest...
Trimming unused files from manifest...
Creating PDF Output...
Converting input as a text based book...
Merged 2 instances of LiberationSerif reducing size from 15.9 KB to 15.5 KB
Merged 2 instances of Carlito reducing size from 4.3 KB to 3.5 KB
Merged 2 instances of LiberationSerif-Bold reducing size from 7.9 KB to 7.4 KB
PDF output written to /tmp/calibre_5.44.0_tmp_jylpjcwp/9q_c2az6.pdf | |
|   |   | 
|  09-21-2022, 06:35 AM | #9 | 
| null operator (he/him)            Posts: 22,001 Karma: 30277294 Join Date: Mar 2012 Location: Sydney Australia Device: none | 
			
			Try putting the page breaks in a paragraph by themselves rather than at the end of a paragraph.  See attachments, the PDF was created in the GIU from the modified DOCX. BR | 
|   |   | 
|  09-21-2022, 08:54 AM | #10 | 
| Member  Posts: 11 Karma: 10 Join Date: Oct 2021 Device: none | 
			
			I tried and it worked, thank you. I don't understand why there is a difference between the pdf and epub result but at least I have something to work with, thanks.
		 | 
|   |   | 
|  09-21-2022, 09:28 AM | #11 | |
| null operator (he/him)            Posts: 22,001 Karma: 30277294 Join Date: Mar 2012 Location: Sydney Australia Device: none | Quote: 
 When a page break is created in Word it is always inserted in a new para. The only way I know to get it appended to end of the previous para is to rubout the pilcrow at the end of the previous para - and why would you do that. BR | |
|   |   | 
|  | 
| Tags | 
| chapter breaks, conversion, epub 3, options, pdf | 
| 
 | 
|  Similar Threads | ||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| xpath to insert chapter breaks - but chapter name cut off ? | Rob557 | Conversion | 2 | 03-06-2014 06:59 AM | 
| Calibre removes pages breaks from mobi | djprescott | Conversion | 2 | 02-12-2013 05:24 PM | 
| MOBI -> EPUB conversion removes scene breaks | tlangner | Conversion | 7 | 01-26-2013 12:37 AM | 
| Combining several PDFs into one PDF with chapter breaks between each one | kerrypolka | Conversion | 0 | 10-21-2012 08:02 AM | 
| Calibre removes Sigal's created chapter breaks | Themus | Calibre | 6 | 11-08-2011 09:35 PM |