11-08-2021, 06:00 AM | #1 |
Member
Posts: 11
Karma: 10
Join Date: Oct 2021
Device: none
|
PDF removes chapter breaks
Hello!
I have an issue with my chapters' page breaks. I use this command line for the epub conversion:
PHP Code:
Converting through the Calibre GUI works, but I need it to work from the command line. I have searched the options documentation without success.
Edit: I am using Calibre 5.30.
Last edited by ydonse; 11-08-2021 at 06:02 AM. |
11-08-2021, 07:15 AM | #2 |
creator of calibre
Posts: 44,339
Karma: 23661992
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
The command line and the GUI use exactly the same conversion code if the versions of calibre are the same. Just set the same options and you will get the same result. You can check the options the GUI uses in the job log.
|
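Editor's sketch of the advice above: one way to replay the options from a GUI job log on the command line is to turn each logged option name into an `ebook-convert` switch. This assumes that the long switch is the option name with underscores replaced by hyphens and that boolean options set to True are passed as bare flags. The file names are placeholders and the helper `build_ebook_convert_args` is hypothetical, not part of calibre.

```python
import shlex


def build_ebook_convert_args(src, dst, options):
    """Build an argv list for ebook-convert from logged option names.

    Assumption: '--' + name with '_' replaced by '-' is the matching switch.
    """
    args = ["ebook-convert", src, dst]
    for name, value in options.items():
        flag = "--" + name.replace("_", "-")
        if value is True:
            args.append(flag)  # boolean switches take no value
        elif value is False or value is None:
            continue  # leave the option at its default
        else:
            args.extend([flag, str(value)])
    return args


# Options copied from a GUI job log. Temporary paths (cover,
# read_metadata_from_opf) and the counting switch 'verbose' are left out.
gui_options = {
    "pdf_serif_family": "Liberation Serif",
    "pdf_sans_family": "Liberation Sans",
    "pdf_mono_family": "Liberation Mono",
    "output_profile": "generic_eink",
}

cmd = build_ebook_convert_args("document.docx", "output.pdf", gui_options)
print(shlex.join(cmd))  # a shell-quoted command line you can paste or run
```

Passing the list to `subprocess.run(cmd, check=True)` avoids shell quoting problems with font names that contain spaces.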
11-08-2021, 08:46 AM | #3 |
Member
Posts: 11
Karma: 10
Join Date: Oct 2021
Device: none
|
Thank you for your response, Kovid, and thank you for your great work!
In my command-line output I have these logs:
1 => 'Conversion options changed from defaults:',
2 => ' margin_top: 99.0',
3 => ' cover: mypath.png',
4 => ' verbose: 1',
5 => ' margin_bottom: 85.0',
6 => ' preserve_cover_aspect_ratio: True',
And in the GUI I have these:
Conversion options changed from defaults:
pdf_serif_family: 'Liberation Serif'
cover: '/tmp/calibre_5.31.1_tmp_xxrib_kp/lfbxxtgd.jpeg'
pdf_mono_family: 'Liberation Mono'
output_profile: 'generic_eink'
read_metadata_from_opf: '/tmp/calibre_5.31.1_tmp_xxrib_kp/qy1i8fel.opf'
pdf_sans_family: 'Liberation Sans'
verbose: 2
debug_pipeline: '/home/librinova/Documents/debug'
While they are not exactly the same, I don't see any option that could lead to a difference in the chapter page breaks. Moreover, in both logs the chapter mark is set to pagebreak. I'll continue searching; thanks again for your time. |
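Editor's sketch: comparing two "changed from defaults" blocks by eye is error-prone, so a small script can list what differs. The log text below is copied from the post above (with the PHP array wrapping removed); the helper names `parse_options` and `diff_options` are made up for illustration.

```python
CLI_LOG = """Conversion options changed from defaults:
margin_top: 99.0
cover: mypath.png
verbose: 1
margin_bottom: 85.0
preserve_cover_aspect_ratio: True"""

GUI_LOG = """Conversion options changed from defaults:
pdf_serif_family: 'Liberation Serif'
cover: '/tmp/calibre_5.31.1_tmp_xxrib_kp/lfbxxtgd.jpeg'
pdf_mono_family: 'Liberation Mono'
output_profile: 'generic_eink'
read_metadata_from_opf: '/tmp/calibre_5.31.1_tmp_xxrib_kp/qy1i8fel.opf'
pdf_sans_family: 'Liberation Sans'
verbose: 2
debug_pipeline: '/home/librinova/Documents/debug'"""


def parse_options(log_text):
    """Collect 'name: value' lines; the header line is skipped because
    its 'name' part contains spaces."""
    options = {}
    for line in log_text.splitlines():
        name, sep, value = line.strip().partition(":")
        if sep and name and " " not in name:
            options[name] = value.strip().strip("'")
    return options


def diff_options(cli, gui):
    """Return options set only on the CLI, only in the GUI, and set in both
    but with different values."""
    only_cli = {k: cli[k] for k in cli.keys() - gui.keys()}
    only_gui = {k: gui[k] for k in gui.keys() - cli.keys()}
    changed = {k: (cli[k], gui[k]) for k in cli.keys() & gui.keys() if cli[k] != gui[k]}
    return only_cli, only_gui, changed


only_cli, only_gui, changed = diff_options(parse_options(CLI_LOG), parse_options(GUI_LOG))
print("only on the command line:", sorted(only_cli))
print("only in the GUI:", sorted(only_gui))
print("different values:", sorted(changed))
```

Here the GUI run sets `output_profile` and the three PDF font families, none of which appear in the command-line run.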
11-08-2021, 11:47 AM | #4 |
the rook, bossing Never.
Posts: 12,249
Karma: 89531599
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
|
Other alternatives:
"Print" in the Calibre viewer works well for creating a PDF file. It only creates a PDF, and it can add page numbers.
The best option is to set up the format and page size, header, footer, page numbers, contents with page numbers, etc. in LO Writer and export a PDF. It is better than most versions of Word for PDF creation, and it is free. It gives a publisher-quality PDF, assuming the page format, headers, footers, styles, image resolutions, etc. are OK. There is even an option on export to resample all images to 300 dpi (or another desired resolution).
Since you HAVE a docx, I'd only use Calibre to make an epub, and then convert the epub to mobi, azw3, kfx, kepub, lrf, or pdb (both sorts), not to PDF. |
09-20-2022, 11:43 AM | #5 |
Member
Posts: 11
Karma: 10
Join Date: Oct 2021
Device: none
|
Hello again!
I still have the issue, but now directly in the GUI. When I convert a docx to epub, the page breaks are kept. When I convert a docx to pdf, the page breaks are gone. However, when I convert the epub to pdf, it works. I don't understand why.
Edit: I'm on version 5.44.
Last edited by ydonse; 09-20-2022 at 11:44 AM. Reason: precise calibre version |
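Editor's sketch: since the post reports that the docx to epub to pdf route keeps the page breaks, that route can be scripted as two `ebook-convert` calls. The function names and file names here are placeholders, and no conversion options are passed, so this is a minimal outline rather than a tuned pipeline.

```python
import subprocess
from pathlib import Path


def two_step_commands(docx_path, pdf_path):
    """Return the two ebook-convert invocations: DOCX -> EPUB, then EPUB -> PDF.

    The intermediate EPUB is written next to the PDF with the same base name.
    """
    epub_path = str(Path(pdf_path).with_suffix(".epub"))
    return [
        ["ebook-convert", str(docx_path), epub_path],
        ["ebook-convert", epub_path, str(pdf_path)],
    ]


def convert_via_epub(docx_path, pdf_path):
    """Run both steps in order; requires ebook-convert on the PATH."""
    for cmd in two_step_commands(docx_path, pdf_path):
        subprocess.run(cmd, check=True)  # raises CalledProcessError on failure


# Example call (placeholder file names):
# convert_via_epub("document.docx", "output.pdf")
```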
09-20-2022, 07:13 PM | #6 |
creator of calibre
Posts: 44,339
Karma: 23661992
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
|
09-20-2022, 07:48 PM | #7 | |
null operator (he/him)
Posts: 20,935
Karma: 27620688
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
BR |
|
09-21-2022, 05:54 AM | #8 | |
Member
Posts: 11
Karma: 10
Join Date: Oct 2021
Device: none
|
Quote:
@Kovid: I attached the document.docx (an example) that I am trying to convert, the defective output.pdf, and the epub, which is fine. The options are the default ones. Here is the log of the conversion to PDF: Code:
Converting book 1 of 1 (extract(1))
Conversion options changed from defaults:
pdf_serif_family: 'Liberation Serif'
pdf_mono_family: 'Liberation Mono'
output_profile: 'generic_eink'
verbose: 2
read_metadata_from_opf: '/tmp/calibre_5.44.0_tmp_jylpjcwp/s42l7phw.opf'
pdf_sans_family: 'Liberation Sans'
Resolved conversion options
calibre version: 5.44.0
{'asciiize': False,
 'author_sort': None,
 'authors': None,
 'base_font_size': 0.0,
 'book_producer': None,
 'change_justification': 'original',
 'chapter': "//*[((name()='h1' or name()='h2') and re:test(., "
            "'\\s*((chapter|book|section|part)\\s+)|((prolog|prologue|epilogue)(\\s+|$))', "
            "'i')) or @class = 'chapter']",
 'chapter_mark': 'pagebreak',
 'comments': None,
 'cover': None,
 'custom_size': None,
 'debug_pipeline': None,
 'dehyphenate': True,
 'delete_blank_paragraphs': True,
 'disable_font_rescaling': False,
 'docx_inline_subsup': False,
 'docx_no_cover': False,
 'docx_no_pagebreaks_between_notes': False,
 'duplicate_links_in_toc': False,
 'embed_all_fonts': False,
 'embed_font_family': None,
 'enable_heuristics': False,
 'expand_css': False,
 'extra_css': None,
 'filter_css': '',
 'fix_indents': True,
 'font_size_mapping': None,
 'format_scene_breaks': True,
 'html_unwrap_factor': 0.4,
 'input_encoding': None,
 'input_profile': <calibre.customize.profiles.InputProfile object at 0x7f94d8c39070>,
 'insert_blank_line': False,
 'insert_blank_line_size': 0.5,
 'insert_metadata': False,
 'isbn': None,
 'italicize_common_cases': True,
 'keep_ligatures': False,
 'language': None,
 'level1_toc': None,
 'level2_toc': None,
 'level3_toc': None,
 'line_height': 0.0,
 'linearize_tables': False,
 'margin_bottom': 5.0,
 'margin_left': 5.0,
 'margin_right': 5.0,
 'margin_top': 5.0,
 'markup_chapter_headings': True,
 'max_toc_links': 50,
 'minimum_line_height': 120.0,
 'no_chapters_in_toc': False,
 'no_inline_navbars': False,
 'output_profile': <calibre.customize.profiles.GenericEink object at 0x7f94d8c39370>,
 'page_breaks_before': '/',
 'paper_size': 'letter',
 'pdf_add_toc': False,
 'pdf_default_font_size': 20,
 'pdf_footer_template': None,
 'pdf_header_template': None,
 'pdf_hyphenate': False,
 'pdf_mark_links': False,
 'pdf_mono_family': 'Liberation Mono',
 'pdf_mono_font_size': 16,
 'pdf_odd_even_offset': 0.0,
 'pdf_page_margin_bottom': 72.0,
 'pdf_page_margin_left': 72.0,
 'pdf_page_margin_right': 72.0,
 'pdf_page_margin_top': 72.0,
 'pdf_page_number_map': None,
 'pdf_page_numbers': False,
 'pdf_sans_family': 'Liberation Sans',
 'pdf_serif_family': 'Liberation Serif',
 'pdf_standard_font': 'serif',
 'pdf_use_document_margins': False,
 'prefer_metadata_cover': False,
 'preserve_cover_aspect_ratio': False,
 'pretty_print': False,
 'pubdate': None,
 'publisher': None,
 'rating': None,
 'read_metadata_from_opf': '/tmp/calibre_5.44.0_tmp_jylpjcwp/s42l7phw.opf',
 'remove_fake_margins': True,
 'remove_first_image': False,
 'remove_paragraph_spacing': False,
 'remove_paragraph_spacing_indent_size': 1.5,
 'renumber_headings': True,
 'replace_scene_breaks': '',
 'search_replace': '[]',
 'series': None,
 'series_index': None,
 'smarten_punctuation': False,
 'sr1_replace': None,
 'sr1_search': None,
 'sr2_replace': None,
 'sr2_search': None,
 'sr3_replace': None,
 'sr3_search': None,
 'start_reading_at': None,
 'subset_embedded_fonts': False,
 'tags': None,
 'timestamp': None,
 'title': None,
 'title_sort': None,
 'toc_filter': None,
 'toc_threshold': 6,
 'toc_title': None,
 'transform_css_rules': '[]',
 'transform_html_rules': '[]',
 'uncompressed_pdf': False,
 'unit': 'inch',
 'unsmarten_punctuation': False,
 'unwrap_lines': True,
 'use_auto_toc': False,
 'use_profile_size': False,
 'verbose': 2}
InputFormatPlugin: DOCX Input running on /tmp/calibre_5.44.0_tmp_jylpjcwp/t_j8ms92.docx
Converting Word markup to HTML
Converting styles to CSS
Cleaning up redundant markup generated by Word
Generating Table of Contents from headings
Parsing all content...
Parsing index.html ...
Initial parse failed, using more forgiving parsers
Parsing index.html as HTML
Parsing docx.css ...
Reading TOC from NCX...
Merging user specified metadata...
Detecting structure...
Flattening CSS and remapping font sizes...
Source base font size is 12.00000pt
Removing fake margins...
Found 60 items of level: p_1
p_1 left margin stats: Counter({'0': 60})
p_1 right margin stats: Counter({'0': 60})
Cleaning up manifest...
Trimming unused files from manifest...
Creating PDF Output...
Converting input as a text based book...
Merged 2 instances of LiberationSerif reducing size from 15.9 KB to 15.5 KB
Merged 2 instances of Carlito reducing size from 4.3 KB to 3.5 KB
Merged 2 instances of LiberationSerif-Bold reducing size from 7.9 KB to 7.4 KB
PDF output written to /tmp/calibre_5.44.0_tmp_jylpjcwp/9q_c2az6.pdf |
|
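Editor's sketch: the `chapter` option in the log above is an XPath expression that selects `h1` or `h2` elements whose text passes a case-insensitive regular expression, or any element with `class="chapter"`. The snippet below tries that regular expression on a few sample headings with Python's `re` module. It assumes that `re:test` behaves like an unanchored search; the sample headings and the helper `looks_like_chapter` are invented for illustration.

```python
import re

# The regular expression from the 'chapter' option, with the Python repr
# escaping ('\\s') undone.
CHAPTER_RE = re.compile(
    r"\s*((chapter|book|section|part)\s+)|((prolog|prologue|epilogue)(\s+|$))",
    re.IGNORECASE,
)


def looks_like_chapter(heading_text):
    """True if the heading text would pass the regex part of the XPath test.

    Assumption: re:test matches anywhere in the text (like re.search).
    """
    return CHAPTER_RE.search(heading_text) is not None


for heading in ["Chapter 1", "Part Two", "Prologue", "Chapter", "Introduction"]:
    print(f"{heading!r}: {looks_like_chapter(heading)}")
```

Under that assumption, a heading such as "Introduction" is not detected as a chapter by the default expression, and a bare "Chapter" with nothing after it is not either, because the pattern requires whitespace after the keyword.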
09-21-2022, 06:35 AM | #9 |
null operator (he/him)
Posts: 20,935
Karma: 27620688
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Try putting the page breaks in a paragraph by themselves rather than at the end of a paragraph. See the attachments; the PDF was created in the GUI from the modified DOCX.
BR |
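Editor's sketch: to find the paragraphs that need this fix, one can scan `word/document.xml` inside the DOCX for paragraphs that contain both text and an explicit page break (`<w:br w:type="page"/>`). This uses only the standard library and builds a tiny stand-in file in memory for the demo. The function name is hypothetical, and breaks set through the "page break before" paragraph property are not covered.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def paragraphs_with_inline_page_break(docx_file):
    """Return (index, text) for paragraphs that mix text with a page break."""
    with zipfile.ZipFile(docx_file) as zf:
        root = ET.fromstring(zf.read("word/document.xml"))
    hits = []
    for index, para in enumerate(root.iter(f"{{{W}}}p")):
        text = "".join(t.text or "" for t in para.iter(f"{{{W}}}t"))
        has_break = any(
            br.get(f"{{{W}}}type") == "page" for br in para.iter(f"{{{W}}}br")
        )
        if has_break and text.strip():
            hits.append((index, text))
    return hits


# Minimal stand-in for a DOCX: paragraph 0 ends with a page break,
# paragraph 1 holds a page break on its own, paragraph 2 is plain text.
SAMPLE_XML = f"""<w:document xmlns:w="{W}"><w:body>
<w:p><w:r><w:t>End of chapter one.</w:t><w:br w:type="page"/></w:r></w:p>
<w:p><w:r><w:br w:type="page"/></w:r></w:p>
<w:p><w:r><w:t>Chapter 2</w:t></w:r></w:p>
</w:body></w:document>"""

sample = io.BytesIO()
with zipfile.ZipFile(sample, "w") as zf:
    zf.writestr("word/document.xml", SAMPLE_XML)
sample.seek(0)

found = paragraphs_with_inline_page_break(sample)
print(found)
```

For a real file, pass its path instead of the in-memory buffer, for example `paragraphs_with_inline_page_break("document.docx")`.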
09-21-2022, 08:54 AM | #10 |
Member
Posts: 11
Karma: 10
Join Date: Oct 2021
Device: none
|
I tried it and it worked, thank you. I don't understand why the PDF and EPUB results differ, but at least I have something to work with. Thanks.
|
09-21-2022, 09:28 AM | #11 | |
null operator (he/him)
Posts: 20,935
Karma: 27620688
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
When a page break is created in Word, it is always inserted in a new paragraph. The only way I know to get it appended to the end of the previous paragraph is to rub out the pilcrow at the end of the previous paragraph, and why would you do that? BR |
|