Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Conversion

Notices

Reply
 
Thread Tools Search this Thread
Old 11-08-2021, 06:00 AM   #1
ydonse
Member
ydonse began at the beginning.
 
Posts: 11
Karma: 10
Join Date: Oct 2021
Device: none
PDF removes chapter breaks

Hello !

I have an issue with my chapter's page breaks.
I use this command line for epub conversion :
PHP Code:
/usr/bin/xvfb-run --auto-servernum /usr/bin/ebook-convert mydoc.docx mydoc.epub --cover mycover.png  --insert-blank-line --remove-paragraph-spacing --remove-paragraph-spacing-indent-size ---preserve-cover-aspect-ratio --embed-all-fonts --epub-version 3 2>&1
i use the command line for PDF conversion :
PHP Code:
/usr/bin/xvfb-run --auto-servernum /usr/bin/ebook-convert mydoc.docx mydoc.pdf --cover mycover.png  --margin-bottom 85 --margin-top 99  --preserve-cover-aspect-ratio 2>&1
My epub has no issue and has its page break before chapters. However the breaks have disappeared from the PDF version.

Using conversion through Calibre software works, but I need it to work through command line.

I am searching through options guidelines without success

Edit : I am using Calibre 5.30

Last edited by ydonse; 11-08-2021 at 06:02 AM.
ydonse is offline   Reply With Quote
Old 11-08-2021, 07:15 AM   #2
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 43,866
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
The command line and the GUI use exactly the same conversion code if the versions of calibre are the same. just set the same options and you will get the same result. You can check the options the GUI uses in the job log.
kovidgoyal is offline   Reply With Quote
Old 11-08-2021, 08:46 AM   #3
ydonse
Member
ydonse began at the beginning.
 
Posts: 11
Karma: 10
Join Date: Oct 2021
Device: none
Thank you for your response Kovid, and thank you for your great work !

In my command line response I have these logs :
" 1 => 'Conversion options changed from defaults:',
2 => ' margin_top: 99.0',
3 => ' cover: mypath.png',
4 => ' verbose: 1',
5 => ' margin_bottom: 85.0',
6 => ' preserve_cover_aspect_ratio: True',"

And in the soft I have these :
"Conversion options changed from defaults:
pdf_serif_family: 'Liberation Serif'
cover: '/tmp/calibre_5.31.1_tmp_xxrib_kp/lfbxxtgd.jpeg'
pdf_mono_family: 'Liberation Mono'
output_profile: 'generic_eink'
read_metadata_from_opf: '/tmp/calibre_5.31.1_tmp_xxrib_kp/qy1i8fel.opf'
pdf_sans_family: 'Liberation Sans'
verbose: 2
debug_pipeline: '/home/librinova/Documents/debug'"

While they are not exactly the same I don't see any options that could lead to a difference in the chapter page breaks. Moreover, in both logs Chapter is set on pagebreak.

I'll continue searching, thanks again for your time.
ydonse is offline   Reply With Quote
Old 11-08-2021, 11:47 AM   #4
Quoth
the rook, bossing Never.
Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.
 
Quoth's Avatar
 
Posts: 11,171
Karma: 85874891
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper11
Other alternatives:
"Print" on the Calibre Viewer works well to create a PDF file. It only creates a PDF and can add page numbers.

Best option is format and page size, header, footer, page numbers, contents with page numbers etc in LO Writer and Export a PDF. Better than most versions of Word for PDF creation and free. Gives publisher quality PDF assuming the page format, headers, footers, styles, image resolutions etc are OK. There is even an option on export to resample all images to 300 dpi (or other desired resolution).

Since you HAVE a docx, I'd only use Calibre to make an epub, and then convert epub to mobi, azw3, kfx, kepub, lrf, pdb (both sorts), not a PDF.
Quoth is offline   Reply With Quote
Old 09-20-2022, 11:43 AM   #5
ydonse
Member
ydonse began at the beginning.
 
Posts: 11
Karma: 10
Join Date: Oct 2021
Device: none
Hello again !

I still have the issue but now directly in the GUI.
When I convert a docx to epub it keeps the page breaks.
When I convert a docx to pdf, the page breaks are gone.
However, when I convert the epub to pdf it works. I don't understand why.

edit : i'm on version 5.44

Last edited by ydonse; 09-20-2022 at 11:44 AM. Reason: precise calibre version
ydonse is offline   Reply With Quote
Old 09-20-2022, 07:13 PM   #6
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 43,866
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
https://www.mobileread.com/forums/sh...d.php?t=186697
kovidgoyal is offline   Reply With Quote
Old 09-20-2022, 07:48 PM   #7
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 20,587
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Quote:
Originally Posted by ydonse View Post
Hello again !

I still have the issue but now directly in the GUI.
When I convert a docx to epub it keeps the page breaks.
When I convert a docx to pdf, the page breaks are gone.
However, when I convert the epub to pdf it works. I don't understand why.

edit : i'm on version 5.44
If you have a DOCX and you want a PDF, open the DOCX in Microsoft Word or LibreOffice Writer and save it as a PDF.

BR
BetterRed is online now   Reply With Quote
Old 09-21-2022, 05:54 AM   #8
ydonse
Member
ydonse began at the beginning.
 
Posts: 11
Karma: 10
Join Date: Oct 2021
Device: none
Quote:
Originally Posted by BetterRed View Post
If you have a DOCX and you want a PDF, open the DOCX in Microsoft Word or LibreOffice Writer and save it as a PDF.
Thank you but that is not an option, I need to use Calibre.


@Kovid :

I attached the document.docx (an example) I am trying to convert, the defective output.pdf and the epub which is fine. The options are the default ones.

Here is the conversion log to pdf :

Code:
Conversion du livre 1 sur 1 (extract(1))
Conversion options changed from defaults:
  pdf_serif_family: 'Liberation Serif'
  pdf_mono_family: 'Liberation Mono'
  output_profile: 'generic_eink'
  verbose: 2
  read_metadata_from_opf: '/tmp/calibre_5.44.0_tmp_jylpjcwp/s42l7phw.opf'
  pdf_sans_family: 'Liberation Sans'
Resolved conversion options
calibre version: 5.44.0
{'asciiize': False,
 'author_sort': None,
 'authors': None,
 'base_font_size': 0.0,
 'book_producer': None,
 'change_justification': 'original',
 'chapter': "//*[((name()='h1' or name()='h2') and re:test(., "
            "'\\s*((chapter|book|section|part)\\s+)|((prolog|prologue|epilogue)(\\s+|$))', "
            "'i')) or @class = 'chapter']",
 'chapter_mark': 'pagebreak',
 'comments': None,
 'cover': None,
 'custom_size': None,
 'debug_pipeline': None,
 'dehyphenate': True,
 'delete_blank_paragraphs': True,
 'disable_font_rescaling': False,
 'docx_inline_subsup': False,
 'docx_no_cover': False,
 'docx_no_pagebreaks_between_notes': False,
 'duplicate_links_in_toc': False,
 'embed_all_fonts': False,
 'embed_font_family': None,
 'enable_heuristics': False,
 'expand_css': False,
 'extra_css': None,
 'filter_css': '',
 'fix_indents': True,
 'font_size_mapping': None,
 'format_scene_breaks': True,
 'html_unwrap_factor': 0.4,
 'input_encoding': None,
 'input_profile': <calibre.customize.profiles.InputProfile object at 0x7f94d8c39070>,
 'insert_blank_line': False,
 'insert_blank_line_size': 0.5,
 'insert_metadata': False,
 'isbn': None,
 'italicize_common_cases': True,
 'keep_ligatures': False,
 'language': None,
 'level1_toc': None,
 'level2_toc': None,
 'level3_toc': None,
 'line_height': 0.0,
 'linearize_tables': False,
 'margin_bottom': 5.0,
 'margin_left': 5.0,
 'margin_right': 5.0,
 'margin_top': 5.0,
 'markup_chapter_headings': True,
 'max_toc_links': 50,
 'minimum_line_height': 120.0,
 'no_chapters_in_toc': False,
 'no_inline_navbars': False,
 'output_profile': <calibre.customize.profiles.GenericEink object at 0x7f94d8c39370>,
 'page_breaks_before': '/',
 'paper_size': 'letter',
 'pdf_add_toc': False,
 'pdf_default_font_size': 20,
 'pdf_footer_template': None,
 'pdf_header_template': None,
 'pdf_hyphenate': False,
 'pdf_mark_links': False,
 'pdf_mono_family': 'Liberation Mono',
 'pdf_mono_font_size': 16,
 'pdf_odd_even_offset': 0.0,
 'pdf_page_margin_bottom': 72.0,
 'pdf_page_margin_left': 72.0,
 'pdf_page_margin_right': 72.0,
 'pdf_page_margin_top': 72.0,
 'pdf_page_number_map': None,
 'pdf_page_numbers': False,
 'pdf_sans_family': 'Liberation Sans',
 'pdf_serif_family': 'Liberation Serif',
 'pdf_standard_font': 'serif',
 'pdf_use_document_margins': False,
 'prefer_metadata_cover': False,
 'preserve_cover_aspect_ratio': False,
 'pretty_print': False,
 'pubdate': None,
 'publisher': None,
 'rating': None,
 'read_metadata_from_opf': '/tmp/calibre_5.44.0_tmp_jylpjcwp/s42l7phw.opf',
 'remove_fake_margins': True,
 'remove_first_image': False,
 'remove_paragraph_spacing': False,
 'remove_paragraph_spacing_indent_size': 1.5,
 'renumber_headings': True,
 'replace_scene_breaks': '',
 'search_replace': '[]',
 'series': None,
 'series_index': None,
 'smarten_punctuation': False,
 'sr1_replace': None,
 'sr1_search': None,
 'sr2_replace': None,
 'sr2_search': None,
 'sr3_replace': None,
 'sr3_search': None,
 'start_reading_at': None,
 'subset_embedded_fonts': False,
 'tags': None,
 'timestamp': None,
 'title': None,
 'title_sort': None,
 'toc_filter': None,
 'toc_threshold': 6,
 'toc_title': None,
 'transform_css_rules': '[]',
 'transform_html_rules': '[]',
 'uncompressed_pdf': False,
 'unit': 'inch',
 'unsmarten_punctuation': False,
 'unwrap_lines': True,
 'use_auto_toc': False,
 'use_profile_size': False,
 'verbose': 2}
InputFormatPlugin: DOCX Input running
on /tmp/calibre_5.44.0_tmp_jylpjcwp/t_j8ms92.docx
Converting Word markup to HTML
Converting styles to CSS
Cleaning up redundant markup generated by Word
Generating Table of Contents from headings
Parsing all content...
Parsing index.html ...
Initial parse failed, using more forgiving parsers
Parsing index.html as HTML
Parsing docx.css ...
Reading TOC from NCX...
Merging user specified metadata...
Detecting structure...
Flattening CSS and remapping font sizes...
Source base font size is 12.00000pt
Removing fake margins...
Found 60 items of level: p_1
p_1  left margin stats: Counter({'0': 60})
p_1  right margin stats: Counter({'0': 60})
Cleaning up manifest...
Trimming unused files from manifest...
Creating PDF Output...
Converting input as a text based book...
Merged 2 instances of LiberationSerif reducing size from 15.9 KB to 15.5 KB
Merged 2 instances of Carlito reducing size from 4.3 KB to 3.5 KB
Merged 2 instances of LiberationSerif-Bold reducing size from 7.9 KB to 7.4 KB
PDF output written to /tmp/calibre_5.44.0_tmp_jylpjcwp/9q_c2az6.pdf
Attached Files
File Type: docx extract(1).docx (29.3 KB, 88 views)
File Type: pdf Output.pdf (44.4 KB, 78 views)
File Type: epub extract(1) - Yohann Donse.epub (32.4 KB, 83 views)
ydonse is offline   Reply With Quote
Old 09-21-2022, 06:35 AM   #9
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 20,587
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Try putting the page breaks in a paragraph by themselves rather than at the end of a paragraph. See attachments, the PDF was created in the GIU from the modified DOCX.

BR
Attached Files
File Type: docx XX extract(1) - Yohann Donse.docx (29.4 KB, 251 views)
File Type: pdf XX extract(1) - Yohann Donse.pdf (57.2 KB, 88 views)
BetterRed is online now   Reply With Quote
Old 09-21-2022, 08:54 AM   #10
ydonse
Member
ydonse began at the beginning.
 
Posts: 11
Karma: 10
Join Date: Oct 2021
Device: none
I tried and it worked, thank you. I don't understand why there is a difference between the pdf and epub result but at least I have something to work with, thanks.
ydonse is offline   Reply With Quote
Old 09-21-2022, 09:28 AM   #11
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 20,587
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Quote:
Originally Posted by ydonse View Post
I tried and it worked, thank you. I don't understand why there is a difference between the pdf and epub result…
Probably because the EPUB Output and the PDF Output plugins are very different - an EPUB is equivalent to a continuous scroll, a PDF is equivalent to a paged book.

When a page break is created in Word it is always inserted in a new para. The only way I know to get it appended to end of the previous para is to rubout the pilcrow at the end of the previous para - and why would you do that.

BR
BetterRed is online now   Reply With Quote
Reply

Tags
chapter breaks, conversion, epub 3, options, pdf


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
xpath to insert chapter breaks - but chapter name cut off ? Rob557 Conversion 2 03-06-2014 06:59 AM
Calibre removes pages breaks from mobi djprescott Conversion 2 02-12-2013 05:24 PM
MOBI -> EPUB conversion removes scene breaks tlangner Conversion 7 01-26-2013 12:37 AM
Combining several PDFs into one PDF with chapter breaks between each one kerrypolka Conversion 0 10-21-2012 08:02 AM
Calibre removes Sigal's created chapter breaks Themus Calibre 6 11-08-2011 09:35 PM


All times are GMT -4. The time now is 07:44 PM.


MobileRead.com is a privately owned, operated and funded community.