Device: Kobo Touch
1. Added Post Captain - Patrick O'Brian.mobi to Calibre, removing the DRM with a plugin.
2. Converted to EPUB, all OK. Log:
Spoiler :
'duplicate_links_in_toc': False,
'enable_heuristics': False,
'epub_flatten': False,
'extra_css': None,
'extract_to': None,
'filter_css': u'',
'fix_indents': True,
'flow_size': 260,
'font_size_mapping': None,
'format_scene_breaks': True,
'html_unwrap_factor': 0.4,
'input_encoding': None,
'input_profile': <calibre.customize.profiles.InputProfile object at 0x03BB3F30>,
'insert_blank_line': False,
'insert_blank_line_size': 0.5,
'insert_metadata': False,
'isbn': None,
'italicize_common_cases': True,
'keep_ligatures': False,
'language': None,
'level1_toc': None,
'level2_toc': None,
'level3_toc': None,
'line_height': 0.0,
'linearize_tables': False,
'margin_bottom': 5.0,
'margin_left': 5.0,
'margin_right': 5.0,
'margin_top': 5.0,
'markup_chapter_headings': True,
'max_toc_links': 50,
'minimum_line_height': 120.0,
'no_chapters_in_toc': False,
'no_default_epub_cover': False,
'no_inline_navbars': False,
'no_svg_cover': False,
'output_profile': <calibre.customize.profiles.KoboReaderOutput object at 0x03B3E2F0>,
'page_breaks_before': u"//*[name()='h1' or name()='h2']",
'prefer_metadata_cover': False,
'preserve_cover_aspect_ratio': False,
'pretty_print': True,
'pubdate': None,
'publisher': None,
'rating': None,
'read_metadata_from_opf': u'C:\\Users\\Michael\\AppData\\Local\\Temp\\calibre_0.8.63_tmp_cqzvw_\\qgzd9k.opf',
'remove_fake_margins': True,
'remove_first_image': False,
'remove_paragraph_spacing': False,
'remove_paragraph_spacing_indent_size': 1.5,
'renumber_headings': True,
'replace_scene_breaks': u'',
'search_replace': '[]',
'series': None,
'series_index': None,
'smarten_punctuation': False,
'sr1_replace': None,
'sr1_search': None,
'sr2_replace': None,
'sr2_search': None,
'sr3_replace': None,
'sr3_search': None,
'tags': None,
'timestamp': None,
'title': None,
'title_sort': None,
'toc_filter': None,
'toc_threshold': 6,
'unsmarten_punctuation': False,
'unwrap_lines': True,
'use_auto_toc': False,
'verbose': 2}
InputFormatPlugin: MOBI Input running
on C:\Users\Michael\AppData\Local\Temp\calibre_0.8.63_tmp_cqzvw_\ucjubf.mobi
Extracting text...
Adding anchors...
Extracting images...
Cleaning up HTML...
Parsing HTML...
Converting style information to CSS...
Creating OPF...
Parsing all content...
Parsing Post_Captain.html ...
Forcing Post_Captain.html into XHTML namespace
Parsing styles.css ...
Reading TOC from NCX...
Merging user specified metadata...
Detecting structure...
Flattening CSS and remapping font sizes...
Source base font size is 12.00000pt
Removing fake margins...
Found 16 items of level: div_1
Found 32 items of level: div_2
Found 3173 items of level: p_2
div_1 left margin stats: Counter({u'': 16})
div_1 right margin stats: Counter({u'': 16})
div_2 left margin stats: Counter()
div_2 right margin stats: Counter()
Negative text indent detected at level p_2, ignoring this level
Cleaning up manifest...
Trimming unused files from manifest...
Trimming u'images/00001.jpg' from manifest
Trimming u'images/00002.jpg' from manifest
Creating EPUB Output...
Rescaling image from 550x825 to 457x686 cover.jpeg
Splitting markup on page breaks and flow limits, if any...
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Looking for large trees in Post_Captain.html...
No large trees found
Split into 16 parts
Removing anchor from TOC href: Post_Captain_split_001.html#filepos671
Removing anchor from TOC href: Post_Captain_split_002.html#filepos55498
Removing anchor from TOC href: Post_Captain_split_003.html#filepos105608
Removing anchor from TOC href: Post_Captain_split_004.html#filepos177732
Removing anchor from TOC href: Post_Captain_split_005.html#filepos234556
Removing anchor from TOC href: Post_Captain_split_006.html#filepos308487
Removing anchor from TOC href: Post_Captain_split_007.html#filepos414949
Removing anchor from TOC href: Post_Captain_split_008.html#filepos517846
Removing anchor from TOC href: Post_Captain_split_009.html#filepos619013
Removing anchor from TOC href: Post_Captain_split_010.html#filepos689673
Removing anchor from TOC href: Post_Captain_split_011.html#filepos773169
Removing anchor from TOC href: Post_Captain_split_012.html#filepos826718
Removing anchor from TOC href: Post_Captain_split_013.html#filepos928967
Removing anchor from TOC href: Post_Captain_split_014.html#filepos956797
EPUB output written to C:\Users\Michael\AppData\Local\Temp\calibre_0.8.63_tmp_cqzvw_\b84tbx.epub
3. Converted to EPUB again, this time using Search and Replace within the conversion. Log:
Spoiler :
Convert book 1 of 1 (Post Captain)
Resolved conversion options
calibre version: 0.8.63
{'asciiize': False,
'author_sort': None,
'authors': None,
'base_font_size': 0.0,
'book_producer': None,
'change_justification': u'original',
'chapter': u"//*[((name()='h1' or name()='h2') and re:test(., '\\s*((chapter|book|section|part)\\s+)|((prolog|prologue|epilogue)(\\s+|$))', 'i')) or @class = 'chapter']",
'chapter_mark': u'pagebreak',
'comments': None,
'cover': u'C:\\Users\\Michael\\AppData\\Local\\Temp\\calibre_0.8.63_tmp_cqzvw_\\zwfehu.jpeg',
'debug_pipeline': None,
'dehyphenate': True,
'delete_blank_paragraphs': True,
'disable_font_rescaling': False,
'dont_split_on_page_breaks': False,
'duplicate_links_in_toc': False,
'enable_heuristics': False,
'epub_flatten': False,
'extra_css': None,
'extract_to': None,
'filter_css': u'',
'fix_indents': True,
'flow_size': 260,
'font_size_mapping': None,
'format_scene_breaks': True,
'html_unwrap_factor': 0.4,
'input_encoding': None,
'input_profile': <calibre.customize.profiles.InputProfile object at 0x039D3F30>,
'insert_blank_line': False,
'insert_blank_line_size': 0.5,
'insert_metadata': False,
'isbn': None,
'italicize_common_cases': True,
'keep_ligatures': False,
'language': None,
'level1_toc': None,
'level2_toc': None,
'level3_toc': None,
'line_height': 0.0,
'linearize_tables': False,
'margin_bottom': 5.0,
'margin_left': 5.0,
'margin_right': 5.0,
'margin_top': 5.0,
'markup_chapter_headings': True,
'max_toc_links': 50,
'minimum_line_height': 120.0,
'no_chapters_in_toc': False,
'no_default_epub_cover': False,
'no_inline_navbars': False,
'no_svg_cover': False,
'output_profile': <calibre.customize.profiles.KoboReaderOutput object at 0x0395E2F0>,
'page_breaks_before': u"//*[name()='h1' or name()='h2']",
'prefer_metadata_cover': False,
'preserve_cover_aspect_ratio': False,
'pretty_print': True,
'pubdate': None,
'publisher': None,
'rating': None,
'read_metadata_from_opf': u'C:\\Users\\Michael\\AppData\\Local\\Temp\\calibre_0.8.63_tmp_cqzvw_\\ki39bm.opf',
'remove_fake_margins': True,
'remove_first_image': False,
'remove_paragraph_spacing': False,
'remove_paragraph_spacing_indent_size': 1.5,
'renumber_headings': True,
'replace_scene_breaks': u'',
'search_replace': '[["<div.*?>", ""], ["</div>", ""]]',
'series': None,
'series_index': None,
'smarten_punctuation': False,
'sr1_replace': None,
'sr1_search': None,
'sr2_replace': None,
'sr2_search': None,
'sr3_replace': None,
'sr3_search': None,
'tags': None,
'timestamp': None,
'title': None,
'title_sort': None,
'toc_filter': None,
'toc_threshold': 6,
'unsmarten_punctuation': False,
'unwrap_lines': True,
'use_auto_toc': False,
'verbose': 2}
InputFormatPlugin: MOBI Input running
on C:\Users\Michael\AppData\Local\Temp\calibre_0.8.63_tmp_cqzvw_\gluoda.mobi
Extracting text...
Adding anchors...
Extracting images...
Cleaning up HTML...
Parsing HTML...
Converting style information to CSS...
Creating OPF...
Parsing all content...
Parsing Post_Captain.html ...
Forcing Post_Captain.html into XHTML namespace
Parsing styles.css ...
Reading TOC from NCX...
Merging user specified metadata...
Detecting structure...
Flattening CSS and remapping font sizes...
Source base font size is 12.00000pt
Removing fake margins...
Found 3173 items of level: p_1
Negative text indent detected at level p_1, ignoring this level
Cleaning up manifest...
Trimming unused files from manifest...
Trimming u'images/00001.jpg' from manifest
Trimming u'images/00002.jpg' from manifest
Creating EPUB Output...
Rescaling image from 550x825 to 457x686 cover.jpeg
Splitting markup on page breaks and flow limits, if any...
Looking for large trees in Post_Captain.html...
Found large tree #0
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[1598]
Split tree still too large: 485 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[801]
Committed sub-tree #1 (251 KB)
Committed sub-tree #2 (234 KB)
Split tree still too large: 489 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[798]
Committed sub-tree #3 (244 KB)
Committed sub-tree #4 (245 KB)
Split into 4 parts
EPUB output written to C:\Users\Michael\AppData\Local\Temp\calibre_0.8.63_tmp_cqzvw_\1bvxhh.epub
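Reading the second log, the 4 parts look like the result of size-based splitting against 'flow_size': 260 rather than splitting on page breaks: the committed sub-trees are 251, 234, 244 and 245 KB (about 974 KB in total) across 3173 paragraphs. The snippet below is only a rough sketch of that kind of halving, not calibre's actual code. It assumes every paragraph is the same size and splits by paragraph count, which is why its split points differ slightly from the ones in the log; split_by_size is a made-up helper name.

```python
def split_by_size(sizes, limit):
    """Halve a list of paragraph sizes (in KB) until every part totals <= limit."""
    if sum(sizes) <= limit or len(sizes) < 2:
        return [sizes]
    mid = len(sizes) // 2
    return split_by_size(sizes[:mid], limit) + split_by_size(sizes[mid:], limit)


# Figures taken from the log: 3173 paragraphs, sub-trees of 251 + 234 + 244 + 245 KB.
TOTAL_KB = 251 + 234 + 244 + 245
PARAGRAPH_COUNT = 3173
FLOW_SIZE_KB = 260  # the 'flow_size' option shown in the log

# Assumption for illustration only: all paragraphs have the same size.
paragraphs = [TOTAL_KB / PARAGRAPH_COUNT] * PARAGRAPH_COUNT

parts = split_by_size(paragraphs, FLOW_SIZE_KB)
print(len(parts))
print([round(sum(part)) for part in parts])
```

Under these assumptions the sketch also ends up with 4 parts, each under 260 KB, which matches "Split into 4 parts" in the log.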
***** Problem: I end up with 4 combined HTML files and the chapters do not work!
How can I stop the HTML files from being combined?
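For reference, the two search-and-replace pairs from step 3 can be tried on a small sample outside Calibre. The sketch below applies them with plain re.sub calls, which is an assumption about how the option behaves. The sample markup is invented, since the real Post_Captain.html is not shown here. The logs do not say which elements carried the page breaks in the first run. One possible reading is that they sat on the div elements: the first log reports 16 div_1 items and 16 page-break splits, and the second log reports neither.

```python
import re

# The two (search, replace) pairs from the 'search_replace' option in the log.
PAIRS = [("<div.*?>", ""), ("</div>", "")]

# Hypothetical sample markup, invented for illustration only.
SAMPLE = (
    '<div class="pagebreak"><p>CHAPTER ONE</p><p>Text.</p></div>'
    '<div class="pagebreak"><p>CHAPTER TWO</p><p>More.</p></div>'
)


def apply_pairs(html, pairs):
    """Apply each regex pair in order using re.sub."""
    for pattern, replacement in pairs:
        html = re.sub(pattern, replacement, html)
    return html


stripped = apply_pairs(SAMPLE, PAIRS)
print(stripped)
```

On this sample every div wrapper disappears, along with any class or style attached to it, and only the p elements remain.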
Many thanks
Michael
Last edited by theducks; 08-04-2012 at 06:46 PM.
Reason: wrap HUGE paste in spoiler