Old 09-22-2012, 05:41 AM   #1
Boronian
Device: Sony PRS-T1
Conversion problem: text is messy

Hello,

I have a problem with a mobi to epub conversion. I used calibre's default settings to convert my mobi book to epub, but in two places the text is garbled. The attached screenshots show what I mean.
The first screen is the mobi file, the second one is the converted epub. The key text is highlighted.
What can I do to solve this problem?
Thanks in advance for your help!


And here is the log:

Spoiler:
Converting book 1 of 1 (Hegemony)
Resolved conversion options
calibre version: 0.8.70
{'asciiize': False,
'author_sort': None,
'authors': None,
'base_font_size': 0.0,
'book_producer': None,
'change_justification': u'original',
'chapter': u"//*[((name()='h1' or name()='h2') and re:test(., '\\s*((chapter|book|section|part)\\s+)|((prolog|prologue|epilogue)(\\s+|$))', 'i')) or @class = 'chapter']",
'chapter_mark': u'pagebreak',
'comments': None,
'cover': u'C:\\Users\\Boronian\\AppData\\Local\\Temp\\calibre_0.8.70_tmp_ygz5an\\enovmj.jpeg',
'debug_pipeline': None,
'dehyphenate': True,
'delete_blank_paragraphs': True,
'disable_font_rescaling': False,
'dont_split_on_page_breaks': False,
'duplicate_links_in_toc': False,
'enable_heuristics': False,
'epub_flatten': False,
'extra_css': None,
'extract_to': None,
'filter_css': u'',
'fix_indents': True,
'flow_size': 260,
'font_size_mapping': None,
'format_scene_breaks': True,
'html_unwrap_factor': 0.4,
'input_encoding': None,
'input_profile': <calibre.customize.profiles.InputProfile object at 0x03F64410>,
'insert_blank_line': False,
'insert_blank_line_size': 0.5,
'insert_metadata': False,
'isbn': None,
'italicize_common_cases': True,
'keep_ligatures': False,
'language': None,
'level1_toc': u'//h:h2',
'level2_toc': None,
'level3_toc': None,
'line_height': 0.0,
'linearize_tables': False,
'margin_bottom': 5.0,
'margin_left': 5.0,
'margin_right': 5.0,
'margin_top': 5.0,
'markup_chapter_headings': True,
'max_toc_links': 50,
'minimum_line_height': 120.0,
'no_chapters_in_toc': False,
'no_default_epub_cover': False,
'no_inline_navbars': False,
'no_svg_cover': False,
'output_profile': <calibre.customize.profiles.SonyReaderOutput object at 0x03F648B0>,
'page_breaks_before': u"//*[name()='h1' or name()='h2']",
'prefer_metadata_cover': False,
'preserve_cover_aspect_ratio': False,
'pretty_print': True,
'pubdate': None,
'publisher': None,
'rating': None,
'read_metadata_from_opf': u'C:\\Users\\Boronian\\AppData\\Local\\Temp\\calibre_0.8.70_tmp_ygz5an\\5d4lxb.opf',
'remove_fake_margins': True,
'remove_first_image': False,
'remove_paragraph_spacing': False,
'remove_paragraph_spacing_indent_size': 1.5,
'renumber_headings': True,
'replace_scene_breaks': u'',
'search_replace': '[]',
'series': None,
'series_index': None,
'smarten_punctuation': False,
'sr1_replace': None,
'sr1_search': None,
'sr2_replace': None,
'sr2_search': None,
'sr3_replace': None,
'sr3_search': None,
'start_reading_at': None,
'tags': None,
'timestamp': None,
'title': None,
'title_sort': None,
'toc_filter': None,
'toc_threshold': 6,
'unsmarten_punctuation': False,
'unwrap_lines': True,
'use_auto_toc': False,
'verbose': 2}
InputFormatPlugin: MOBI Input running
on C:\Users\Boronian\AppData\Local\Temp\calibre_0.8.70_tmp_ygz5an\scs3nh.azw3
Found KF8 MOBI of type 'standalone'
Extracting text...
KF8 has no metadata Table of Contents
Initial parse failed, using more forgiving parsers
Failed to read inline ToC
Traceback (most recent call last):
File "site-packages\calibre\ebooks\mobi\reader\mobi8.py", line 399, in write_opf
File "site-packages\calibre\ebooks\mobi\reader\mobi8.py", line 433, in read_inline_toc
TypeError: 'lxml.etree.XPath' object does not support indexing

Parsing all content...
Parsing text/part0000.html ...
Referenced file u'text/XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX' not found
Reading TOC from NCX...
Merging user specified metadata...
Detecting structure...
Flattening CSS and remapping font sizes...
Interpreting class and tag selectors case insensitively in the CSS selector: h|P
Interpreting class and tag selectors case insensitively in the CSS selector: h|H1
Interpreting class and tag selectors case insensitively in the CSS selector: h|A
Interpreting class and tag selectors case insensitively in the CSS selector: h|P.western
Interpreting class and tag selectors case insensitively in the CSS selector: h|H1.western
Interpreting class and tag selectors case insensitively in the CSS selector: h|A.western
Source base font size is 12.00000pt
Removing fake margins...
Found 347 items of level: div_1
Found 2728 items of level: p_1
div_1 left margin stats: Counter()
div_1 right margin stats: Counter()
p_1 left margin stats: Counter({u'0': 2728})
p_1 right margin stats: Counter({u'0': 2728})
Cleaning up manifest...
Trimming unused files from manifest...
Trimming u'images/00004.jpeg' from manifest
Creating EPUB Output...
Rescaling image from 531x863 to 462x751 images/00002.gif
Rescaling image from 619x1005 to 462x751 cover.jpeg
Splitting markup on page breaks and flow limits, if any...
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Splitting on page-break
Looking for large trees in text/part0000.html...
No large trees found
Split into 25 parts
This EPUB file has no Table of Contents. Creating a default TOC
EPUB output written to C:\Users\Boronian\AppData\Local\Temp\calibre_0.8.70_tmp_ygz5an\xo84ch.epub
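
A log of this length is easier to triage mechanically. The sketch below is not part of calibre: `find_suspect_lines` and its keyword list are assumptions made up for illustration. It only flags lines containing words such as "failed", "not found", or "Traceback", which in the log above would surface the failed inline ToC read and the missing referenced file. Whether those warnings relate to the garbled text is not something the log alone can confirm.

```python
import re

# Hypothetical keyword list; adjust it for your own logs.
SUSPECT = re.compile(r"traceback|error|failed|not found", re.IGNORECASE)

def find_suspect_lines(log_text):
    """Return (line_number, line) pairs for log lines that hint at a problem."""
    hits = []
    for number, line in enumerate(log_text.splitlines(), start=1):
        if SUSPECT.search(line):
            hits.append((number, line.strip()))
    return hits

# Excerpt copied from the conversion log above.
sample = """Extracting text...
KF8 has no metadata Table of Contents
Initial parse failed, using more forgiving parsers
Failed to read inline ToC
Traceback (most recent call last):
TypeError: 'lxml.etree.XPath' object does not support indexing
Parsing all content...
Referenced file u'text/XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX' not found
Reading TOC from NCX..."""

for number, line in find_suspect_lines(sample):
    print(number, line)
```

To use it on a full log, save the log to a text file and pass the file's contents to `find_suspect_lines`.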
Attached Thumbnails: mobi.png (118.4 KB), epub.png (267.3 KB)
Old 09-22-2012, 06:13 AM   #2
DoctorOhh
Quote:
Originally Posted by Boronian
The first screen is the mobi file, the second one is the converted epub. The key text is highlighted.
I don't think your images show a true before and after. I'm guessing the image from your Kindle for PC view shows the DRM-protected version you purchased from Amazon. Show us the mobi that ended up in calibre. My guess is that whatever tool you used to remove the DRM is responsible for the slight corruption.
Old 09-22-2012, 12:34 PM   #3
Boronian
Thanks for your answer!
Hm, I think that could be possible.
I have now added the mobi file to calibre and opened the azw3 file saved in the calibre library folder with the Kindle software. There I didn't find any corruption. But as soon as I converted the file to epub, I found the corruption there.
I am not entirely sure I did what you suggested, though.
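
One way to pin down where the two files diverge is to compare their text paragraph by paragraph. The sketch below assumes the paragraphs have already been extracted to plain text from both the source file and the converted epub by some other means; `changed_paragraphs` is a hypothetical helper and the sample sentences are made up, not taken from the book in this thread.

```python
import difflib

def changed_paragraphs(source_paragraphs, converted_paragraphs):
    """Return (source, converted) pairs for runs of paragraphs that differ."""
    matcher = difflib.SequenceMatcher(
        a=source_paragraphs, b=converted_paragraphs, autojunk=False
    )
    changes = []
    for tag, a_start, a_end, b_start, b_end in matcher.get_opcodes():
        if tag != "equal":
            changes.append(
                (source_paragraphs[a_start:a_end], converted_paragraphs[b_start:b_end])
            )
    return changes

# Made-up paragraphs for illustration only.
source = ["It was a quiet morning.", "The fleet left at dawn.", "Nobody spoke."]
converted = ["It was a quiet morning.", "The flee t left atdawn.", "Nobody spoke."]

for before, after in changed_paragraphs(source, converted):
    print("source:   ", before)
    print("converted:", after)
```

Only the differing paragraphs are reported, so a book with two damaged passages should yield a short list to inspect by hand.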
Old 09-22-2012, 08:43 PM   #4
DoctorOhh
Quote:
Originally Posted by Boronian
I have now added the mobi file to calibre and opened the azw3 file saved in the calibre library folder with the Kindle software. There I didn't find any corruption. But as soon as I converted the file to epub, I found the corruption there.
I am not entirely sure I did what you suggested, though.
It sounds like you did what I suggested.

Maybe someone else has some ideas.
Old 09-22-2012, 09:03 PM   #5
JSWolf
You are converting your KF8 file to ePub in the worst possible way. You do not want to use Calibre to do an actual conversion. Install the Mobi-Unpack plugin, which lets you pull the files out of the KF8 and create an ePub directly. You can also use Modify ePub to add the cover XML that KF8 doesn't have. Then run the ePub through FlightCrew to check for any errors. Use either Tweak Book or Sigil to fix the errors, and when you are done, you have an ePub that matches the KF8.
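
Before running a full validator, a quick container-level check can catch a badly repacked file. The sketch below is only a sanity check and does not reproduce what Mobi-Unpack or FlightCrew do; `check_epub_container` is a hypothetical helper. It relies on the EPUB container rules that the first zip entry is an uncompressed file named `mimetype` containing `application/epub+zip`, and that `META-INF/container.xml` is present.

```python
import io
import zipfile

def check_epub_container(source):
    """Return a list of container-level problems in an EPUB (empty if none).

    `source` is a file path or a binary file-like object.
    """
    problems = []
    with zipfile.ZipFile(source) as archive:
        entries = archive.infolist()
        if not entries or entries[0].filename != "mimetype":
            problems.append("first entry is not 'mimetype'")
        else:
            if entries[0].compress_type != zipfile.ZIP_STORED:
                problems.append("'mimetype' entry is compressed")
            if archive.read("mimetype") != b"application/epub+zip":
                problems.append("'mimetype' content is not application/epub+zip")
        if "META-INF/container.xml" not in archive.namelist():
            problems.append("META-INF/container.xml is missing")
    return problems

# Build a tiny in-memory archive to exercise the check (not a complete book).
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as archive:
    archive.writestr("mimetype", "application/epub+zip")
    archive.writestr("META-INF/container.xml", "<container/>")

print(check_epub_container(buffer))
```

An empty list means only that the container looks plausible; errors inside the XHTML, CSS, or OPF still need a real validator and an editor such as Sigil.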
Old 09-22-2012, 09:06 PM   #6
DoctorOhh
Quote:
Originally Posted by JSWolf
Either use Tweak Book or Sigil to fix the errors and when you are done, you have an ePub that matches the KF8.
I'm sure everything you said is true, but do you have any idea why his text may have been corrupted?
Old 09-22-2012, 11:56 PM   #7
kovidgoyal
creator of calibre
Impossible to say. You need to attach the mobi in question. https://www.mobileread.com/forums/sho...d.php?t=186697
Old 09-28-2012, 12:48 PM   #8
Boronian
Sorry for the late answer. I had no internet access for a few days.

Many thanks for your help! I installed all the plugins and programs (Modify ePub, Sigil, Mobi-Unpack) and now have a very good epub.

If you are still interested in the mobi, and posting it is OK under the forum's policy on ebooks, I will post it here.

Thanks again!
I found some nice plugins and learned a lot about Calibre.