Old 08-06-2011, 02:12 AM   #1
dm15441
Junior Member
Posts: 5
Karma: 10
Join Date: Aug 2011
Device: Kindle 3
Issues with Chinese text file conversion

Hi,

I have a text file that I successfully converted to mobi in Calibre, but when I open it I get a calibre-parallel.exe error. I then tried converting it to epub instead, but got the error below. It sounds like Sigil might help, but it seems a little beyond my technical skills. Is there an easier option?

I also tried copying and pasting just the beginning of the txt file into an RTF file. That conversion worked, but I only see scrambled characters when I open the result. To clarify: the text file itself looks scrambled because it is a Chinese file, but my other Chinese files, which look the same way and are smaller, became readable after I converted them to mobi in Calibre:

Spoiler:

calibre, version 0.8.11
ERROR: Conversion Error: <b>Failed</b>: Convert book 1 of 1 (哈利波特(1-7全))

Convert book 1 of 1 (哈利波特(1-7全))
Resolved conversion options
calibre version: 0.8.11
{'asciiize': False,
'author_sort': None,
'authors': None,
'base_font_size': 0.0,
'book_producer': None,
'change_justification': u'original',
'chapter': u"//*[((name()='h1' or name()='h2') and re:test(., 'chapter|book|section|part|prologue|epilogue\\s+', 'i')) or @class = 'chapter']",
'chapter_mark': u'pagebreak',
'comments': None,
'cover': None,
'debug_pipeline': None,
'dehyphenate': True,
'delete_blank_paragraphs': True,
'disable_font_rescaling': False,
'dont_split_on_page_breaks': False,
'duplicate_links_in_toc': False,
'enable_heuristics': False,
'epub_flatten': False,
'extra_css': None,
'extract_to': None,
'fix_indents': True,
'flow_size': 260,
'font_size_mapping': None,
'format_scene_breaks': True,
'formatting_type': u'auto',
'html_unwrap_factor': 0.4,
'input_encoding': None,
'input_profile': <calibre.customize.profiles.InputProfile object at 0x05770E90>,
'insert_blank_line': False,
'insert_blank_line_size': 0.5,
'insert_metadata': False,
'isbn': None,
'italicize_common_cases': True,
'keep_ligatures': False,
'language': None,
'level1_toc': None,
'level2_toc': None,
'level3_toc': None,
'line_height': 0.0,
'linearize_tables': False,
'margin_bottom': 5.0,
'margin_left': 5.0,
'margin_right': 5.0,
'margin_top': 5.0,
'markdown_disable_toc': False,
'markup_chapter_headings': True,
'max_toc_links': 50,
'minimum_line_height': 120.0,
'no_chapters_in_toc': False,
'no_default_epub_cover': False,
'no_inline_navbars': False,
'no_svg_cover': False,
'output_profile': <calibre.customize.profiles.KindleOutput object at 0x05779150>,
'page_breaks_before': u"//*[name()='h1' or name()='h2']",
'paragraph_type': u'auto',
'prefer_metadata_cover': False,
'preserve_cover_aspect_ratio': False,
'preserve_spaces': False,
'pretty_print': True,
'pubdate': None,
'publisher': None,
'rating': None,
'read_metadata_from_opf': u'c:\\users\\wypath\\appdata\\local\\temp\\calibre_0.8.11_tmp_jqkijr\\calibre_0.8.11_j0uhwg.opf',
'remove_fake_margins': True,
'remove_first_image': False,
'remove_paragraph_spacing': False,
'remove_paragraph_spacing_indent_size': 1.5,
'renumber_headings': True,
'replace_scene_breaks': u'',
'series': None,
'series_index': None,
'smarten_punctuation': False,
'sr1_replace': None,
'sr1_search': None,
'sr2_replace': None,
'sr2_search': None,
'sr3_replace': None,
'sr3_search': None,
'tags': None,
'timestamp': None,
'title': None,
'title_sort': None,
'toc_filter': None,
'toc_threshold': 6,
'txt_in_remove_indents': False,
'unwrap_lines': True,
'use_auto_toc': False,
'verbose': 2}
InputFormatPlugin: TXT Input running
on c:\users\wypath\appdata\local\temp\calibre_0.8.11_tmp_jqkijr\calibre_0.8.11_qwwffr.txt
Reading text from file...
Detected input encoding as GB2312 with a confidence of 99.0%
Auto detected paragraph type as single
Auto detected formatting as textile
Running text through textile conversion...
Language not specified
Creator not specified
Building file list...
Found files...
HTMLFile:0:a:c:\users\wypath\appdata\local\temp\calibre_0.8.11_tmp_jqkijr\index.html
Normalizing filename cases
Rewriting HTML links
Parsing index.html ...
Initial parse failed:
Traceback (most recent call last):
File "site-packages\calibre\ebooks\oeb\base.py", line 886, in first_pass
File "lxml.etree.pyx", line 2743, in lxml.etree.fromstring (src/lxml/lxml.etree.c:52665)
File "parser.pxi", line 1573, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:79932)
File "parser.pxi", line 1445, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:78709)
File "parser.pxi", line 920, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:75083)
File "parser.pxi", line 564, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:71739)
File "parser.pxi", line 645, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:72614)
File "parser.pxi", line 585, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:71955)
XMLSyntaxError: Opening and ending tag mismatch: 从郁金香花园中悄悄走过 line 548 and p, line 548, column 102

Parsing file 'index.html' as HTML
Forcing index.html into XHTML namespace
Merging user specified metadata...
Detecting structure...
Auto generated TOC with 0 entries.
Flattening CSS and remapping font sizes...
Source base font size is 12.00000pt
Removing fake margins...
Parsing stylesheet.css ...
Found 13240 items of level: p_1
p_1 left margin stats: Counter({u'0': 13240})
p_1 right margin stats: Counter({u'0': 13240})
Cleaning up manifest...
Trimming unused files from manifest...
Creating EPUB Output...
Looking for large trees in index.html...
Found large tree #0
Splitting...
Split point: {http://www.w3.org/1999/xhtml}hr /*/*[2]/*[13239]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[6621]
Split tree still too large: 1674 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[3311]
Split tree still too large: 784 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[1656]
Split tree still too large: 397 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[828]
Committed sub-tree #1 (211 KB)
Committed sub-tree #2 (186 KB)
Split tree still too large: 386 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[829]
Committed sub-tree #3 (186 KB)
Committed sub-tree #4 (200 KB)
Split tree still too large: 890 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[1656]
Split tree still too large: 461 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[829]
Committed sub-tree #5 (232 KB)
Committed sub-tree #6 (229 KB)
Split tree still too large: 429 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[829]
Committed sub-tree #7 (209 KB)
Committed sub-tree #8 (220 KB)
Split tree still too large: 4890 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[3311]
Split tree still too large: 736 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[1656]
Split tree still too large: 393 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[829]
Committed sub-tree #9 (223 KB)
Committed sub-tree #10 (170 KB)
Split tree still too large: 343 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[829]
Committed sub-tree #11 (164 KB)
Committed sub-tree #12 (179 KB)
Split tree still too large: 4154 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[1656]
Split tree still too large: 335 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[829]
Committed sub-tree #13 (161 KB)
Committed sub-tree #14 (174 KB)
Split tree still too large: 3819 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[829]
Committed sub-tree #15 (145 KB)
Split tree still too large: 3673 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[415]
Committed sub-tree #16 (87 KB)
Split tree still too large: 3586 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[208]
Committed sub-tree #17 (43 KB)
Split tree still too large: 3542 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[104]
Committed sub-tree #18 (13 KB)
Split tree still too large: 3529 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[53]
Split tree still too large: 2431 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[27]
Split tree still too large: 1470 KB
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[14]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[15]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[13]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[16]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[12]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[17]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[11]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[18]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[10]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[19]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[9]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[20]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[8]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[21]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[7]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[22]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[6]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[23]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[5]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[24]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[4]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[25]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[3]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[26]
Split tree too small
Splitting...
Split point: {http://www.w3.org/1999/xhtml}p /*/*[2]/*[2]
Split tree too small
Splitting...
Python function terminated unexpectedly
Could not find reasonable point at which to split: index.html Sub-tree size: 1471 KB (Error Code: 1)
Traceback (most recent call last):
File "site.py", line 132, in main
File "site.py", line 109, in run_entry_point
File "site-packages\calibre\utils\ipc\worker.py", line 181, in main
File "site-packages\calibre\gui2\convert\gui_conversion.py", line 31, in gui_convert_override
File "site-packages\calibre\gui2\convert\gui_conversion.py", line 25, in gui_convert
File "site-packages\calibre\ebooks\conversion\plumber.py", line 1057, in run
File "site-packages\calibre\ebooks\epub\output.py", line 169, in convert
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 57, in __call__
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 67, in split_item
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 205, in __init__
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 425, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 425, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 425, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 425, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 425, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 425, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 425, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 425, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 425, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 409, in split_to_size
File "site-packages\calibre\ebooks\oeb\transforms\split.py", line 402, in split_to_size
calibre.ebooks.oeb.transforms.split.SplitError: Could not find reasonable point at which to split: index.html Sub-tree size: 1471 KB


Moderator Notice
wrapped long error in spoiler

Last edited by theducks; 08-06-2011 at 08:35 AM.
Old 08-06-2011, 10:53 AM   #2
ldolse
Wizard
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
Next time, create a new thread describing your problem instead of adding an unrelated issue to a months-old thread.

With text files you sometimes need to define the paragraph structure on the text input panel of the conversion options. By default Calibre tries to auto-detect where paragraphs begin and end, but in your case it failed. Choose one of the other options: single-line paragraphs is the safest, but without seeing your document it's difficult to recommend a specific setting, so read the descriptions and see how they line up with your file.
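
If you would rather script this than click through the GUI, here is a minimal sketch of the same idea using calibre's ebook-convert command-line tool. The helper names (build_convert_command, convert) are made up for illustration, and the flag names are assumed to mirror the 'paragraph_type' and 'input_encoding' options visible in the log above; check `ebook-convert --help` on your version before relying on them.

```python
import subprocess


def build_convert_command(src, dst, paragraph_type="single", input_encoding=None):
    """Build an ebook-convert argument list with an explicit paragraph type.

    Hypothetical helper: the flag names are assumed to match the
    'paragraph_type' and 'input_encoding' options shown in the conversion log.
    """
    cmd = ["ebook-convert", src, dst, "--paragraph-type", paragraph_type]
    if input_encoding:
        # Only pass an encoding when the caller wants to override auto-detection.
        cmd += ["--input-encoding", input_encoding]
    return cmd


def convert(src, dst, **options):
    """Run the conversion; raises CalledProcessError if ebook-convert fails."""
    subprocess.run(build_convert_command(src, dst, **options), check=True)
```

Calling convert("book.txt", "book.epub") would then force single-line paragraphs instead of leaving the choice to auto-detection. The file names here are placeholders.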

The simplest way to unscramble the text encoding is to open the file in a text editor that displays it correctly on your desktop, then re-save it using the UTF-8 encoding, which is what Calibre expects by default.
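
The same re-save can be done with a few lines of Python if a text editor is not handy. This is only a sketch: reencode_to_utf8 is a made-up helper name, and the default source encoding is an assumption based on the log's "Detected input encoding as GB2312" line (gb18030 is a superset of GB2312, so it should read such files).

```python
from pathlib import Path


def reencode_to_utf8(src, dst, source_encoding="gb18030"):
    """Read src using source_encoding and write the same text to dst as UTF-8.

    Raises UnicodeDecodeError if the file is not valid in source_encoding,
    which is a hint that the encoding guess was wrong.
    """
    text = Path(src).read_text(encoding=source_encoding)
    Path(dst).write_text(text, encoding="utf-8")
    return text
```

For example, reencode_to_utf8("book.txt", "book-utf8.txt") writes a UTF-8 copy and leaves the original untouched; the file names are placeholders.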