Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Software > Calibre

Notices

Reply
 
Thread Tools Search this Thread
Old 09-27-2010, 03:16 AM   #1
OMEN
Junior Member
OMEN began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Sep 2010
Device: Kindle 3
Conversion error [PDF >> MOBI]

Firstly,
THANK YOU, this an amazing piece of software.


Secondly,
While converting PDFs to MOBI, 90% of the time I get this error.

Code:
ERROR: Conversion Error: <b>Failed</b>: Convert book 1 of 1 (Phishing Exposed (Syngress, 2005))

Convert book 1 of 1 (Phishing Exposed (Syngress, 2005))
Resolved conversion options
calibre version: 0.7.20
{'asciiize': False,
 'author_sort': None,
 'authors': None,
 'base_font_size': 0.0,
 'book_producer': None,
 'change_justification': u'original',
 'chapter': u"//*[((name()='h1' or name()='h2') and re:test(., 'chapter|book|section|part|prologue|epilogue\\s+', 'i')) or @class = 'chapter']",
 'chapter_mark': u'pagebreak',
 'comments': None,
 'cover': None,
 'debug_pipeline': None,
 'disable_font_rescaling': False,
 'dont_compress': False,
 'extra_css': None,
 'font_size_mapping': None,
 'footer_regex': u'(?i)(?<=<hr>)((\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?\\d+<br>\\s*.*?\\s*)|(\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?.*?<br>\\s*\\d+))(?=<br>)',
 'header_regex': u'(?i)(?<=<hr>)((\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?\\d+<br>\\s*.*?\\s*)|(\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?.*?<br>\\s*\\d+))(?=<br>)',
 'html_unwrap_factor': 0.40000000000000002,
 'input_encoding': None,
 'input_profile': <calibre.customize.profiles.InputProfile object at 0x046DC090>,
 'insert_blank_line': False,
 'insert_metadata': False,
 'isbn': None,
 'keep_ligatures': False,
 'language': None,
 'level1_toc': None,
 'level2_toc': None,
 'level3_toc': None,
 'line_height': 0.0,
 'linearize_tables': True,
 'margin_bottom': 5.0,
 'margin_left': 5.0,
 'margin_right': 5.0,
 'margin_top': 5.0,
 'max_toc_links': 50,
 'new_pdf_engine': False,
 'no_chapters_in_toc': False,
 'no_images': False,
 'no_inline_navbars': True,
 'no_inline_toc': False,
 'output_profile': <calibre.customize.profiles.KindleOutput object at 0x046DC370>,
 'page_breaks_before': u"//*[name()='h1' or name()='h2']",
 'personal_doc': u'[PDOC]',
 'prefer_author_sort': False,
 'prefer_metadata_cover': False,
 'preprocess_html': False,
 'pretty_print': False,
 'pubdate': None,
 'publisher': None,
 'rating': None,
 'read_metadata_from_opf': 'd:\\temp\\calibre_0.7.20_tmp_hqrued\\calibre_0.7.20_k3np7v.opf',
 'remove_first_image': False,
 'remove_footer': True,
 'remove_header': True,
 'remove_paragraph_spacing': True,
 'remove_paragraph_spacing_indent_size': 1.5,
 'rescale_images': False,
 'series': None,
 'series_index': None,
 'smarten_punctuation': False,
 'tags': None,
 'timestamp': None,
 'title': None,
 'title_sort': None,
 'toc_filter': None,
 'toc_threshold': 6,
 'toc_title': None,
 'unwrap_factor': 0.45000000000000001,
 'use_auto_toc': True,
 'verbose': 2}
InputFormatPlugin: PDF Input running
on E:\users\OMEN\Documents\BOOKS\calibre\Unknown\Phishing Exposed (Syngress, 2005) (43)\Phishing Exposed (Syngress, 2005) - Unknown.pdf
Converting file to html...
pdftohtml log:

Retrieving document metadata...
Generating manifest...
Rendering manifest...
Parsing all content...
Parsing index.html ...
Failed to parse content in index.html
Traceback (most recent call last):
  File "site-packages\calibre\ebooks\oeb\reader.py", line 159, in _manifest_prune_invalid
  File "site-packages\calibre\ebooks\oeb\base.py", line 1060, in fget
  File "site-packages\calibre\ebooks\oeb\base.py", line 789, in _parse_xhtml
  File "site-packages\calibre\ebooks\conversion\preprocess.py", line 431, in __call__
  File "site-packages\calibre\ebooks\conversion\preprocess.py", line 152, in __call__
  File "site-packages\calibre\ebooks\conversion\preprocess.py", line 133, in dehyphenate
  File "re.py", line 190, in compile
  File "re.py", line 245, in _compile
error: unbalanced parenthesis

Spine item 'id1' not found
Python function terminated unexpectedly
  Spine is empty (Error Code: 1)
Traceback (most recent call last):
  File "site.py", line 103, in main
  File "site.py", line 85, in run_entry_point
  File "site-packages\calibre\utils\ipc\worker.py", line 99, in main
  File "site-packages\calibre\gui2\convert\gui_conversion.py", line 24, in gui_convert
  File "site-packages\calibre\ebooks\conversion\plumber.py", line 841, in run
  File "site-packages\calibre\ebooks\conversion\plumber.py", line 968, in create_oebbook
  File "site-packages\calibre\ebooks\oeb\reader.py", line 72, in __call__
  File "site-packages\calibre\ebooks\oeb\reader.py", line 594, in _all_from_opf
  File "site-packages\calibre\ebooks\oeb\reader.py", line 289, in _spine_from_opf
calibre.ebooks.oeb.base.OEBError: Spine is empty

AFAIK >> this bug >> http://bugs.calibre-ebook.com/ticket/6975

Windows 7 Pro,
Calibre 0.7.20

Last edited by OMEN; 09-27-2010 at 03:18 AM.
OMEN is offline   Reply With Quote
Old 09-27-2010, 03:32 AM   #2
OMEN
Junior Member
OMEN began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Sep 2010
Device: Kindle 3
ok, i have downgraded to 0.7.19 and conversion works, the formatting is slightly messy but it works.
OMEN is offline   Reply With Quote
Advert
Old 09-27-2010, 05:06 AM   #3
ldolse
Wizard
ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.ldolse is an accomplished Snipe hunter.
 
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
0.7.19 and 0.7.20 will be about the same level of messy. The only difference with pdf conversion in 0.7.20 is that it takes each hyphenated word and does a simple analysis. If the word exists in the document without a hyphen then the hyphen is removed. If it can't find a match with a hyphen then the hyphen is retained.

0.7.19 always retains the hyphens, but otherwise formatting should be identical.

The bug is that currently the search for versions of the word without hyphens can match special characters which cause that failure.
ldolse is offline   Reply With Quote
Old 09-27-2010, 12:02 PM   #4
OMEN
Junior Member
OMEN began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Sep 2010
Device: Kindle 3
hmm, so is this a regex syntax problem ?
OMEN is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
PDF to Mobi Conversion rayh Calibre 2 09-24-2010 02:33 AM
pdf conversion error Beau Calibre 5 08-29-2010 01:26 PM
How can I cure this error in mobi to epub conversion? Patricia1066 Calibre 1 03-17-2010 06:56 AM
Conversion Error lrf -> mobi/epub (Calibre 0.6.33) Joerg_Mosthaf Calibre 2 01-13-2010 07:40 AM
Conversion from Mobi to LRF error jessie102 Calibre 2 08-16-2008 12:00 PM


All times are GMT -4. The time now is 07:21 AM.


MobileRead.com is a privately owned, operated and funded community.