This is a continuation of an existing bug:
https://bugs.launchpad.net/calibre/+bug/1285485
I've already removed the DRM from the .AZW3 file, and after doing so it's viewable in calibre. I've tried using calibre to convert the .AZW3 file to TXT (see the results in the spoiler below). I've also tried other online conversion tools; they all error out saying they can't convert the file, but they don't say why. I'm trying to find a way to convert the file to any format that can be imported into a database.
1. Are there some .AZW3 books that cannot be converted to other formats?
2. Is there a way to edit a .AZW3 file?
3. If so, is there something I should look for (to change/remove) to help make it convertible?
4. If someone else is able to convert the file successfully, can you please let me know? Any format that contains either XML or plain text would work.
Thanks!
Spoiler :
calibre 1.25
Mac OS X 10.9.1, 2.93 GHz Intel Core i7 with 20 GB RAM
I am attempting to convert a 26.8 MB .azw3 file to TXT. It has been running for over 48 hours now, and it has shown 47% complete since 1 minute after it started.
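For anyone trying to reproduce this outside the GUI, here is a minimal sketch of running the same kind of conversion through calibre's `ebook-convert` command-line tool with a time limit, so a stuck job gets killed instead of running for days. The helper names `build_convert_command` and `run_with_timeout` are ones I made up for illustration, and it assumes `ebook-convert` is on the PATH.

```python
import subprocess


def build_convert_command(src, dst, verbose=True):
    """Build an ebook-convert command line.

    ebook-convert picks the output format from the extension of dst.
    """
    cmd = ["ebook-convert", src, dst]
    if verbose:
        # -v can be given more than once for more detailed logging.
        cmd += ["-v", "-v"]
    return cmd


def run_with_timeout(cmd, timeout_seconds):
    """Run cmd and return 'ok', 'failed', 'timeout', or 'not-found'."""
    try:
        result = subprocess.run(cmd, timeout=timeout_seconds)
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child process before raising this.
        return "timeout"
    except FileNotFoundError:
        # The executable (e.g. ebook-convert) is not on the PATH.
        return "not-found"
    return "ok" if result.returncode == 0 else "failed"
```

With placeholder file names, a one-hour attempt would be `run_with_timeout(build_convert_command("book.azw3", "book.txt"), 3600)`. Swapping the output name for `book.epub` or `book.htmlz` would try a different target format with the same call.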
Convert book 1 of 1 (World Almanac and Book of Facts 2014)
Resolved conversion options
calibre version: 1.25.0
{'asciiize': False,
'author_sort': None,
'authors': None,
'base_font_size': 0.0,
'book_producer': None,
'change_justification': u'original',
'chapter': u"//*[((name()='h1' or name()='h2') and re:test(., '\\s*((chapter|book|section|part)\\s+)|((prolog|prologue|epilogue)(\\s+|$))', 'i')) or @class = 'chapter']",
'chapter_mark': u'pagebreak',
'comments': None,
'cover': u'/var/folders/x7/fsp4tg0x5n1_r3m7svfljj2c0000gn/T/calibre_1.25.0_tmp_CZBMVW/nHYv70.jpeg',
'debug_pipeline': None,
'dehyphenate': True,
'delete_blank_paragraphs': True,
'disable_font_rescaling': False,
'duplicate_links_in_toc': False,
'embed_all_fonts': False,
'embed_font_family': None,
'enable_heuristics': False,
'expand_css': False,
'extra_css': None,
'filter_css': u'',
'fix_indents': True,
'font_size_mapping': None,
'force_max_line_length': False,
'format_scene_breaks': True,
'html_unwrap_factor': 0.4,
'inline_toc': False,
'input_encoding': None,
'input_profile': <calibre.customize.profiles.InputProfile object at 0x109fa7750>,
'insert_blank_line': False,
'insert_blank_line_size': 0.5,
'insert_metadata': False,
'isbn': None,
'italicize_common_cases': True,
'keep_color': False,
'keep_image_references': False,
'keep_ligatures': False,
'keep_links': False,
'language': None,
'level1_toc': None,
'level2_toc': None,
'level3_toc': None,
'line_height': 0.0,
'linearize_tables': False,
'margin_bottom': 5.0,
'margin_left': 5.0,
'margin_right': 5.0,
'margin_top': 5.0,
'markup_chapter_headings': True,
'max_line_length': 0,
'max_toc_links': 50,
'minimum_line_height': 120.0,
'newline': u'system',
'no_chapters_in_toc': False,
'no_inline_navbars': False,
'output_profile': <calibre.customize.profiles.KindleDXOutput object at 0x109fa7e10>,
'page_breaks_before': u"//*[name()='h1' or name()='h2']",
'prefer_metadata_cover': False,
'pretty_print': False,
'pubdate': None,
'publisher': None,
'rating': None,
'read_metadata_from_opf': u'/var/folders/x7/fsp4tg0x5n1_r3m7svfljj2c0000gn/T/calibre_1.25.0_tmp_CZBMVW/qDk0Mr.opf',
'remove_fake_margins': True,
'remove_first_image': False,
'remove_paragraph_spacing': False,
'remove_paragraph_spacing_indent_size': 1.5,
'renumber_headings': True,
'replace_scene_breaks': u'',
'search_replace': '[]',
'series': None,
'series_index': None,
'smarten_punctuation': False,
'sr1_replace': None,
'sr1_search': None,
'sr2_replace': None,
'sr2_search': None,
'sr3_replace': None,
'sr3_search': None,
'start_reading_at': None,
'subset_embedded_fonts': False,
'tags': None,
'timestamp': None,
'title': None,
'title_sort': None,
'toc_filter': None,
'toc_threshold': 6,
'txt_output_encoding': u'utf-8',
'txt_output_formatting': u'plain',
'unsmarten_punctuation': False,
'unwrap_lines': True,
'use_auto_toc': False,
'verbose': 2}
InputFormatPlugin: MOBI Input running
on /var/folders/x7/fsp4tg0x5n1_r3m7svfljj2c0000gn/T/calibre_1.25.0_tmp_CZBMVW/zJ7xqz.azw3
Found KF8 MOBI of type 'standalone'
Extracting text...
Parsing all content...
Parsing text/part0153.html ...
Parsing text/part0115.html ...
Parsing text/part0089.html ...
Parsing text/part0126.html ...
Parsing text/part0064.html ...
Parsing text/part0116.html ...
Parsing text/part0149.html ...
Parsing text/part0090.html ...
Parsing text/part0142.html ...
Parsing text/part0127.html ...
Parsing text/part0065.html ...
Parsing text/part0117.html ...
Parsing text/part0155.html ...
Parsing text/part0091.html ...
Parsing text/part0062.html ...
Parsing text/part0128.html ...
Parsing text/part0066.html ...
Parsing text/part0118.html ...
Parsing text/part0092.html ...
Parsing text/part0129.html ...
Parsing text/part0067.html ...
Parsing text/part0119.html ...
Parsing text/part0093.html ...
Parsing text/part0130.html ...
Parsing text/part0068.html ...
Parsing text/part0150.html ...
Parsing text/part0120.html ...
Parsing text/part0094.html ...
Parsing text/part0095.html ...
Parsing text/part0096.html ...
Parsing text/part0131.html ...
Parsing text/part0097.html ...
Parsing text/part0098.html ...
Parsing text/part0069.html ...
Parsing text/part0099.html ...
Parsing text/part0121.html ...
Parsing text/part0100.html ...
Parsing text/part0101.html ...
Parsing text/part0102.html ...
Parsing text/part0132.html ...
Parsing text/part0103.html ...
Parsing text/part0104.html ...
Parsing text/part0070.html ...
Parsing text/part0105.html ...
Parsing text/part0122.html ...
Parsing text/part0106.html ...
Parsing text/part0107.html ...
Parsing text/part0108.html ...
Parsing text/part0133.html ...
Parsing text/part0109.html ...
Parsing text/part0071.html ...
Parsing text/part0123.html ...
Parsing text/part0151.html ...
Parsing text/part0134.html ...
Parsing text/part0072.html ...
Parsing text/part0124.html ...
Parsing text/part0050.html ...
Parsing text/part0135.html ...
Parsing text/part0148.html ...
Parsing text/part0073.html ...
Parsing text/part0125.html ...
Parsing text/part0143.html ...
Parsing text/part0136.html ...
Parsing styles/0001.css ...
Parsing text/part0000.html ...
Parsing text/part0074.html ...
Parsing text/part0001.html ...
Parsing text/part0002.html ...
Parsing text/part0003.html ...
Parsing text/part0144.html ...
Parsing text/part0004.html ...
Parsing text/part0079.html ...
Parsing text/part0005.html ...
Parsing text/part0137.html ...
Parsing text/part0006.html ...
Parsing text/part0075.html ...
Parsing text/part0007.html ...
Parsing text/part0008.html ...
Parsing text/part0009.html ...
Parsing text/part0145.html ...
Parsing text/part0010.html ...
Parsing text/part0053.html ...
Parsing text/part0080.html ...
Parsing text/part0011.html ...
Parsing text/part0138.html ...
Parsing text/part0012.html ...
Parsing text/part0076.html ...
Parsing text/part0013.html ...
Parsing text/part0014.html ...
Parsing text/part0015.html ...
Parsing text/part0146.html ...
Parsing text/part0158.html ...
Parsing text/part0016.html ...
Parsing text/part0081.html ...
Parsing text/part0017.html ...
Parsing text/part0139.html ...
Parsing text/part0159.html ...
Parsing text/part0018.html ...
Parsing text/part0160.html ...
Parsing text/part0077.html ...
Parsing text/part0161.html ...
Parsing text/part0019.html ...
Parsing text/part0162.html ...
Parsing text/part0163.html ...
Parsing text/part0020.html ...
Parsing text/part0164.html ...
Parsing text/part0165.html ...
Parsing text/part0021.html ...
Parsing text/part0147.html ...
Parsing text/part0022.html ...
Parsing text/part0082.html ...
Parsing text/part0023.html ...
Parsing text/part0140.html ...
Parsing text/part0024.html ...
Parsing text/part0078.html ...
Parsing text/part0156.html ...
Parsing text/part0025.html ...
Parsing text/part0026.html ...
Parsing text/part0027.html ...
Parsing text/part0063.html ...
Parsing text/part0028.html ...
Parsing text/part0083.html ...
Parsing text/part0029.html ...
Parsing text/part0141.html ...
Parsing text/part0030.html ...
Parsing text/part0031.html ...
Parsing text/part0032.html ...
Parsing text/part0152.html ...
Parsing text/part0033.html ...
Parsing text/part0110.html ...
Parsing text/part0057.html ...
Parsing text/part0034.html ...
Parsing text/part0084.html ...
Parsing text/part0035.html ...
Parsing text/part0036.html ...
Parsing text/part0037.html ...
Parsing text/part0038.html ...
Parsing text/part0039.html ...
Parsing text/part0111.html ...
Parsing text/part0058.html ...
Parsing text/part0040.html ...
Parsing text/part0085.html ...
Parsing text/part0041.html ...
Parsing text/part0042.html ...
Parsing text/part0043.html ...
Parsing text/part0044.html ...
Parsing text/part0045.html ...
Parsing text/part0112.html ...
Parsing text/part0059.html ...
Parsing text/part0046.html ...
Parsing text/part0086.html ...
Parsing text/part0047.html ...
Parsing text/part0048.html ...
Parsing text/part0049.html ...
Parsing text/part0051.html ...
Parsing text/part0113.html ...
Parsing text/part0060.html ...
Parsing text/part0052.html ...
Parsing text/part0087.html ...
Parsing text/part0154.html ...
Parsing text/part0054.html ...
Parsing text/part0157.html ...
Parsing text/part0055.html ...
Parsing text/part0056.html ...
Parsing text/part0114.html ...
Parsing text/part0061.html ...
Parsing text/part0088.html ...
Referenced file u'styles/XXXXXXXXXXXXXXXX' not found
Reading TOC from NCX...
Merging user specified metadata...
Detecting structure...
Detected chapter: THE WORLD ALMANAC AND BOOK OF FACTS 2014
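To check whether the parser got through every part before the job stalled, the log above can be scanned for its `Parsing text/partNNNN.html` lines. This is a rough sketch, assuming the log has been saved as text. The names `parsed_parts` and `missing_parts` are made up for illustration, and the idea that parts are numbered consecutively from 0 is my assumption, not something calibre documents.

```python
import re

# Matches log lines such as "Parsing text/part0153.html ..."
PART_LINE = re.compile(r"Parsing text/part(\d+)\.html")


def parsed_parts(log_text):
    """Return the sorted, de-duplicated part numbers found in the log."""
    return sorted({int(m.group(1)) for m in PART_LINE.finditer(log_text)})


def missing_parts(parts):
    """Return numbers absent from 0..max(parts).

    Assumes consecutive numbering starting at 0 (an assumption).
    """
    if not parts:
        return []
    seen = set(parts)
    return [n for n in range(max(parts) + 1) if n not in seen]


# Tiny made-up sample in the same shape as the real log.
sample = """Parsing text/part0002.html ...
Parsing text/part0000.html ...
Parsing styles/0001.css ...
Parsing text/part0003.html ...
"""
parts = parsed_parts(sample)
print(parts, missing_parts(parts))  # prints: [0, 2, 3] [1]
```

If nothing comes back as missing for the full log, the stall is presumably happening after parsing, somewhere around the structure-detection step the log ends on.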