12-10-2014, 12:32 PM | #1 |
Zealot
Posts: 118
Karma: 202232
Join Date: Jun 2010
Location: Texas
Device: Kindle Paperwhite Gen2
|
Reddit RSS feed not pulling author info
Hello everyone.
The following is something I've been trying to figure out for several months. My knowledge of RSS is limited and my knowledge of Python is almost non-existent. I like reading certain subreddits on reddit.com. I would like to be able to read them on Kindle. The problem is that when I do this, Calibre does not show me who the author of the story is. This becomes important because many of the stories are continuations or have over-arching themes, so its important to know who is writing it. This is the rss feed for Tales from Tech Support: http://www.reddit.com/r/talesfromtechsupport/.rss . If you click that link, you will see something similar to this: Notice that "submitted by" includes the author. However, when I "view source" I don't see anything indicating that as an "author" tag. Still, I would think that calibre should include all data that is present in the RSS feed. Instead it strips the author from the ouput: Here is the recipe text copied from calibre: Code:
class AdvancedUserRecipe1418232189(BasicNewsRecipe): title = u'Unknown News Source' oldest_article = 7 max_articles_per_feed = 100 auto_cleanup = True feeds = [] Code:
Fetch news from TFTS Resolved conversion options calibre version: 1.33.0 {'asciiize': False, 'author_sort': None, 'authors': None, 'base_font_size': 0, 'book_producer': None, 'change_justification': 'original', 'chapter': None, 'chapter_mark': 'pagebreak', 'comments': None, 'cover': None, 'debug_pipeline': None, 'dehyphenate': True, 'delete_blank_paragraphs': True, 'disable_font_rescaling': False, 'dont_compress': False, 'dont_download_recipe': False, 'duplicate_links_in_toc': False, 'embed_all_fonts': False, 'embed_font_family': None, 'enable_heuristics': False, 'expand_css': False, 'extra_css': None, 'extract_to': None, 'filter_css': None, 'fix_indents': True, 'font_size_mapping': None, 'format_scene_breaks': True, 'html_unwrap_factor': 0.4, 'input_encoding': None, 'input_profile': <calibre.customize.profiles.InputProfile object at 0x1689650>, 'insert_blank_line': False, 'insert_blank_line_size': 0.5, 'insert_metadata': False, 'isbn': None, 'italicize_common_cases': True, 'keep_ligatures': False, 'language': None, 'level1_toc': None, 'level2_toc': None, 'level3_toc': None, 'line_height': 0, 'linearize_tables': False, 'lrf': False, 'margin_bottom': 5.0, 'margin_left': 5.0, 'margin_right': 5.0, 'margin_top': 5.0, 'markup_chapter_headings': True, 'max_toc_links': 50, 'minimum_line_height': 120.0, 'mobi_file_type': 'old', 'mobi_ignore_margins': False, 'mobi_keep_original_images': False, 'mobi_toc_at_start': False, 'no_chapters_in_toc': False, 'no_inline_navbars': True, 'no_inline_toc': False, 'output_profile': <calibre.customize.profiles.KindlePaperWhiteOutput object at 0x1689d90>, 'page_breaks_before': None, 'personal_doc': '[PDOC]', 'prefer_author_sort': False, 'prefer_metadata_cover': False, 'pretty_print': False, 'pubdate': None, 'publisher': None, 'rating': None, 'read_metadata_from_opf': None, 'remove_fake_margins': True, 'remove_first_image': False, 'remove_paragraph_spacing': False, 'remove_paragraph_spacing_indent_size': 1.5, 'renumber_headings': True, 'replace_scene_breaks': '', 'search_replace': None, 'series': None, 'series_index': None, 'share_not_sync': False, 'smarten_punctuation': False, 'sr1_replace': '', 'sr1_search': '', 'sr2_replace': '', 'sr2_search': '', 'sr3_replace': '', 'sr3_search': '', 'start_reading_at': None, 'subset_embedded_fonts': False, 'tags': None, 'test': False, 'timestamp': None, 'title': None, 'title_sort': None, 'toc_filter': None, 'toc_threshold': 6, 'toc_title': None, 'unsmarten_punctuation': False, 'unwrap_lines': True, 'use_auto_toc': False, 'verbose': 2} InputFormatPlugin: Recipe Input running Using custom recipe Synthesizing mastheadImage DownloadingDownloading FetchingDownloading Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/qa24ai_feeds2disk.htmlfile:///tmp/calibre_1.33.0_tmp_RlxFFq/AwiQLK_feeds2disk.html Fetching DownloadingDownloading file:///tmp/calibre_1.33.0_tmp_RlxFFq/BpWnCz_feeds2disk.html FetchingFetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/SG6sj6_feeds2disk.htmlfile:///tmp/calibre_1.33.0_tmp_RlxFFq/dqYnLk_feeds2disk.html WARNING: Encoding detection confidence 82% Candid: 21.410 .md - div link density 0.000 -> 21.410 Candid: 13.205 div - body#readabilityBody link density 0.050 -> 12.550 Top 5 : 21.410 .md - div Top 5 : 12.550 div - body#readabilityBody Candid: 45.510 .md - div link density 0.000 -> 45.510 Candid: 25.255 div - body#readabilityBody link density 0.038 -> 24.287 Top 5 : 45.510 .md - div Top 5 : 24.287 div - body#readabilityBody Candid: 98.005 .md - div link density 0.000 -> 98.005 Processing images... Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/dqYnLk_feeds2disk.html Candid: 46.990 div - body#readabilityBody link density 0.008 -> 46.607 Candid: 13.000 blockquote - .md link density 0.000 -> 13.000file:///tmp/calibre_1.33.0_tmp_RlxFFq/dqYnLk_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_4/dqYnLk_feeds2disk.xhtml Candid: 5.370 blockquote - .md link density 0.000 -> 5.370 Candid: 8.680 blockquote - .md link density 0.000 -> 8.680Downloading Top 5 : 98.005 .md - div FetchingTop 5 : 46.607 div - body#readabilityBody Top 5 : 13.000 blockquote - .mdfile:///tmp/calibre_1.33.0_tmp_RlxFFq/eI7GXz_feeds2disk.html Top 5 : 8.680 blockquote - .md Top 5 : 5.370 blockquote - .mdProcessing images... Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/AwiQLK_feeds2disk.html file:///tmp/calibre_1.33.0_tmp_RlxFFq/AwiQLK_feeds2disk.html saved toDownloaded article: /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_1/AwiQLK_feeds2disk.xhtml That's it. Christmas is canceled! from http://www.reddit.com/r/talesfromtechsupport/comments/2orn2t/thats_it_christmas_is_canceled/ Downloaded article:Processing images... "I hope you lose your job and that your entire company goes under" Recursion limit reached. Skipping links infrom Downloadinghttp://www.reddit.com/r/talesfromtechsupport/comments/2ou9ax/i_hope_you_lose_your_job_and_that_your_entire/file:///tmp/calibre_1.33.0_tmp_RlxFFq/SG6sj6_feeds2disk.html Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/ECBqLN_feeds2disk.html Candid: 36.150 .md - div link density 0.000 -> 36.150 file:///tmp/calibre_1.33.0_tmp_RlxFFq/SG6sj6_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_3/SG6sj6_feeds2disk.xhtml Candid: 20.575 div - body#readabilityBody link density 0.024 -> 20.084 DownloadingTop 5 : 36.150 .md - div Top 5 : 20.084 div - body#readabilityBody Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/7W3NCX_feeds2disk.html Candid: 31.910 .md - div link density 0.449 -> 17.590 Processing images... Candid: 38.170 .md - div link density 0.000 -> 38.170 Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/eI7GXz_feeds2disk.html file:///tmp/calibre_1.33.0_tmp_RlxFFq/eI7GXz_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_5/eI7GXz_feeds2disk.xhtml Candid: 21.585 div - body#readabilityBody link density 0.021 -> 21.142 Top 5 : 38.170 .md - div Top 5 : 21.142 div - body#readabilityBody Downloading Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/RUaZn3_feeds2disk.html Processing images... Candid: 45.690 .md - div link density 0.000 -> 45.690Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/ECBqLN_feeds2disk.html file:///tmp/calibre_1.33.0_tmp_RlxFFq/ECBqLN_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_6/ECBqLN_feeds2disk.xhtml Candid: 25.345 div - body#readabilityBody link density 0.015 -> 24.953 Top 5 : 45.690 .md - div Top 5 : 24.953 div - body#readabilityBody Downloading Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/CALAQg_feeds2disk.html WARNING: Encoding detection confidence 99% Candid: 34.085 .md - div link density 0.000 -> 34.085 Candid: 156.025 .md - div link density 0.002 -> 155.711 Candid: 16.520 div - body#readabilityBody link density 0.023 -> 16.138 Candid: 15.090 blockquote - .md link density 0.000 -> 15.090 Top 5 : 34.085 .md - div Top 5 : 16.138 div - body#readabilityBody Processing images...Top 5 : 15.090 blockquote - .md Candid: 47.745 div - body#readabilityBody link density 0.006 -> 47.482 Recursion limit reached. Skipping links inCandid: 18.455 div - body#readabilityBody link density 0.452 -> 10.106 Top 5 : 17.590 .md - divfile:///tmp/calibre_1.33.0_tmp_RlxFFq/7W3NCX_feeds2disk.htmlCandid: 24.540 blockquote - .md link density 0.000 -> 24.540 Top 5 : 10.106 div - body#readabilityBody Downloaded article:Candid: 46.250 blockquote - .md link density 0.000 -> 46.250 "Hey, did you throw away our records?" from http://www.reddit.com/r/talesfromtechsupport/comments/2ounzq/hey_did_you_throw_away_our_records/ file:///tmp/calibre_1.33.0_tmp_RlxFFq/7W3NCX_feeds2disk.htmlDownloaded article: saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_7/7W3NCX_feeds2disk.xhtmlOh, I'm sorry - was that important? *crunch crunch* fromCandid: 6.630 blockquote - .md link density 0.000 -> 6.630 http://www.reddit.com/r/talesfromtechsupport/comments/2ottif/oh_im_sorry_was_that_important_crunch_crunch/ Downloaded article: The end user and their phone from http://www.reddit.com/r/talesfromtechsupport/comments/2oupr0/the_end_user_and_their_phone/ Downloaded article: 12 year old PHP scripts and SQL server 2008R2 Candid: 5.440 blockquote - .md link density 0.000 -> 5.440 from http://www.reddit.com/r/talesfromtechsupport/comments/2ovblb/12_year_old_php_scripts_and_sql_server_2008r2/Downloading Candid: 10.840 blockquote - .md link density 0.000 -> 10.840Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/cvJFgv_feeds2disk.html Candid: 11.590 blockquote - .md link density 0.000 -> 11.590 Candid: 46.780 blockquote - .md link density 0.000 -> 46.780 Top 5 : 155.711 .md - div Top 5 : 47.482 div - body#readabilityBody Top 5 : 46.780 blockquote - .md Top 5 : 46.250 blockquote - .md Top 5 : 24.540 blockquote - .md Processing images... Candid: 25.890 .md - div link density 0.000 -> 25.890 Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/RUaZn3_feeds2disk.html file:///tmp/calibre_1.33.0_tmp_RlxFFq/RUaZn3_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_8/RUaZn3_feeds2disk.xhtml Candid: 15.445 div - body#readabilityBody link density 0.032 -> 14.946 Processing images... Top 5 : 25.890 .md - div DownloadingTop 5 : 14.946 div - body#readabilityBody Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/BpWnCz_feeds2disk.html Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/8NuxtU_feeds2disk.html file:///tmp/calibre_1.33.0_tmp_RlxFFq/BpWnCz_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_2/BpWnCz_feeds2disk.xhtml Processing images... Downloading Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/cvJFgv_feeds2disk.htmlFetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/vxYygt_feeds2disk.html file:///tmp/calibre_1.33.0_tmp_RlxFFq/cvJFgv_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_10/cvJFgv_feeds2disk.xhtml Candid: 106.475 .md - div link density 0.000 -> 106.475 Cleaned 0.000 .md - div with weight 0 cause it has too many links 0.449 for its weight 0. DownloadingCandid: 33.665 div - body#readabilityBody link density 0.005 -> 33.489 Candid: 22.170 .md - div link density 0.000 -> 22.170 Cleaned 0.000 div - body with weight 0 cause it has too short content length 0 without a single image. Fetching Candid: 32.420 blockquote - .md link density 0.000 -> 32.420file:///tmp/calibre_1.33.0_tmp_RlxFFq/R8MqWc_feeds2disk.html Candid: 9.960 blockquote - .md link density 0.000 -> 9.960 Candid: 18.460 blockquote - .md link density 0.000 -> 18.460 Candid: 10.825 div - body#readabilityBody link density 0.041 -> 10.380 Downloaded article: Something must be wrong with your system!Candid: 39.450 blockquote - .md link density 0.000 -> 39.450 Candid: 5.400 blockquote - .md link density 0.000 -> 5.400 Top 5 : 106.475 .md - div Top 5 : 39.450 blockquote - .md from http://www.reddit.com/r/talesfromtechsupport/comments/2ott55/something_must_be_wrong_with_your_system/ Candid: 11.640 blockquote - .md link density 0.000 -> 11.640 Top 5 : 33.489 div - body#readabilityBodyDownloaded article: Top 5 : 22.170 .md - div Some people are traidies for a reason. Top 5 : 32.420 blockquote - .md fromTop 5 : 11.640 blockquote - .md Top 5 : 10.380 div - body#readabilityBodyTop 5 : 18.460 blockquote - .mdhttp://www.reddit.com/r/talesfromtechsupport/comments/2ourw7/some_people_are_traidies_for_a_reason/ Downloaded article:Top 5 : 5.400 blockquote - .md You can't complain about not having something if you have it! from http://www.reddit.com/r/talesfromtechsupport/comments/2ou956/you_cant_complain_about_not_having_something_if/ Processing images... Processing images... Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/CALAQg_feeds2disk.html Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/8NuxtU_feeds2disk.html file:///tmp/calibre_1.33.0_tmp_RlxFFq/CALAQg_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_9/CALAQg_feeds2disk.xhtml file:///tmp/calibre_1.33.0_tmp_RlxFFq/8NuxtU_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_11/8NuxtU_feeds2disk.xhtml Downloading Fetching Downloading file:///tmp/calibre_1.33.0_tmp_RlxFFq/p_2C_i_feeds2disk.html Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/YRg0Ad_feeds2disk.html Candid: 87.415 .md - div link density 0.001 -> 87.328 Candid: 23.265 div - body#readabilityBody link density 0.008 -> 23.082 Candid: 30.840 .md - div link density 0.000 -> 30.840 Candid: 11.040 blockquote - .md link density 0.000 -> 11.040 Candid: 10.460 blockquote - .md link density 0.000 -> 10.460 Candid: 9.110 blockquote - .md link density 0.000 -> 9.110 Candid: 17.920 div - body#readabilityBody link density 0.020 -> 17.565 Candid: 31.910 .md - div link density 0.449 -> 17.590 Top 5 : 30.840 .md - div Top 5 : 17.565 div - body#readabilityBodyCandid: 17.180 blockquote - .md link density 0.000 -> 17.180 Candid: 34.010 blockquote - .md link density 0.000 -> 34.010 Candid: 8.460 blockquote - .md link density 0.000 -> 8.460 Candid: 22.510 blockquote - .md link density 0.000 -> 22.510 Top 5 : 87.328 .md - div Top 5 : 34.010 blockquote - .md Top 5 : 23.082 div - body#readabilityBody Top 5 : 22.510 blockquote - .md Top 5 : 17.180 blockquote - .md Candid: 70.400 .md - div link density 0.000 -> 70.400 Candid: 37.700 div - body#readabilityBody link density 0.011 -> 37.291 Top 5 : 70.400 .md - div Top 5 : 37.291 div - body#readabilityBody Processing images... Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/p_2C_i_feeds2disk.html Processing images... file:///tmp/calibre_1.33.0_tmp_RlxFFq/p_2C_i_feeds2disk.html Recursion limit reached. Skipping links inDownloaded article: Super Dooper VIP, or Screenshots Can Save Your Buttfile:///tmp/calibre_1.33.0_tmp_RlxFFq/vxYygt_feeds2disk.html from saved tohttp://www.reddit.com/r/talesfromtechsupport/comments/2osxj9/super_dooper_vip_or_screenshots_can_save_your_butt/ /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_14/p_2C_i_feeds2disk.xhtml Downloaded article: Closure Code: No fault found from http://www.reddit.com/r/talesfromtechsupport/comments/2oumkx/closure_code_no_fault_found/ file:///tmp/calibre_1.33.0_tmp_RlxFFq/vxYygt_feeds2disk.htmlProcessing images... Downloading saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_12/vxYygt_feeds2disk.xhtmlRecursion limit reached. Skipping links in Fetching Downloaded article:file:///tmp/calibre_1.33.0_tmp_RlxFFq/6Dly2y_feeds2disk.htmlfile:///tmp/calibre_1.33.0_tmp_RlxFFq/YRg0Ad_feeds2disk.html Lab Computer != your computer from http://www.reddit.com/r/talesfromtechsupport/comments/2oskhk/lab_computer_your_computer/ file:///tmp/calibre_1.33.0_tmp_RlxFFq/YRg0Ad_feeds2disk.html Downloading saved toDownloaded article: My story of woe .... Part 2 /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_15/YRg0Ad_feeds2disk.xhtml Fetching fromfile:///tmp/calibre_1.33.0_tmp_RlxFFq/_uGgKn_feeds2disk.html http://www.reddit.com/r/talesfromtechsupport/comments/2otwov/my_story_of_woe_part_2/ WARNING: Encoding detection confidence 99% Downloaded article: Downloading "That isn't possible. I cannot be making a typo" Candid: 18.455 div - body#readabilityBody link density 0.452 -> 10.106 fromTop 5 : 17.590 .md - div http://www.reddit.com/r/talesfromtechsupport/comments/2ospb3/that_isnt_possible_i_cannot_be_making_a_typo/ Top 5 : 10.106 div - body#readabilityBody Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/M_3jDj_feeds2disk.html Candid: 167.810 .md - div link density 0.004 -> 167.139 Candid: 38.710 .md - div link density 0.000 -> 38.710 Candid: 60.575 div - body#readabilityBody link density 0.009 -> 60.040 Candid: 17.500 blockquote - .md link density 0.000 -> 17.500 Candid: 30.840 blockquote - .md link density 0.000 -> 30.840 Candid: 21.855 div - body#readabilityBody link density 0.017 -> 21.478 Top 5 : 38.710 .md - divCandid: 5.320 blockquote - .md link density 0.000 -> 5.320 Top 5 : 21.478 div - body#readabilityBody Candid: 14.550 blockquote - .md link density 0.000 -> 14.550 Candid: 84.095 .md - div link density 0.000 -> 84.095 Candid: 10.210 blockquote - .md link density 0.000 -> 10.210 Candid: 48.240 .md - div link density 0.000 -> 48.240 Candid: 13.960 blockquote - .md link density 0.000 -> 13.960 Candid: 5.940 blockquote - .md link density 0.000 -> 5.940 Candid: 29.000 blockquote - .md link density 0.000 -> 29.000 Top 5 : 167.139 .md - div Candid: 38.015 div - body#readabilityBody link density 0.010 -> 37.633 Top 5 : 60.040 div - body#readabilityBody Top 5 : 30.840 blockquote - .mdCandid: 23.520 div - body#readabilityBody link density 0.014 -> 23.201 Candid: 8.530 blockquote - .md link density 0.000 -> 8.530Top 5 : 29.000 blockquote - .md Processing images... Top 5 : 17.500 blockquote - .md Candid: 9.420 blockquote - .md link density 0.000 -> 9.420 Candid: 5.950 blockquote - .md link density 0.000 -> 5.950 Candid: 5.520 blockquote - .md link density 0.000 -> 5.520 Recursion limit reached. Skipping links in Candid: 13.260 blockquote - .md link density 0.000 -> 13.260file:///tmp/calibre_1.33.0_tmp_RlxFFq/M_3jDj_feeds2disk.html Candid: 6.460 blockquote - .md link density 0.000 -> 6.460 Top 5 : 48.240 .md - div file:///tmp/calibre_1.33.0_tmp_RlxFFq/M_3jDj_feeds2disk.htmlTop 5 : 23.201 div - body#readabilityBody Candid: 10.390 blockquote - .md link density 0.000 -> 10.390 saved to Top 5 : 9.420 blockquote - .mdTop 5 : 84.095 .md - div /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_18/M_3jDj_feeds2disk.xhtml Top 5 : 6.460 blockquote - .mdTop 5 : 37.633 div - body#readabilityBody Top 5 : 5.520 blockquote - .mdTop 5 : 13.260 blockquote - .mdCleaned 0.000 .md - div with weight 0 cause it has too many links 0.449 for its weight 0. Top 5 : 10.390 blockquote - .md Top 5 : 8.530 blockquote - .md Cleaned 0.000 div - body with weight 0 cause it has too short content length 0 without a single image. Downloading Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/PfqqTB_feeds2disk.html Processing images... Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/R8MqWc_feeds2disk.html Processing images... Downloaded article:file:///tmp/calibre_1.33.0_tmp_RlxFFq/R8MqWc_feeds2disk.html saved toThey make minimum wage for a reason.Processing images... /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_13/R8MqWc_feeds2disk.xhtml from http://www.reddit.com/r/talesfromtechsupport/comments/2osotl/they_make_minimum_wage_for_a_reason/ Recursion limit reached. Skipping links inRecursion limit reached. Skipping links in Processing images...file:///tmp/calibre_1.33.0_tmp_RlxFFq/qa24ai_feeds2disk.html file:///tmp/calibre_1.33.0_tmp_RlxFFq/_uGgKn_feeds2disk.html Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/6Dly2y_feeds2disk.html file:///tmp/calibre_1.33.0_tmp_RlxFFq/qa24ai_feeds2disk.html Candid: 32.180 .md - div link density 0.000 -> 32.180saved tofile:///tmp/calibre_1.33.0_tmp_RlxFFq/_uGgKn_feeds2disk.html Downloading /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_0/qa24ai_feeds2disk.xhtml saved to Fetching/tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_17/_uGgKn_feeds2disk.xhtml file:///tmp/calibre_1.33.0_tmp_RlxFFq/b1gRIR_feeds2disk.html Candid: 18.590 div - body#readabilityBody link density 0.029 -> 18.052 file:///tmp/calibre_1.33.0_tmp_RlxFFq/6Dly2y_feeds2disk.htmlTop 5 : 32.180 .md - div saved to Downloading /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_16/6Dly2y_feeds2disk.xhtmlTop 5 : 18.052 div - body#readabilityBody Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/is6VKn_feeds2disk.htmlDownloading Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/CHICZH_feeds2disk.html Downloading FetchingCandid: 28.110 .md - div link density 0.000 -> 28.110 file:///tmp/calibre_1.33.0_tmp_RlxFFq/bTyj9t_feeds2disk.html Downloaded article: My story of woe.... Part 3 from http://www.reddit.com/r/talesfromtechsupport/comments/2oviiz/my_story_of_woe_part_3/Processing images... Downloaded article: TFTS Top Tales - November 2014WARNING: Encoding detection confidence 88% Candid: 16.555 div - body#readabilityBody link density 0.039 -> 15.913 Top 5 : 28.110 .md - divfrom http://www.reddit.com/r/talesfromtechsupport/comments/2onlu7/tfts_top_tales_november_2014/ Top 5 : 15.913 div - body#readabilityBody Downloaded article: He obviously didn't attend at Winterhold... Recursion limit reached. Skipping links in from file:///tmp/calibre_1.33.0_tmp_RlxFFq/PfqqTB_feeds2disk.html http://www.reddit.com/r/talesfromtechsupport/comments/2oslif/he_obviously_didnt_attend_at_winterhold/ Downloaded article:Candid: 14.000 .md - div link density 0.000 -> 14.000 I'm using Chrome??? from file:///tmp/calibre_1.33.0_tmp_RlxFFq/PfqqTB_feeds2disk.html http://www.reddit.com/r/talesfromtechsupport/comments/2osb7c/im_using_chrome/saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_19/PfqqTB_feeds2disk.xhtml Candid: 9.500 div - body#readabilityBody link density 0.066 -> 8.875 Top 5 : 14.000 .md - div Top 5 : 8.875 div - body#readabilityBody Downloading Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/Bc5xNH_feeds2disk.htmlProcessing images... Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/b1gRIR_feeds2disk.html file:///tmp/calibre_1.33.0_tmp_RlxFFq/b1gRIR_feeds2disk.html WARNING: Encoding detection confidence 84%saved to Downloaded article: /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_20/b1gRIR_feeds2disk.xhtmlDelete EVERYTHING! from http://www.reddit.com/r/talesfromtechsupport/comments/2ovkc9/delete_everything/ Downloaded article: It helps to read AND comprehend from http://www.reddit.com/r/talesfromtechsupport/comments/2osqtu/it_helps_to_read_and_comprehend/ Downloading Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/L4M4qG_feeds2disk.html Processing images... Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/bTyj9t_feeds2disk.html file:///tmp/calibre_1.33.0_tmp_RlxFFq/bTyj9t_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_23/bTyj9t_feeds2disk.xhtml Candid: 65.260 .md - div link density 0.000 -> 65.260 Candid: 18.555 div - body#readabilityBody link density 0.008 -> 18.401 Candid: 10.920 blockquote - .md link density 0.000 -> 10.920 Candid: 15.100 blockquote - .md link density 0.000 -> 15.100 Candid: 49.280 blockquote - .md link density 0.000 -> 49.280 Top 5 : 65.260 .md - div Top 5 : 49.280 blockquote - .md Top 5 : 18.401 div - body#readabilityBody Top 5 : 15.100 blockquote - .md Top 5 : 10.920 blockquote - .md Candid: 63.730 .md - div link density 0.000 -> 63.730 Candid: 26.455 div - body#readabilityBody link density 0.012 -> 26.141 Processing images...Candid: 13.180 blockquote - .md link density 0.000 -> 13.180 Candid: 24.460 blockquote - .md link density 0.000 -> 24.460 Recursion limit reached. Skipping links inTop 5 : 63.730 .md - div file:///tmp/calibre_1.33.0_tmp_RlxFFq/CHICZH_feeds2disk.htmlTop 5 : 26.141 div - body#readabilityBody Top 5 : 24.460 blockquote - .md Top 5 : 13.180 blockquote - .md file:///tmp/calibre_1.33.0_tmp_RlxFFq/CHICZH_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_22/CHICZH_feeds2disk.xhtml Candid: 117.845 .md - div link density 0.000 -> 117.845 Candid: 170.135 .md - div link density 0.000 -> 170.135 Candid: 53.800 div - body#readabilityBody link density 0.006 -> 53.504 Candid: 7.690 blockquote - .md link density 0.000 -> 7.690 Candid: 70.890 div - body#readabilityBody link density 0.004 -> 70.584 Candid: 26.470 blockquote - .md link density 0.000 -> 26.470 Processing images... Candid: 5.470 blockquote - .md link density 0.000 -> 5.470 Candid: 5.330 blockquote - .md link density 0.000 -> 5.330Candid: 5.920 blockquote - .md link density 0.000 -> 5.920 Top 5 : 117.845 .md - div Candid: 5.970 blockquote - .md link density 0.000 -> 5.970Top 5 : 53.504 div - body#readabilityBody Top 5 : 26.470 blockquote - .mdCandid: 9.440 blockquote - .md link density 0.000 -> 9.440 Recursion limit reached. Skipping links inCandid: 13.690 blockquote - .md link density 0.000 -> 13.690 file:///tmp/calibre_1.33.0_tmp_RlxFFq/L4M4qG_feeds2disk.html Top 5 : 7.690 blockquote - .mdCandid: 6.910 blockquote - .md link density 0.000 -> 6.910 Top 5 : 5.330 blockquote - .md Candid: 6.060 blockquote - .md link density 0.000 -> 6.060 file:///tmp/calibre_1.33.0_tmp_RlxFFq/L4M4qG_feeds2disk.htmlCandid: 5.410 blockquote - .md link density 0.000 -> 5.410 saved toCandid: 8.900 blockquote - .md link density 0.000 -> 8.900 /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_25/L4M4qG_feeds2disk.xhtml Candid: 9.130 blockquote - .md link density 0.000 -> 9.130 Candid: 20.210 blockquote - .md link density 0.000 -> 20.210 Candid: 5.600 blockquote - .md link density 0.000 -> 5.600 Top 5 : 170.135 .md - div Top 5 : 70.584 div - body#readabilityBody Top 5 : 20.210 blockquote - .md Top 5 : 13.690 blockquote - .md Top 5 : 9.440 blockquote - .md Processing images... Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/Bc5xNH_feeds2disk.html file:///tmp/calibre_1.33.0_tmp_RlxFFq/Bc5xNH_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_24/Bc5xNH_feeds2disk.xhtml Processing images... Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/is6VKn_feeds2disk.html Downloaded article: Today, someone found out a battery is needed to power a laptop. from http://www.reddit.com/r/talesfromtechsupport/comments/2oszes/today_someone_found_out_a_battery_is_needed_to/ file:///tmp/calibre_1.33.0_tmp_RlxFFq/is6VKn_feeds2disk.htmlDownloaded article: saved toWebsense doesn't block adult sites, but it blocks Facebook messages! from/tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_21/is6VKn_feeds2disk.xhtml http://www.reddit.com/r/talesfromtechsupport/comments/2orpym/websense_doesnt_block_adult_sites_but_it_blocks/ Downloaded article: A lesson in (natural) Disaster Recovery, the tale of a terrified intern. Part 1 from http://www.reddit.com/r/talesfromtechsupport/comments/2os84v/a_lesson_in_natural_disaster_recovery_the_tale_of/ Downloaded article: Hard of Hearing from http://www.reddit.com/r/talesfromtechsupport/comments/2ospp3/hard_of_hearing/ Downloaded article: IT destroys payment system to fix a printer (and failed) from http://www.reddit.com/r/talesfromtechsupport/comments/2oqxph/it_destroys_payment_system_to_fix_a_printer_and/ Parsing all content... Parsing feed_0/article_16/index.html ... Forcing feed_0/article_16/index.html into XHTML namespace Parsing feed_0/article_11/index.html ... Forcing feed_0/article_11/index.html into XHTML namespace Parsing feed_0/article_23/index.html ... Forcing feed_0/article_23/index.html into XHTML namespace Parsing feed_0/article_5/index.html ... Forcing feed_0/article_5/index.html into XHTML namespace Parsing feed_0/article_15/index.html ... Forcing feed_0/article_15/index.html into XHTML namespace Parsing feed_0/article_0/index.html ... Forcing feed_0/article_0/index.html into XHTML namespace Parsing feed_0/article_6/index.html ... Forcing feed_0/article_6/index.html into XHTML namespace Parsing feed_0/article_9/index.html ... Forcing feed_0/article_9/index.html into XHTML namespace Parsing feed_0/article_4/index.html ... Forcing feed_0/article_4/index.html into XHTML namespace Parsing feed_0/article_10/index.html ... Forcing feed_0/article_10/index.html into XHTML namespace Parsing feed_0/article_22/index.html ... Forcing feed_0/article_22/index.html into XHTML namespace Parsing feed_0/article_17/index.html ... Forcing feed_0/article_17/index.html into XHTML namespace Parsing feed_0/article_20/index.html ... Forcing feed_0/article_20/index.html into XHTML namespace Parsing feed_0/article_24/index.html ... Forcing feed_0/article_24/index.html into XHTML namespace Parsing feed_0/article_13/index.html ... Forcing feed_0/article_13/index.html into XHTML namespace Parsing feed_0/article_21/index.html ... Forcing feed_0/article_21/index.html into XHTML namespace Parsing feed_0/article_2/index.html ... Forcing feed_0/article_2/index.html into XHTML namespace Parsing feed_0/article_1/index.html ... Forcing feed_0/article_1/index.html into XHTML namespace Parsing feed_0/article_14/index.html ... Forcing feed_0/article_14/index.html into XHTML namespace Parsing feed_0/article_18/index.html ... Forcing feed_0/article_18/index.html into XHTML namespace Parsing feed_0/article_3/index.html ... Forcing feed_0/article_3/index.html into XHTML namespace Parsing feed_0/article_8/index.html ... Forcing feed_0/article_8/index.html into XHTML namespace Parsing feed_0/index.html ... Initial parse failed, using more forgiving parsers Parsing feed_0/index.html as HTML Parsing index.html ... Forcing index.html into XHTML namespace Parsing feed_0/article_7/index.html ... Forcing feed_0/article_7/index.html into XHTML namespace Parsing feed_0/article_25/index.html ... Forcing feed_0/article_25/index.html into XHTML namespace Parsing feed_0/article_19/index.html ... Forcing feed_0/article_19/index.html into XHTML namespace Parsing feed_0/article_12/index.html ... Forcing feed_0/article_12/index.html into XHTML namespace Referenced file u'feed_1/index.html' not found Referenced file u'feed_0/article_2/file%3a/u/reaganFF' not found Reading TOC from NCX... Merging user specified metadata... Detecting structure... Flattening CSS and remapping font sizes... Source base font size is 12.00000pt Removing fake margins... Found 28 items of level: div_1 Found 27 items of level: div_2 Found 26 items of level: div_4 Found 224 items of level: p_4 Found 2 items of level: p_2 Found 253 items of level: p_3 Ignoring level p_2 div_1 left margin stats: Counter({u'': 23}) div_1 right margin stats: Counter({u'': 23}) div_2 left margin stats: Counter({u'': 21}) div_2 right margin stats: Counter({u'': 21}) div_4 left margin stats: Counter({u'': 26}) div_4 right margin stats: Counter({u'': 26}) p_4 left margin stats: Counter({u'0': 224}) p_4 right margin stats: Counter({u'0': 224}) p_3 left margin stats: Counter({u'0': 253}) p_3 right margin stats: Counter({u'0': 253}) Cleaning up manifest... Trimming unused files from manifest... Creating MOBI Output... Serializing resources... Converting TOC for MOBI periodical indexing... Using mastheadImage supplied in manifest... Creating MOBI 6 output Generating in-line TOC... Applying case-transforming CSS... Parsing manglecase.css ... Parsing tocstyle.css ... Rasterizing SVG images..../ Converting XHTML to Mobipocket markup... Serializing markup content... Compressing markup content... Generating MOBI index for a periodical MOBI output written to /tmp/calibre_1.33.0_tmp_RlxFFq/oRCRF6_recipe_out.mobi |
12-10-2014, 03:14 PM | #2 |
Zealot
Posts: 118
Karma: 202232
Join Date: Jun 2010
Location: Texas
Device: Kindle Paperwhite Gen2
|
I've done a little further digging on this, and I'm seeing this error in the log when the book gets assembled:
Code:
Recursion limit reached. Skipping links in... Code:
recursion = 1 Code:
Fetching http://www.reddit.com/r/talesfromtechsupport/comments/2osotl/they_make_minimum_wage_for_a_reason/ Candid: 14.380 div - body#readabilityBody link density 0.021 -> 14.078 Cleaned 0.000 .md - div with weight 0 cause it has too many links 0.449 for its weight 0. Downloaded article:Candid: 6.020 blockquote - .md link density 0.000 -> 6.020 Delete EVERYTHING! from http://www.reddit.com/r/talesfromtechsupport/comments/2ovkc9/delete_everything/Candid: 6.760 blockquote - .md link density 0.000 -> 6.760 |
Advert | |
|
12-10-2014, 10:01 PM | #3 |
creator of calibre
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Remove auto_cleanup = True from your recipe and use keep_only_tags/remove_tags instead.
|
12-10-2014, 11:28 PM | #4 |
Zealot
Posts: 118
Karma: 202232
Join Date: Jun 2010
Location: Texas
Device: Kindle Paperwhite Gen2
|
Brilliant. Worked beautifully!
Thanks for the help. I made a way overdue contribution to calibre on your website. |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
No Author in RSS-Feed "newest" | dosser | Recipes | 0 | 09-13-2013 09:53 AM |
Pulling RSS from listserv w/a login? | Towerblock | Calibre | 1 | 09-13-2010 11:18 PM |
RSS Feed | timezone | Feedback | 8 | 01-02-2010 06:55 PM |
Sci-Fi Author to Answer Reddit Questions | Moejoe | News | 1 | 04-07-2009 04:25 PM |
RSS Feed Prob... | AKninja04 | Calibre | 6 | 08-25-2008 07:51 PM |