Old 12-10-2014, 12:32 PM   #1
jasonfedelem
Zealot
Posts: 118
Karma: 202232
Join Date: Jun 2010
Location: Texas
Device: Kindle Paperwhite Gen2
Reddit RSS feed not pulling author info

Hello everyone.

The following is something I've been trying to figure out for several months. My knowledge of RSS is limited and my knowledge of Python is almost non-existent.

I like reading certain subreddits on reddit.com, and I would like to be able to read them on my Kindle. The problem is that when I do this, calibre does not show me who the author of each story is. This matters because many of the stories are continuations or share over-arching themes, so it's important to know who wrote each one.

This is the rss feed for Tales from Tech Support: http://www.reddit.com/r/talesfromtechsupport/.rss . If you click that link, you will see something similar to this:



Notice that "submitted by" includes the author. However, when I "view source" I don't see anything marking that as an "author" tag. Still, I would think that calibre should include all data that is present in the RSS feed. Instead it strips the author from the output:



Here is the recipe text copied from calibre:

Code:
class AdvancedUserRecipe1418232189(BasicNewsRecipe):
    title          = u'Unknown News Source'
    oldest_article = 7
    max_articles_per_feed = 100
    auto_cleanup = True

    feeds          = []
Thanks in advance to anyone who can help figure this out, as my Kindle usage would skyrocket and my incidence of eye strain would plummet. Here is the output log from when it creates the ebook:

Code:
Fetch news from TFTS
Resolved conversion options
calibre version: 1.33.0
{'asciiize': False,
 'author_sort': None,
 'authors': None,
 'base_font_size': 0,
 'book_producer': None,
 'change_justification': 'original',
 'chapter': None,
 'chapter_mark': 'pagebreak',
 'comments': None,
 'cover': None,
 'debug_pipeline': None,
 'dehyphenate': True,
 'delete_blank_paragraphs': True,
 'disable_font_rescaling': False,
 'dont_compress': False,
 'dont_download_recipe': False,
 'duplicate_links_in_toc': False,
 'embed_all_fonts': False,
 'embed_font_family': None,
 'enable_heuristics': False,
 'expand_css': False,
 'extra_css': None,
 'extract_to': None,
 'filter_css': None,
 'fix_indents': True,
 'font_size_mapping': None,
 'format_scene_breaks': True,
 'html_unwrap_factor': 0.4,
 'input_encoding': None,
 'input_profile': <calibre.customize.profiles.InputProfile object at 0x1689650>,
 'insert_blank_line': False,
 'insert_blank_line_size': 0.5,
 'insert_metadata': False,
 'isbn': None,
 'italicize_common_cases': True,
 'keep_ligatures': False,
 'language': None,
 'level1_toc': None,
 'level2_toc': None,
 'level3_toc': None,
 'line_height': 0,
 'linearize_tables': False,
 'lrf': False,
 'margin_bottom': 5.0,
 'margin_left': 5.0,
 'margin_right': 5.0,
 'margin_top': 5.0,
 'markup_chapter_headings': True,
 'max_toc_links': 50,
 'minimum_line_height': 120.0,
 'mobi_file_type': 'old',
 'mobi_ignore_margins': False,
 'mobi_keep_original_images': False,
 'mobi_toc_at_start': False,
 'no_chapters_in_toc': False,
 'no_inline_navbars': True,
 'no_inline_toc': False,
 'output_profile': <calibre.customize.profiles.KindlePaperWhiteOutput object at 0x1689d90>,
 'page_breaks_before': None,
 'personal_doc': '[PDOC]',
 'prefer_author_sort': False,
 'prefer_metadata_cover': False,
 'pretty_print': False,
 'pubdate': None,
 'publisher': None,
 'rating': None,
 'read_metadata_from_opf': None,
 'remove_fake_margins': True,
 'remove_first_image': False,
 'remove_paragraph_spacing': False,
 'remove_paragraph_spacing_indent_size': 1.5,
 'renumber_headings': True,
 'replace_scene_breaks': '',
 'search_replace': None,
 'series': None,
 'series_index': None,
 'share_not_sync': False,
 'smarten_punctuation': False,
 'sr1_replace': '',
 'sr1_search': '',
 'sr2_replace': '',
 'sr2_search': '',
 'sr3_replace': '',
 'sr3_search': '',
 'start_reading_at': None,
 'subset_embedded_fonts': False,
 'tags': None,
 'test': False,
 'timestamp': None,
 'title': None,
 'title_sort': None,
 'toc_filter': None,
 'toc_threshold': 6,
 'toc_title': None,
 'unsmarten_punctuation': False,
 'unwrap_lines': True,
 'use_auto_toc': False,
 'verbose': 2}
InputFormatPlugin: Recipe Input running
Using custom recipe
Synthesizing mastheadImage
DownloadingDownloading

FetchingDownloading
Fetching  file:///tmp/calibre_1.33.0_tmp_RlxFFq/qa24ai_feeds2disk.htmlfile:///tmp/calibre_1.33.0_tmp_RlxFFq/AwiQLK_feeds2disk.html
Fetching
DownloadingDownloading
 
file:///tmp/calibre_1.33.0_tmp_RlxFFq/BpWnCz_feeds2disk.html
FetchingFetching  file:///tmp/calibre_1.33.0_tmp_RlxFFq/SG6sj6_feeds2disk.htmlfile:///tmp/calibre_1.33.0_tmp_RlxFFq/dqYnLk_feeds2disk.html

WARNING: Encoding detection confidence 82%
Candid: 21.410 .md - div link density 0.000 -> 21.410
Candid: 13.205 div - body#readabilityBody link density 0.050 -> 12.550
Top 5 : 21.410 .md - div
Top 5 : 12.550 div - body#readabilityBody
Candid: 45.510 .md - div link density 0.000 -> 45.510
Candid: 25.255 div - body#readabilityBody link density 0.038 -> 24.287
Top 5 : 45.510 .md - div
Top 5 : 24.287 div - body#readabilityBody
Candid: 98.005 .md - div link density 0.000 -> 98.005
Processing images...
Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/dqYnLk_feeds2disk.html
Candid: 46.990 div - body#readabilityBody link density 0.008 -> 46.607
Candid: 13.000 blockquote - .md link density 0.000 -> 13.000file:///tmp/calibre_1.33.0_tmp_RlxFFq/dqYnLk_feeds2disk.html 
saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_4/dqYnLk_feeds2disk.xhtml
Candid:  5.370 blockquote - .md link density 0.000 ->  5.370
Candid:  8.680 blockquote - .md link density 0.000 ->  8.680Downloading
Top 5 : 98.005 .md - div

FetchingTop 5 : 46.607 div - body#readabilityBody
 Top 5 : 13.000 blockquote - .mdfile:///tmp/calibre_1.33.0_tmp_RlxFFq/eI7GXz_feeds2disk.html

Top 5 :  8.680 blockquote - .md
Top 5 :  5.370 blockquote - .mdProcessing images...

Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/AwiQLK_feeds2disk.html
file:///tmp/calibre_1.33.0_tmp_RlxFFq/AwiQLK_feeds2disk.html saved toDownloaded article: /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_1/AwiQLK_feeds2disk.xhtml That's it. Christmas is canceled! from 
http://www.reddit.com/r/talesfromtechsupport/comments/2orn2t/thats_it_christmas_is_canceled/
Downloaded article:Processing images...
 "I hope you lose your job and that your entire company goes under" Recursion limit reached. Skipping links infrom  Downloadinghttp://www.reddit.com/r/talesfromtechsupport/comments/2ou9ax/i_hope_you_lose_your_job_and_that_your_entire/file:///tmp/calibre_1.33.0_tmp_RlxFFq/SG6sj6_feeds2disk.html


Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/ECBqLN_feeds2disk.html
Candid: 36.150 .md - div link density 0.000 -> 36.150
file:///tmp/calibre_1.33.0_tmp_RlxFFq/SG6sj6_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_3/SG6sj6_feeds2disk.xhtml
Candid: 20.575 div - body#readabilityBody link density 0.024 -> 20.084
DownloadingTop 5 : 36.150 .md - div

Top 5 : 20.084 div - body#readabilityBody
Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/7W3NCX_feeds2disk.html
Candid: 31.910 .md - div link density 0.449 -> 17.590
Processing images...
Candid: 38.170 .md - div link density 0.000 -> 38.170
Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/eI7GXz_feeds2disk.html
file:///tmp/calibre_1.33.0_tmp_RlxFFq/eI7GXz_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_5/eI7GXz_feeds2disk.xhtml
Candid: 21.585 div - body#readabilityBody link density 0.021 -> 21.142
Top 5 : 38.170 .md - div
Top 5 : 21.142 div - body#readabilityBody
Downloading
Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/RUaZn3_feeds2disk.html
Processing images...
Candid: 45.690 .md - div link density 0.000 -> 45.690Recursion limit reached. Skipping links in
 file:///tmp/calibre_1.33.0_tmp_RlxFFq/ECBqLN_feeds2disk.html
file:///tmp/calibre_1.33.0_tmp_RlxFFq/ECBqLN_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_6/ECBqLN_feeds2disk.xhtml
Candid: 25.345 div - body#readabilityBody link density 0.015 -> 24.953
Top 5 : 45.690 .md - div
Top 5 : 24.953 div - body#readabilityBody
Downloading
Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/CALAQg_feeds2disk.html
WARNING: Encoding detection confidence 99%
Candid: 34.085 .md - div link density 0.000 -> 34.085
Candid: 156.025 .md - div link density 0.002 -> 155.711
Candid: 16.520 div - body#readabilityBody link density 0.023 -> 16.138
Candid: 15.090 blockquote - .md link density 0.000 -> 15.090
Top 5 : 34.085 .md - div
Top 5 : 16.138 div - body#readabilityBody
Processing images...Top 5 : 15.090 blockquote - .md

Candid: 47.745 div - body#readabilityBody link density 0.006 -> 47.482
Recursion limit reached. Skipping links inCandid: 18.455 div - body#readabilityBody link density 0.452 -> 10.106 
Top 5 : 17.590 .md - divfile:///tmp/calibre_1.33.0_tmp_RlxFFq/7W3NCX_feeds2disk.htmlCandid: 24.540 blockquote - .md link density 0.000 -> 24.540
Top 5 : 10.106 div - body#readabilityBody


Downloaded article:Candid: 46.250 blockquote - .md link density 0.000 -> 46.250 
"Hey, did you throw away our records?" from http://www.reddit.com/r/talesfromtechsupport/comments/2ounzq/hey_did_you_throw_away_our_records/
file:///tmp/calibre_1.33.0_tmp_RlxFFq/7W3NCX_feeds2disk.htmlDownloaded article: saved to  /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_7/7W3NCX_feeds2disk.xhtmlOh, I'm sorry - was that important? *crunch crunch* fromCandid:  6.630 blockquote - .md link density 0.000 ->  6.630 
http://www.reddit.com/r/talesfromtechsupport/comments/2ottif/oh_im_sorry_was_that_important_crunch_crunch/

Downloaded article: The end user and their phone from http://www.reddit.com/r/talesfromtechsupport/comments/2oupr0/the_end_user_and_their_phone/
Downloaded article: 12 year old PHP scripts and SQL server 2008R2 Candid:  5.440 blockquote - .md link density 0.000 ->  5.440
from http://www.reddit.com/r/talesfromtechsupport/comments/2ovblb/12_year_old_php_scripts_and_sql_server_2008r2/Downloading

Candid: 10.840 blockquote - .md link density 0.000 -> 10.840Fetching
 file:///tmp/calibre_1.33.0_tmp_RlxFFq/cvJFgv_feeds2disk.html
Candid: 11.590 blockquote - .md link density 0.000 -> 11.590
Candid: 46.780 blockquote - .md link density 0.000 -> 46.780
Top 5 : 155.711 .md - div
Top 5 : 47.482 div - body#readabilityBody
Top 5 : 46.780 blockquote - .md
Top 5 : 46.250 blockquote - .md
Top 5 : 24.540 blockquote - .md
Processing images...
Candid: 25.890 .md - div link density 0.000 -> 25.890
Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/RUaZn3_feeds2disk.html
file:///tmp/calibre_1.33.0_tmp_RlxFFq/RUaZn3_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_8/RUaZn3_feeds2disk.xhtml
Candid: 15.445 div - body#readabilityBody link density 0.032 -> 14.946
Processing images...
Top 5 : 25.890 .md - div
DownloadingTop 5 : 14.946 div - body#readabilityBody
Recursion limit reached. Skipping links in 
file:///tmp/calibre_1.33.0_tmp_RlxFFq/BpWnCz_feeds2disk.html
Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/8NuxtU_feeds2disk.html
file:///tmp/calibre_1.33.0_tmp_RlxFFq/BpWnCz_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_2/BpWnCz_feeds2disk.xhtml
Processing images...
Downloading
Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/cvJFgv_feeds2disk.htmlFetching
 file:///tmp/calibre_1.33.0_tmp_RlxFFq/vxYygt_feeds2disk.html
file:///tmp/calibre_1.33.0_tmp_RlxFFq/cvJFgv_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_10/cvJFgv_feeds2disk.xhtml
Candid: 106.475 .md - div link density 0.000 -> 106.475
Cleaned  0.000 .md - div with weight 0 cause it has too many links 0.449 for its weight 0.
DownloadingCandid: 33.665 div - body#readabilityBody link density 0.005 -> 33.489

Candid: 22.170 .md - div link density 0.000 -> 22.170
Cleaned  0.000 div - body with weight 0 cause it has too short content length 0 without a single image.
Fetching Candid: 32.420 blockquote - .md link density 0.000 -> 32.420file:///tmp/calibre_1.33.0_tmp_RlxFFq/R8MqWc_feeds2disk.html

Candid:  9.960 blockquote - .md link density 0.000 ->  9.960
Candid: 18.460 blockquote - .md link density 0.000 -> 18.460
Candid: 10.825 div - body#readabilityBody link density 0.041 -> 10.380
Downloaded article: Something must be wrong with your system!Candid: 39.450 blockquote - .md link density 0.000 -> 39.450
Candid:  5.400 blockquote - .md link density 0.000 ->  5.400
Top 5 : 106.475 .md - div
Top 5 : 39.450 blockquote - .md from http://www.reddit.com/r/talesfromtechsupport/comments/2ott55/something_must_be_wrong_with_your_system/
Candid: 11.640 blockquote - .md link density 0.000 -> 11.640

Top 5 : 33.489 div - body#readabilityBodyDownloaded article: Top 5 : 22.170 .md - div
Some people are traidies for a reason. Top 5 : 32.420 blockquote - .md

fromTop 5 : 11.640 blockquote - .md 
Top 5 : 10.380 div - body#readabilityBodyTop 5 : 18.460 blockquote - .mdhttp://www.reddit.com/r/talesfromtechsupport/comments/2ourw7/some_people_are_traidies_for_a_reason/


Downloaded article:Top 5 :  5.400 blockquote - .md You can't complain about not having something if you have it!
 from http://www.reddit.com/r/talesfromtechsupport/comments/2ou956/you_cant_complain_about_not_having_something_if/
Processing images...
Processing images...
Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/CALAQg_feeds2disk.html
Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/8NuxtU_feeds2disk.html
file:///tmp/calibre_1.33.0_tmp_RlxFFq/CALAQg_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_9/CALAQg_feeds2disk.xhtml
file:///tmp/calibre_1.33.0_tmp_RlxFFq/8NuxtU_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_11/8NuxtU_feeds2disk.xhtml
Downloading
Fetching Downloading
file:///tmp/calibre_1.33.0_tmp_RlxFFq/p_2C_i_feeds2disk.html
Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/YRg0Ad_feeds2disk.html
Candid: 87.415 .md - div link density 0.001 -> 87.328
Candid: 23.265 div - body#readabilityBody link density 0.008 -> 23.082
Candid: 30.840 .md - div link density 0.000 -> 30.840
Candid: 11.040 blockquote - .md link density 0.000 -> 11.040
Candid: 10.460 blockquote - .md link density 0.000 -> 10.460
Candid:  9.110 blockquote - .md link density 0.000 ->  9.110
Candid: 17.920 div - body#readabilityBody link density 0.020 -> 17.565
Candid: 31.910 .md - div link density 0.449 -> 17.590
Top 5 : 30.840 .md - div
Top 5 : 17.565 div - body#readabilityBodyCandid: 17.180 blockquote - .md link density 0.000 -> 17.180

Candid: 34.010 blockquote - .md link density 0.000 -> 34.010
Candid:  8.460 blockquote - .md link density 0.000 ->  8.460
Candid: 22.510 blockquote - .md link density 0.000 -> 22.510
Top 5 : 87.328 .md - div
Top 5 : 34.010 blockquote - .md
Top 5 : 23.082 div - body#readabilityBody
Top 5 : 22.510 blockquote - .md
Top 5 : 17.180 blockquote - .md
Candid: 70.400 .md - div link density 0.000 -> 70.400
Candid: 37.700 div - body#readabilityBody link density 0.011 -> 37.291
Top 5 : 70.400 .md - div
Top 5 : 37.291 div - body#readabilityBody
Processing images...
Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/p_2C_i_feeds2disk.html
Processing images...
file:///tmp/calibre_1.33.0_tmp_RlxFFq/p_2C_i_feeds2disk.html Recursion limit reached. Skipping links inDownloaded article:  Super Dooper VIP, or Screenshots Can Save Your Buttfile:///tmp/calibre_1.33.0_tmp_RlxFFq/vxYygt_feeds2disk.html 
from saved tohttp://www.reddit.com/r/talesfromtechsupport/comments/2osxj9/super_dooper_vip_or_screenshots_can_save_your_butt/ 
/tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_14/p_2C_i_feeds2disk.xhtml
Downloaded article: Closure Code: No fault found from http://www.reddit.com/r/talesfromtechsupport/comments/2oumkx/closure_code_no_fault_found/
file:///tmp/calibre_1.33.0_tmp_RlxFFq/vxYygt_feeds2disk.htmlProcessing images...
 Downloading
saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_12/vxYygt_feeds2disk.xhtmlRecursion limit reached. Skipping links in 
Fetching Downloaded article:file:///tmp/calibre_1.33.0_tmp_RlxFFq/6Dly2y_feeds2disk.htmlfile:///tmp/calibre_1.33.0_tmp_RlxFFq/YRg0Ad_feeds2disk.html

 Lab Computer != your computer from http://www.reddit.com/r/talesfromtechsupport/comments/2oskhk/lab_computer_your_computer/
file:///tmp/calibre_1.33.0_tmp_RlxFFq/YRg0Ad_feeds2disk.html Downloading
saved toDownloaded article: My story of woe .... Part 2 /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_15/YRg0Ad_feeds2disk.xhtml
 Fetching fromfile:///tmp/calibre_1.33.0_tmp_RlxFFq/_uGgKn_feeds2disk.html
 http://www.reddit.com/r/talesfromtechsupport/comments/2otwov/my_story_of_woe_part_2/
WARNING: Encoding detection confidence 99%
Downloaded article: Downloading
"That isn't possible. I cannot be making a typo" Candid: 18.455 div - body#readabilityBody link density 0.452 -> 10.106
fromTop 5 : 17.590 .md - div http://www.reddit.com/r/talesfromtechsupport/comments/2ospb3/that_isnt_possible_i_cannot_be_making_a_typo/
Top 5 : 10.106 div - body#readabilityBody
Fetching
 file:///tmp/calibre_1.33.0_tmp_RlxFFq/M_3jDj_feeds2disk.html
Candid: 167.810 .md - div link density 0.004 -> 167.139
Candid: 38.710 .md - div link density 0.000 -> 38.710
Candid: 60.575 div - body#readabilityBody link density 0.009 -> 60.040
Candid: 17.500 blockquote - .md link density 0.000 -> 17.500
Candid: 30.840 blockquote - .md link density 0.000 -> 30.840
Candid: 21.855 div - body#readabilityBody link density 0.017 -> 21.478
Top 5 : 38.710 .md - divCandid:  5.320 blockquote - .md link density 0.000 ->  5.320
Top 5 : 21.478 div - body#readabilityBody

Candid: 14.550 blockquote - .md link density 0.000 -> 14.550
Candid: 84.095 .md - div link density 0.000 -> 84.095
Candid: 10.210 blockquote - .md link density 0.000 -> 10.210
Candid: 48.240 .md - div link density 0.000 -> 48.240
Candid: 13.960 blockquote - .md link density 0.000 -> 13.960
Candid:  5.940 blockquote - .md link density 0.000 ->  5.940
Candid: 29.000 blockquote - .md link density 0.000 -> 29.000
Top 5 : 167.139 .md - div
Candid: 38.015 div - body#readabilityBody link density 0.010 -> 37.633
Top 5 : 60.040 div - body#readabilityBody
Top 5 : 30.840 blockquote - .mdCandid: 23.520 div - body#readabilityBody link density 0.014 -> 23.201

Candid:  8.530 blockquote - .md link density 0.000 ->  8.530Top 5 : 29.000 blockquote - .md

Processing images...
Top 5 : 17.500 blockquote - .md
Candid:  9.420 blockquote - .md link density 0.000 ->  9.420
Candid:  5.950 blockquote - .md link density 0.000 ->  5.950
Candid:  5.520 blockquote - .md link density 0.000 ->  5.520
Recursion limit reached. Skipping links in Candid: 13.260 blockquote - .md link density 0.000 -> 13.260file:///tmp/calibre_1.33.0_tmp_RlxFFq/M_3jDj_feeds2disk.html

Candid:  6.460 blockquote - .md link density 0.000 ->  6.460
Top 5 : 48.240 .md - div
file:///tmp/calibre_1.33.0_tmp_RlxFFq/M_3jDj_feeds2disk.htmlTop 5 : 23.201 div - body#readabilityBody Candid: 10.390 blockquote - .md link density 0.000 -> 10.390
saved to
Top 5 :  9.420 blockquote - .mdTop 5 : 84.095 .md - div /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_18/M_3jDj_feeds2disk.xhtml


Top 5 :  6.460 blockquote - .mdTop 5 : 37.633 div - body#readabilityBody

Top 5 :  5.520 blockquote - .mdTop 5 : 13.260 blockquote - .mdCleaned  0.000 .md - div with weight 0 cause it has too many links 0.449 for its weight 0.


Top 5 : 10.390 blockquote - .md
Top 5 :  8.530 blockquote - .md
Cleaned  0.000 div - body with weight 0 cause it has too short content length 0 without a single image.
Downloading
Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/PfqqTB_feeds2disk.html
Processing images...
Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/R8MqWc_feeds2disk.html
Processing images...
Downloaded article:file:///tmp/calibre_1.33.0_tmp_RlxFFq/R8MqWc_feeds2disk.html  saved toThey make minimum wage for a reason.Processing images...  /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_13/R8MqWc_feeds2disk.xhtml
from
 http://www.reddit.com/r/talesfromtechsupport/comments/2osotl/they_make_minimum_wage_for_a_reason/
Recursion limit reached. Skipping links inRecursion limit reached. Skipping links in Processing images...file:///tmp/calibre_1.33.0_tmp_RlxFFq/qa24ai_feeds2disk.html 
file:///tmp/calibre_1.33.0_tmp_RlxFFq/_uGgKn_feeds2disk.html

Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/6Dly2y_feeds2disk.html
file:///tmp/calibre_1.33.0_tmp_RlxFFq/qa24ai_feeds2disk.html Candid: 32.180 .md - div link density 0.000 -> 32.180saved tofile:///tmp/calibre_1.33.0_tmp_RlxFFq/_uGgKn_feeds2disk.html Downloading /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_0/qa24ai_feeds2disk.xhtml

saved to
 Fetching/tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_17/_uGgKn_feeds2disk.xhtml
 file:///tmp/calibre_1.33.0_tmp_RlxFFq/b1gRIR_feeds2disk.html
Candid: 18.590 div - body#readabilityBody link density 0.029 -> 18.052
file:///tmp/calibre_1.33.0_tmp_RlxFFq/6Dly2y_feeds2disk.htmlTop 5 : 32.180 .md - div saved to
 Downloading
/tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_16/6Dly2y_feeds2disk.xhtmlTop 5 : 18.052 div - body#readabilityBody
Fetching 
file:///tmp/calibre_1.33.0_tmp_RlxFFq/is6VKn_feeds2disk.htmlDownloading

Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/CHICZH_feeds2disk.html
Downloading
FetchingCandid: 28.110 .md - div link density 0.000 -> 28.110
 file:///tmp/calibre_1.33.0_tmp_RlxFFq/bTyj9t_feeds2disk.html
Downloaded article: My story of woe.... Part 3 from http://www.reddit.com/r/talesfromtechsupport/comments/2oviiz/my_story_of_woe_part_3/Processing images...
Downloaded article: TFTS Top Tales - November 2014WARNING: Encoding detection confidence 88% 
Candid: 16.555 div - body#readabilityBody link density 0.039 -> 15.913

Top 5 : 28.110 .md - divfrom http://www.reddit.com/r/talesfromtechsupport/comments/2onlu7/tfts_top_tales_november_2014/

Top 5 : 15.913 div - body#readabilityBody
Downloaded article: He obviously didn't attend at Winterhold... Recursion limit reached. Skipping links in from file:///tmp/calibre_1.33.0_tmp_RlxFFq/PfqqTB_feeds2disk.html
http://www.reddit.com/r/talesfromtechsupport/comments/2oslif/he_obviously_didnt_attend_at_winterhold/
Downloaded article:Candid: 14.000 .md - div link density 0.000 -> 14.000 
I'm using Chrome??? from file:///tmp/calibre_1.33.0_tmp_RlxFFq/PfqqTB_feeds2disk.html http://www.reddit.com/r/talesfromtechsupport/comments/2osb7c/im_using_chrome/saved to
 /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_19/PfqqTB_feeds2disk.xhtml
Candid:  9.500 div - body#readabilityBody link density 0.066 ->  8.875
Top 5 : 14.000 .md - div
Top 5 :  8.875 div - body#readabilityBody
Downloading
Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/Bc5xNH_feeds2disk.htmlProcessing images...

Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/b1gRIR_feeds2disk.html
file:///tmp/calibre_1.33.0_tmp_RlxFFq/b1gRIR_feeds2disk.html WARNING: Encoding detection confidence 84%saved to
 Downloaded article: /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_20/b1gRIR_feeds2disk.xhtmlDelete EVERYTHING! 
from http://www.reddit.com/r/talesfromtechsupport/comments/2ovkc9/delete_everything/
Downloaded article: It helps to read AND comprehend from http://www.reddit.com/r/talesfromtechsupport/comments/2osqtu/it_helps_to_read_and_comprehend/
Downloading
Fetching file:///tmp/calibre_1.33.0_tmp_RlxFFq/L4M4qG_feeds2disk.html
Processing images...
Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/bTyj9t_feeds2disk.html
file:///tmp/calibre_1.33.0_tmp_RlxFFq/bTyj9t_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_23/bTyj9t_feeds2disk.xhtml
Candid: 65.260 .md - div link density 0.000 -> 65.260
Candid: 18.555 div - body#readabilityBody link density 0.008 -> 18.401
Candid: 10.920 blockquote - .md link density 0.000 -> 10.920
Candid: 15.100 blockquote - .md link density 0.000 -> 15.100
Candid: 49.280 blockquote - .md link density 0.000 -> 49.280
Top 5 : 65.260 .md - div
Top 5 : 49.280 blockquote - .md
Top 5 : 18.401 div - body#readabilityBody
Top 5 : 15.100 blockquote - .md
Top 5 : 10.920 blockquote - .md
Candid: 63.730 .md - div link density 0.000 -> 63.730
Candid: 26.455 div - body#readabilityBody link density 0.012 -> 26.141
Processing images...Candid: 13.180 blockquote - .md link density 0.000 -> 13.180

Candid: 24.460 blockquote - .md link density 0.000 -> 24.460
Recursion limit reached. Skipping links inTop 5 : 63.730 .md - div
 file:///tmp/calibre_1.33.0_tmp_RlxFFq/CHICZH_feeds2disk.htmlTop 5 : 26.141 div - body#readabilityBody

Top 5 : 24.460 blockquote - .md
Top 5 : 13.180 blockquote - .md
file:///tmp/calibre_1.33.0_tmp_RlxFFq/CHICZH_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_22/CHICZH_feeds2disk.xhtml
Candid: 117.845 .md - div link density 0.000 -> 117.845
Candid: 170.135 .md - div link density 0.000 -> 170.135
Candid: 53.800 div - body#readabilityBody link density 0.006 -> 53.504
Candid:  7.690 blockquote - .md link density 0.000 ->  7.690
Candid: 70.890 div - body#readabilityBody link density 0.004 -> 70.584
Candid: 26.470 blockquote - .md link density 0.000 -> 26.470
Processing images...
Candid:  5.470 blockquote - .md link density 0.000 ->  5.470
Candid:  5.330 blockquote - .md link density 0.000 ->  5.330Candid:  5.920 blockquote - .md link density 0.000 ->  5.920

Top 5 : 117.845 .md - div
Candid:  5.970 blockquote - .md link density 0.000 ->  5.970Top 5 : 53.504 div - body#readabilityBody

Top 5 : 26.470 blockquote - .mdCandid:  9.440 blockquote - .md link density 0.000 ->  9.440

Recursion limit reached. Skipping links inCandid: 13.690 blockquote - .md link density 0.000 -> 13.690 
file:///tmp/calibre_1.33.0_tmp_RlxFFq/L4M4qG_feeds2disk.html
Top 5 :  7.690 blockquote - .mdCandid:  6.910 blockquote - .md link density 0.000 ->  6.910

Top 5 :  5.330 blockquote - .md
Candid:  6.060 blockquote - .md link density 0.000 ->  6.060
file:///tmp/calibre_1.33.0_tmp_RlxFFq/L4M4qG_feeds2disk.htmlCandid:  5.410 blockquote - .md link density 0.000 ->  5.410
 saved toCandid:  8.900 blockquote - .md link density 0.000 ->  8.900
 /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_25/L4M4qG_feeds2disk.xhtml
Candid:  9.130 blockquote - .md link density 0.000 ->  9.130
Candid: 20.210 blockquote - .md link density 0.000 -> 20.210
Candid:  5.600 blockquote - .md link density 0.000 ->  5.600
Top 5 : 170.135 .md - div
Top 5 : 70.584 div - body#readabilityBody
Top 5 : 20.210 blockquote - .md
Top 5 : 13.690 blockquote - .md
Top 5 :  9.440 blockquote - .md
Processing images...
Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/Bc5xNH_feeds2disk.html
file:///tmp/calibre_1.33.0_tmp_RlxFFq/Bc5xNH_feeds2disk.html saved to /tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_24/Bc5xNH_feeds2disk.xhtml
Processing images...
Recursion limit reached. Skipping links in file:///tmp/calibre_1.33.0_tmp_RlxFFq/is6VKn_feeds2disk.html
Downloaded article: Today, someone found out a battery is needed to power a laptop. from http://www.reddit.com/r/talesfromtechsupport/comments/2oszes/today_someone_found_out_a_battery_is_needed_to/
file:///tmp/calibre_1.33.0_tmp_RlxFFq/is6VKn_feeds2disk.htmlDownloaded article:  saved toWebsense doesn't block adult sites, but it blocks Facebook messages!  from/tmp/calibre_1.33.0_tmp_RlxFFq/xByULf_plumber/feed_0/article_21/is6VKn_feeds2disk.xhtml 
http://www.reddit.com/r/talesfromtechsupport/comments/2orpym/websense_doesnt_block_adult_sites_but_it_blocks/
Downloaded article: A lesson in (natural) Disaster Recovery, the tale of a terrified intern. Part 1 from http://www.reddit.com/r/talesfromtechsupport/comments/2os84v/a_lesson_in_natural_disaster_recovery_the_tale_of/
Downloaded article: Hard of Hearing from http://www.reddit.com/r/talesfromtechsupport/comments/2ospp3/hard_of_hearing/
Downloaded article: IT destroys payment system to fix a printer (and failed) from http://www.reddit.com/r/talesfromtechsupport/comments/2oqxph/it_destroys_payment_system_to_fix_a_printer_and/
Parsing all content...
Parsing feed_0/article_16/index.html ...
Forcing feed_0/article_16/index.html into XHTML namespace
Parsing feed_0/article_11/index.html ...
Forcing feed_0/article_11/index.html into XHTML namespace
Parsing feed_0/article_23/index.html ...
Forcing feed_0/article_23/index.html into XHTML namespace
Parsing feed_0/article_5/index.html ...
Forcing feed_0/article_5/index.html into XHTML namespace
Parsing feed_0/article_15/index.html ...
Forcing feed_0/article_15/index.html into XHTML namespace
Parsing feed_0/article_0/index.html ...
Forcing feed_0/article_0/index.html into XHTML namespace
Parsing feed_0/article_6/index.html ...
Forcing feed_0/article_6/index.html into XHTML namespace
Parsing feed_0/article_9/index.html ...
Forcing feed_0/article_9/index.html into XHTML namespace
Parsing feed_0/article_4/index.html ...
Forcing feed_0/article_4/index.html into XHTML namespace
Parsing feed_0/article_10/index.html ...
Forcing feed_0/article_10/index.html into XHTML namespace
Parsing feed_0/article_22/index.html ...
Forcing feed_0/article_22/index.html into XHTML namespace
Parsing feed_0/article_17/index.html ...
Forcing feed_0/article_17/index.html into XHTML namespace
Parsing feed_0/article_20/index.html ...
Forcing feed_0/article_20/index.html into XHTML namespace
Parsing feed_0/article_24/index.html ...
Forcing feed_0/article_24/index.html into XHTML namespace
Parsing feed_0/article_13/index.html ...
Forcing feed_0/article_13/index.html into XHTML namespace
Parsing feed_0/article_21/index.html ...
Forcing feed_0/article_21/index.html into XHTML namespace
Parsing feed_0/article_2/index.html ...
Forcing feed_0/article_2/index.html into XHTML namespace
Parsing feed_0/article_1/index.html ...
Forcing feed_0/article_1/index.html into XHTML namespace
Parsing feed_0/article_14/index.html ...
Forcing feed_0/article_14/index.html into XHTML namespace
Parsing feed_0/article_18/index.html ...
Forcing feed_0/article_18/index.html into XHTML namespace
Parsing feed_0/article_3/index.html ...
Forcing feed_0/article_3/index.html into XHTML namespace
Parsing feed_0/article_8/index.html ...
Forcing feed_0/article_8/index.html into XHTML namespace
Parsing feed_0/index.html ...
Initial parse failed, using more forgiving parsers
Parsing feed_0/index.html as HTML
Parsing index.html ...
Forcing index.html into XHTML namespace
Parsing feed_0/article_7/index.html ...
Forcing feed_0/article_7/index.html into XHTML namespace
Parsing feed_0/article_25/index.html ...
Forcing feed_0/article_25/index.html into XHTML namespace
Parsing feed_0/article_19/index.html ...
Forcing feed_0/article_19/index.html into XHTML namespace
Parsing feed_0/article_12/index.html ...
Forcing feed_0/article_12/index.html into XHTML namespace
Referenced file u'feed_1/index.html' not found
Referenced file u'feed_0/article_2/file%3a/u/reaganFF' not found
Reading TOC from NCX...
Merging user specified metadata...
Detecting structure...
Flattening CSS and remapping font sizes...
Source base font size is 12.00000pt
Removing fake margins...
Found 28 items of level: div_1
Found 27 items of level: div_2
Found 26 items of level: div_4
Found 224 items of level: p_4
Found 2 items of level: p_2
Found 253 items of level: p_3
Ignoring level p_2
div_1  left margin stats: Counter({u'': 23})
div_1  right margin stats: Counter({u'': 23})
div_2  left margin stats: Counter({u'': 21})
div_2  right margin stats: Counter({u'': 21})
div_4  left margin stats: Counter({u'': 26})
div_4  right margin stats: Counter({u'': 26})
p_4  left margin stats: Counter({u'0': 224})
p_4  right margin stats: Counter({u'0': 224})
p_3  left margin stats: Counter({u'0': 253})
p_3  right margin stats: Counter({u'0': 253})
Cleaning up manifest...
Trimming unused files from manifest...
Creating MOBI Output...
Serializing resources...
Converting TOC for MOBI periodical indexing...
Using mastheadImage supplied in manifest...
Creating MOBI 6 output
Generating in-line TOC...
Applying case-transforming CSS...
Parsing manglecase.css ...
Parsing tocstyle.css ...
Rasterizing SVG images..../
Converting XHTML to Mobipocket markup...
Serializing markup content...
  Compressing markup content...
Generating MOBI index for a periodical
MOBI output written to /tmp/calibre_1.33.0_tmp_RlxFFq/oRCRF6_recipe_out.mobi
If I have left anything out, please let me know and I will provide whatever is missing. Any help would be greatly appreciated.
Old 12-10-2014, 03:14 PM   #2
jasonfedelem
I've done a little further digging on this, and I'm seeing this error in the log when the book gets assembled:

Code:
Recursion limit reached. Skipping links in...
Therefore, I added the following statement to the recipe:
Code:
recursion = 1
I deleted the previous copy of the "book" and re-fetched it. It still doesn't work, but the root cause looks different:

Code:
Fetching http://www.reddit.com/r/talesfromtechsupport/comments/2osotl/they_make_minimum_wage_for_a_reason/
Candid: 14.380 div - body#readabilityBody link density 0.021 -> 14.078
Cleaned  0.000 .md - div with weight 0 cause it has too many links 0.449 for its weight 0.
Downloaded article:Candid:  6.020 blockquote - .md link density 0.000 ->  6.020 
Delete EVERYTHING! from http://www.reddit.com/r/talesfromtechsupport/comments/2ovkc9/delete_everything/Candid:  6.760 blockquote - .md link density 0.000 ->  6.760
Off to try to figure out what weighting of links means.
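
For anyone else puzzling over the "link density" figures in that log: as far as I understand it, readability-style cleanup scores a block by how much of its text sits inside links, and drops blocks where that share is too high for their weight. The sketch below is only an illustration of that ratio using the standard library. It is not calibre's actual code, and the names `link_density` and `_LinkDensityParser` are made up for this example.

```python
from html.parser import HTMLParser


class _LinkDensityParser(HTMLParser):
    """Counts text characters overall and text characters inside <a> tags."""

    def __init__(self):
        super().__init__()
        self.anchor_depth = 0   # > 0 while we are inside an <a> element
        self.total_chars = 0
        self.link_chars = 0

    def handle_starttag(self, tag, attrs):
        if tag == 'a':
            self.anchor_depth += 1

    def handle_endtag(self, tag):
        if tag == 'a' and self.anchor_depth > 0:
            self.anchor_depth -= 1

    def handle_data(self, data):
        n = len(data.strip())
        self.total_chars += n
        if self.anchor_depth:
            self.link_chars += n


def link_density(html):
    """Return the fraction of text characters that are inside links (0.0 to 1.0)."""
    parser = _LinkDensityParser()
    parser.feed(html)
    parser.close()
    if parser.total_chars == 0:
        return 0.0
    return parser.link_chars / parser.total_chars


if __name__ == '__main__':
    sample = '<div><a href="#">abcd</a>efgh</div>'
    print(link_density(sample))
```

Read that way, the "Cleaned ... .md" line suggests the post body was discarded because roughly 45% of its text was link text.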
Old 12-10-2014, 10:01 PM   #3
kovidgoyal
creator of calibre
Posts: 43,860
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Remove auto_cleanup = True from your recipe and use keep_only_tags/remove_tags instead.
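
A minimal sketch of what that change might look like in a recipe, for readers who want a starting point. The attribute names (`feeds`, `keep_only_tags`, `remove_tags`, `oldest_article`, `max_articles_per_feed`) are standard `BasicNewsRecipe` attributes, but the class name, the numeric values, and especially the CSS class names in the selectors are placeholders: they are assumptions, not reddit's real markup, so inspect the page source and substitute the actual containers. The `try/except` fallback exists only so the snippet can be syntax-checked outside calibre.

```python
try:
    from calibre.web.feeds.news import BasicNewsRecipe
except ImportError:
    # Fallback so this sketch can be loaded outside calibre; inside calibre
    # the real base class is always available.
    BasicNewsRecipe = object


class TalesFromTechSupport(BasicNewsRecipe):
    title = 'Tales from Tech Support'
    oldest_article = 7            # days; placeholder value
    max_articles_per_feed = 25    # placeholder value

    # Note: no "auto_cleanup = True" here. The heuristic cleanup is replaced
    # by the explicit tag filters below.

    feeds = [
        ('Tales from Tech Support',
         'http://www.reddit.com/r/talesfromtechsupport/.rss'),
    ]

    # Keep only the part of the page that holds the post and its author line.
    # 'PLACEHOLDER-post-container' is hypothetical; use the real class or id.
    keep_only_tags = [
        dict(name='div', attrs={'class': 'PLACEHOLDER-post-container'}),
    ]

    # Drop anything inside the kept region that should not reach the e-book.
    # 'PLACEHOLDER-comments' is hypothetical as well.
    remove_tags = [
        dict(name='div', attrs={'class': 'PLACEHOLDER-comments'}),
    ]
```

Because `keep_only_tags` retains whatever is inside the matched element, the "submitted by" line survives as long as it sits within the container you select.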
Old 12-10-2014, 11:28 PM   #4
jasonfedelem
Brilliant. Worked beautifully!

Thanks for the help. I made a way overdue contribution to calibre on your website.
All times are GMT -4.