![]() |
#1 |
Junior Member
![]() Posts: 6
Karma: 10
Join Date: Aug 2010
Device: Kindle 3
|
Slate News Source omits most articles and has duplicates
Hi,
First of all I want to express my appreciation for such a great application! Having just got my Kindle 3 I am loving Calibre sending feeds to my Kindle to keep on top of news. It's working great! Except for the Slate news source. This recipe omits most articles (only a small subset of the days articles from Slate.com are actually in the mobi output). Further, the first 5 articles are duplicated, showing up twice, though the rest of the articles aren't duplicated. This behavior is consistent for each fectch over the last few days. I've tried to muck with the python source code for this feed to fix the issue but without success. I simply don't have the technical know-how to fix this on my own. On the omitting articles problem, I know the recipe is set up to skip Slate Video and Gabfest podcast entries, but there are many true articles it skips. I noticed the python script is configured to skip any entry that has "http://twitter.com/Slate' in it, maybe that is an overzealous filter that's causing this? I'm not sure, since when I try to remove that text string from the python code the resulting MOBI is empty, so it's not so simple as that. Here is the job detail log. Can someone provide feedback on how to fix this? Thanks! Code:
Fetch news from Slate Resolved conversion options calibre version: 0.7.16 {'asciiize': False, 'author_sort': None, 'authors': None, 'base_font_size': 0, 'book_producer': None, 'change_justification': 'original', 'chapter': None, 'chapter_mark': 'pagebreak', 'comments': None, 'cover': None, 'debug_pipeline': None, 'disable_font_rescaling': False, 'dont_compress': False, 'dont_download_recipe': False, 'extra_css': None, 'font_size_mapping': None, 'footer_regex': '(?i)(?<=<hr>)((\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?\\d+<br>\\s*.*?\\s*)|(\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?.*?<br>\\s*\\d+))(?=<br>)', 'header_regex': '(?i)(?<=<hr>)((\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?\\d+<br>\\s*.*?\\s*)|(\\s*<a name=\\d+></a>((<img.+?>)*<br>\\s*)?.*?<br>\\s*\\d+))(?=<br>)', 'input_encoding': None, 'input_profile': <calibre.customize.profiles.InputProfile object at 0x04C69950>, 'insert_blank_line': False, 'insert_metadata': False, 'isbn': None, 'keep_ligatures': False, 'language': None, 'level1_toc': None, 'level2_toc': None, 'level3_toc': None, 'line_height': 0, 'linearize_tables': False, 'lrf': False, 'margin_bottom': 5.0, 'margin_left': 5.0, 'margin_right': 5.0, 'margin_top': 5.0, 'max_toc_links': 50, 'no_chapters_in_toc': False, 'no_inline_navbars': True, 'no_inline_toc': False, 'output_profile': <calibre.customize.profiles.KindleOutput object at 0x04C69C30>, 'page_breaks_before': None, 'password': None, 'personal_doc': '[PDOC]', 'prefer_author_sort': False, 'prefer_metadata_cover': False, 'preprocess_html': False, 'pretty_print': False, 'pubdate': None, 'publisher': None, 'rating': None, 'read_metadata_from_opf': None, 'remove_first_image': False, 'remove_footer': False, 'remove_header': False, 'remove_paragraph_spacing': False, 'remove_paragraph_spacing_indent_size': 1.5, 'rescale_images': False, 'series': None, 'series_index': None, 'tags': None, 'test': False, 'timestamp': None, 'title': None, 'title_sort': None, 'toc_filter': None, 'toc_threshold': 6, 'toc_title': None, 'use_auto_toc': False, 'username': None, 'verbose': 2} InputFormatPlugin: Recipe Input running >>> skipping The Golden Touch Gabfest (title keyword exclusion: Gabfest) <<< >>> skipping DoubleX Gabfest: The Mercurial Mommy Edition (title keyword exclusion: Gabfest) <<< >>> skipping The Culture Gabfest, "Rambo in Blush" Edition (title keyword exclusion: Gabfest) <<< Synthesizing mastheadImage Downloading Downloading Downloading Downloading Fetching Fetchinghttp://www.slate.com/id/2265196/pagenum/all/ http://www.slate.com/id/2265343/pagenum/all/ Downloading FetchingFetching http://www.slate.com/id/2265297/pagenum/all/http://www.slate.com/id/2265095/pagenum/all/ Fetching http://www.slate.com/id/2265204/pagenum/all/ Processing images... Processing images...Fetching http://img.slate.com/media/1/123125/123073/2240479/2262539/100826_EX_carryHeadTN.jpg Processing images... Fetching http://img.slate.com/images/tool_buttons/tweet.gif Processing images... Fetching http://img.slate.com/media/1/123125/123097/2248073/2265203/1008266_WH_EtisalatTN.jpg Processing images... Fetching http://img.slate.com/media/1/123125/123051/2240279/2262529/100826_$B_BubbleTN.jpg Fetching http://img.slate.com/media/1/123125/122984/2243388/2265254/100827_HN_RoyceLamberthTN.jpg Fetching http://img.slate.com/images/tool_buttons/facebook.gif Fetching http://img.slate.com/images/redesign2008/Slate_logo_onMaroon252x252.gif Fetching http://img.slate.com/images/redesign2008/Slate_logo_onMaroon252x252.gif Fetching http://img.slate.com/images/redesign2008/Slate_logo_onMaroon252x252.gif Fetching http://img.slate.com/images/redesign2008/Slate_logo_onMaroon252x252.gif Fetching http://img.slate.com/images/tool_buttons/rss.jpg Fetching http://img.slate.com/images/tool_buttons/rss.jpg Fetching http://img.slate.com/images/tool_buttons/rss.jpg Fetching http://img.slate.com/images/tool_buttons/rss.jpg Fetching http://img.slate.com/images/tool_buttons/rss.jpg Fetching http://img.slate.com/images/tool_buttons/print.gif Fetching http://img.slate.com/images/tool_buttons/print.gif Fetching http://img.slate.com/images/tool_buttons/print.gif Fetching http://img.slate.com/images/tool_buttons/print.gif Fetching http://img.slate.com/images/tool_buttons/print.gif Fetching http://img.slate.com/images/tool_buttons/email.gif Fetching http://img.slate.com/images/tool_buttons/email.gif Fetching http://img.slate.com/images/tool_buttons/email.gif Fetching http://img.slate.com/images/tool_buttons/email.gif Fetching http://img.slate.com/images/tool_buttons/email.gif Fetching http://img.slate.com/images/recommend_icons/facebook.gif Fetching http://img.slate.com/images/recommend_icons/facebook.gif Fetching http://img.slate.com/images/recommend_icons/facebook.gif Fetching http://img.slate.com/images/recommend_icons/facebook.gif Fetching http://img.slate.com/images/recommend_icons/facebook.gif Fetching http://img.slate.com/images/recommend_icons/digg.gif Fetching http://img.slate.com/images/recommend_icons/digg.gif Fetching http://img.slate.com/images/recommend_icons/digg.gif Fetching http://img.slate.com/images/recommend_icons/digg.gif Fetching http://img.slate.com/images/recommend_icons/digg.gif Fetching http://img.slate.com/images/recommend_icons/reddit.gif Fetching http://img.slate.com/images/recommend_icons/reddit.gif Fetching http://img.slate.com/images/recommend_icons/reddit.gif Fetching http://img.slate.com/images/recommend_icons/reddit.gif Fetching http://img.slate.com/images/recommend_icons/reddit.gif Fetching http://img.slate.com/images/recommend_icons/stumbledupon.jpg Fetching http://img.slate.com/images/recommend_icons/stumbledupon.jpg Fetching http://img.slate.com/images/recommend_icons/stumbledupon.jpg Fetching http://img.slate.com/images/recommend_icons/stumbledupon.jpg Fetching http://img.slate.com/images/recommend_icons/stumbledupon.jpg Recursion limit reached. Skipping links in http://www.slate.com/id/2265343/pagenum/all/ Recursion limit reached. Skipping links in http://www.slate.com/id/2265297/pagenum/all/ Recursion limit reached. Skipping links in http://www.slate.com/id/2265196/pagenum/all/ Recursion limit reached. Skipping links in http://www.slate.com/id/2265095/pagenum/all/ Recursion limit reached. Skipping links in http://www.slate.com/id/2265204/pagenum/all/ http://www.slate.com/id/2265343/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_1\index.xhtml Downloading Fetching http://www.slate.com/id/2263165/pagenum/all/ http://www.slate.com/id/2265196/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_0\index.xhtml Downloading Fetching http://www.slate.com/id/2265196/pagenum/all/ http://www.slate.com/id/2265297/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_3\index.xhtml Downloading Fetching http://www.slate.com/id/2265343/pagenum/all/ http://www.slate.com/id/2265204/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_2\index.xhtml Downloading Fetching http://www.slate.com/id/2265204/pagenum/all/ http://www.slate.com/id/2265095/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_4\index.xhtml Downloading Fetching http://www.slate.com/id/2265297/pagenum/all/ Downloaded article: The Bubble That Isn't from http://www.slate.com/id/2265343/ Downloaded article: Head Case from http://www.slate.com/id/2265196/ Downloaded article: A Distinction Without Deference from http://www.slate.com/id/2265297/ Downloaded article: The Internet's Secret Back Door from http://www.slate.com/id/2265204/ Downloaded article: Corrections from http://www.slate.com/id/2265095/ Processing images... Processing images... Processing images... Recursion limit reached. Skipping links in http://www.slate.com/id/2265196/pagenum/all/ Processing images... Processing images... Fetching http://labs.slate.com/media/img/slate_labs_logo_big.jpg Recursion limit reached. Skipping links in http://www.slate.com/id/2265204/pagenum/all/ Recursion limit reached. Skipping links in http://www.slate.com/id/2265343/pagenum/all/ Recursion limit reached. Skipping links in http://www.slate.com/id/2265297/pagenum/all/ Fetching http://labs.slate.com/media/img/thumbs/newsdots.jpg Fetching http://labs.slate.com/media/img/thumbs/lean-lock.jpg Fetching http://labs.slate.com/media/img/thumbs/jobmap.jpg http://www.slate.com/id/2265196/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_6\index.xhtml Downloading Fetching http://www.slate.com/id/2265095/pagenum/all/ Fetching http://labs.slate.com/media/img/thumbs/nametree.jpg http://www.slate.com/id/2265204/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_8\index.xhtml Downloading Fetching http://www.slate.com/id/2263165/pagenum/all/ http://www.slate.com/id/2265343/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_7\index.xhtml Downloading Fetching http://www.slate.com/id/2265214/pagenum/all/ http://www.slate.com/id/2265297/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_9\index.xhtml Downloading Fetching http://www.slate.com/id/2265255/entry/2265256/pagenum/all/ Downloaded article: Head Case from http://www.slate.com/id/2265196/ Downloaded article: The Internet's Secret Back Door from http://www.slate.com/id/2265204/ Downloaded article: The Bubble That Isn't from http://www.slate.com/id/2265343/ Downloaded article: A Distinction Without Deference from http://www.slate.com/id/2265297/ Fetching http://labs.slate.com/media/img/thumbs/51flag.jpg Fetching http://labs.slate.com/media/img/thumbs/bp-oil.jpg Fetching http://labs.slate.com/media/img/thumbs/teaparty.jpg Fetching http://labs.slate.com/media/img/thumbs/plain-english.png Fetching http://labs.slate.com/media/img/thumbs/media-map.jpg Fetching http://labs.slate.com/media/img/thumbs/favre.png Fetching http://labs.slate.com/media/img/thumbs/movie-scatterplot.png Fetching http://labs.slate.com/media/img/thumbs/sapometer.jpg Fetching http://labs.slate.com/media/img/thumbs/danbrown.jpg Fetching http://labs.slate.com/media/img/thumbs/senate-social-network.jpg Recursion limit reached. Skipping links in http://www.slate.com/id/2263165/pagenum/all/ Could not fetch link http://www.slate.com/id/2263165/pagenum/all/ Traceback (most recent call last): File "site-packages\calibre\web\fetch\simple.py", line 463, in process_links File "site-packages\calibre\web\feeds\news.py", line 686, in _postprocess_html File "c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_s8at24_recipes\recipe0.py", line 346, in postprocess_html AttributeError: 'NoneType' object has no attribute 'find' http://www.slate.com/id/2263165/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_5\index.xhtml Downloading Fetching http://www.slate.com/id/2265201/pagenum/all/ Failed to download article: Welcome to Slate Labs from http://www.slate.com/id/2263165/ Traceback (most recent call last): File "site-packages\calibre\utils\threadpool.py", line 95, in run File "site-packages\calibre\web\feeds\news.py", line 813, in fetch_article File "site-packages\calibre\web\feeds\news.py", line 809, in _fetch_article Exception: Could not fetch article. Run with -vv to see the reason Processing images... Recursion limit reached. Skipping links in http://www.slate.com/id/2265095/pagenum/all/ Processing images... Recursion limit reached. Skipping links in http://www.slate.com/id/2263165/pagenum/all/ http://www.slate.com/id/2265095/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_10\index.xhtml Downloading Fetching http://www.slate.com/id/2265192/pagenum/all/ Processing images... Downloaded article: Corrections from http://www.slate.com/id/2265095/ Fetching http://img.slate.com/media/1/123125/123019/2240599/2262533/100826_PB_printingpress02TN.jpg Recursion limit reached. Skipping links in http://www.slate.com/id/2265214/pagenum/all/ Processing images... Fetching http://img.slate.com/media/1/123125/122954/2241435/2262941/100826_MOV_INFERNO.jpg Could not fetch link http://www.slate.com/id/2263165/pagenum/all/ Traceback (most recent call last): File "site-packages\calibre\web\fetch\simple.py", line 463, in process_links File "site-packages\calibre\web\feeds\news.py", line 686, in _postprocess_html File "c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_s8at24_recipes\recipe0.py", line 346, in postprocess_html AttributeError: 'NoneType' object has no attribute 'find' http://www.slate.com/id/2263165/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_11\index.xhtml Downloading Fetching http://www.slate.com/id/2265205/pagenum/all/ Failed to download article: Welcome to Slate Labs from http://www.slate.com/id/2263165/ Traceback (most recent call last): File "site-packages\calibre\utils\threadpool.py", line 95, in run File "site-packages\calibre\web\feeds\news.py", line 813, in fetch_article File "site-packages\calibre\web\feeds\news.py", line 809, in _fetch_article Exception: Could not fetch article. Run with -vv to see the reason Recursion limit reached. Skipping links in http://www.slate.com/id/2265255/entry/2265256/pagenum/all/ http://www.slate.com/id/2265214/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_12\index.xhtml Downloading Fetching http://www.slate.com/id/2265191/pagenum/all/ Downloaded article: The Proto-Internet of 1704 from http://www.slate.com/id/2265214/ Could not fetch link http://www.slate.com/id/2265255/entry/2265256/pagenum/all/ Traceback (most recent call last): File "site-packages\calibre\web\fetch\simple.py", line 463, in process_links File "site-packages\calibre\web\feeds\news.py", line 686, in _postprocess_html File "c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_s8at24_recipes\recipe0.py", line 347, in postprocess_html AttributeError: 'NoneType' object has no attribute 'name' http://www.slate.com/id/2265255/entry/2265256/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_13\index.xhtml Downloading Fetching http://www.slate.com/id/2265079/pagenum/all/ Failed to download article: The Great 3-D Debate from http://www.slate.com/id/2265255/entry/2265256/ Traceback (most recent call last): File "site-packages\calibre\utils\threadpool.py", line 95, in run File "site-packages\calibre\web\feeds\news.py", line 813, in fetch_article File "site-packages\calibre\web\feeds\news.py", line 809, in _fetch_article Exception: Could not fetch article. Run with -vv to see the reason Processing images... Fetching http://img.slate.com/media/1/123125/2126996/2240593/2262953/100826_TECH_gmailPhoneTN.jpg Fetching http://img.slate.com/media/1/123125/2126996/2240593/2262953/100826_TECH_gmailstill.jpg Recursion limit reached. Skipping links in http://www.slate.com/id/2265201/pagenum/all/ Processing images... Fetching http://img.slate.com/media/1/123125/122958/2240716/2262946/100826_TV_kidsHallTN.jpg http://www.slate.com/id/2265201/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_14\index.xhtml Downloading Fetching http://www.slate.com/id/2264657/pagenum/all/ Downloaded article: The Home Phone Is Back! from http://www.slate.com/id/2265201/ Recursion limit reached. Skipping links in http://www.slate.com/id/2265192/pagenum/all/ http://www.slate.com/id/2265192/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_15\index.xhtml Downloaded article: Aesthetes of Absurdity from http://www.slate.com/id/2265192/ Processing images... Recursion limit reached. Skipping links in http://www.slate.com/id/2265191/pagenum/all/ Processing images... Fetching http://img.slate.com/media/1/123125/2218698/2241481/2262542/100825_XX_grizzlyTN.jpg Recursion limit reached. Skipping links in http://www.slate.com/id/2265079/pagenum/all/ http://www.slate.com/id/2265191/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_17\index.xhtml Downloaded article: The Literary Critic as Humanist from http://www.slate.com/id/2265191/ http://www.slate.com/id/2265079/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_18\index.xhtml Downloaded article: Bear Market from http://www.slate.com/id/2265079/ Processing images... Fetching http://img.slate.com/media/1/123125/2093564/2243695/2264310/100826_SCI_squatTN.jpg WARNING: Encoding detection confidence 99% Recursion limit reached. Skipping links in http://www.slate.com/id/2264657/pagenum/all/ http://www.slate.com/id/2264657/pagenum/all/ saved to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_w8cci0_plumber\feed_0\article_19\index.xhtml no allowed content found, removing article Could not fetch link http://www.slate.com/id/2265205/pagenum/all/ Traceback (most recent call last): File "site-packages\calibre\web\fetch\simple.py", line 434, in process_links File "site-packages\calibre\web\fetch\simple.py", line 195, in get_soup File "c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_6mdfxh\calibre_0.7.16_s8at24_recipes\recipe0.py", line 302, in preprocess_html Exception: String error http://www.slate.com/id/2265205/pagenum/all/ saved to Downloaded article: Don't Just Sit There! from http://www.slate.com/id/2264657/ Failed to download article: Create Your Own Virtual Cloud from http://www.slate.com/id/2265205/ Traceback (most recent call last): File "site-packages\calibre\utils\threadpool.py", line 95, in run File "site-packages\calibre\web\feeds\news.py", line 813, in fetch_article File "site-packages\calibre\web\feeds\news.py", line 809, in _fetch_article Exception: Could not fetch article. Run with -vv to see the reason Failed to download the following articles: Welcome to Slate Labs from All Articles http://www.slate.com/id/2263165/ Traceback (most recent call last): File "site-packages\calibre\utils\threadpool.py", line 95, in run File "site-packages\calibre\web\feeds\news.py", line 813, in fetch_article File "site-packages\calibre\web\feeds\news.py", line 809, in _fetch_article Exception: Could not fetch article. Run with -vv to see the reason Welcome to Slate Labs from All Articles http://www.slate.com/id/2263165/ Traceback (most recent call last): File "site-packages\calibre\utils\threadpool.py", line 95, in run File "site-packages\calibre\web\feeds\news.py", line 813, in fetch_article File "site-packages\calibre\web\feeds\news.py", line 809, in _fetch_article Exception: Could not fetch article. Run with -vv to see the reason The Great 3-D Debate from All Articles http://www.slate.com/id/2265255/entry/2265256/ Traceback (most recent call last): File "site-packages\calibre\utils\threadpool.py", line 95, in run File "site-packages\calibre\web\feeds\news.py", line 813, in fetch_article File "site-packages\calibre\web\feeds\news.py", line 809, in _fetch_article Exception: Could not fetch article. Run with -vv to see the reason Create Your Own Virtual Cloud from All Articles http://www.slate.com/id/2265205/ Traceback (most recent call last): File "site-packages\calibre\utils\threadpool.py", line 95, in run File "site-packages\calibre\web\feeds\news.py", line 813, in fetch_article File "site-packages\calibre\web\feeds\news.py", line 809, in _fetch_article Exception: Could not fetch article. Run with -vv to see the reason Parsing all content... Parsing feed_0/article_1/index.html ... Parsing feed_0/article_14/index.html ... Parsing feed_0/article_9/index.html ... Initial parse failed: Traceback (most recent call last): File "site-packages\calibre\ebooks\oeb\base.py", line 816, in first_pass File "lxml.etree.pyx", line 2532, in lxml.etree.fromstring (src/lxml/lxml.etree.c:48270) File "parser.pxi", line 1545, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:71812) File "parser.pxi", line 1417, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:70608) File "parser.pxi", line 898, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:67148) File "parser.pxi", line 539, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:63824) File "parser.pxi", line 625, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:64745) File "parser.pxi", line 565, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:64088) XMLSyntaxError: xmlns: URI xhtml is not absolute, line 1, column 26 Parsing file 'feed_0/article_9/index.html' as HTML Forcing feed_0/article_9/index.html into XHTML namespace Parsing feed_0/article_17/index.html ... Parsing feed_0/article_12/index.html ... Parsing feed_0/article_0/index.html ... Parsing feed_0/article_4/index.html ... Parsing feed_0/article_8/index.html ... Parsing feed_0/article_18/index.html ... Parsing feed_0/article_10/index.html ... Parsing feed_0/article_2/index.html ... Parsing feed_0/article_3/index.html ... Initial parse failed: Traceback (most recent call last): File "site-packages\calibre\ebooks\oeb\base.py", line 816, in first_pass File "lxml.etree.pyx", line 2532, in lxml.etree.fromstring (src/lxml/lxml.etree.c:48270) File "parser.pxi", line 1545, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:71812) File "parser.pxi", line 1417, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:70608) File "parser.pxi", line 898, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:67148) File "parser.pxi", line 539, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:63824) File "parser.pxi", line 625, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:64745) File "parser.pxi", line 565, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:64088) XMLSyntaxError: xmlns: URI xhtml is not absolute, line 1, column 26 Parsing file 'feed_0/article_3/index.html' as HTML Forcing feed_0/article_3/index.html into XHTML namespace Parsing feed_0/article_19/index.html ... Parsing feed_0/index.html ... Forcing feed_0/index.html into XHTML namespace Parsing feed_0/article_15/index.html ... Parsing index.html ... Forcing index.html into XHTML namespace Parsing feed_0/article_6/index.html ... Parsing feed_0/article_7/index.html ... Referenced file '/toolbar.aspx%3faction%3dprint%26id%3d2265196' not found Referenced file '/toolbar.aspx%3faction%3dprint%26id%3d2265095' not found Referenced file '/toolbar.aspx%3faction%3dprint%26id%3d2265079' not found Referenced file '/toolbar.aspx%3faction%3dprint%26id%3d2265201' not found Referenced file 'feed_0/article_16/index.html' not found Referenced file 'feed_1/index.html' not found Referenced file '/toolbar.aspx%3faction%3dprint%26id%3d2265191' not found Referenced file 'feed_0/article_13/index.html' not found Referenced file '/toolbar.aspx%3faction%3dprint%26id%3d2264657' not found Referenced file '/toolbar.aspx%3faction%3dprint%26id%3d2265192' not found Referenced file '/toolbar.aspx%3faction%3dprint%26id%3d2265297' not found Referenced file '/toolbar.aspx%3faction%3dprint%26id%3d2265204' not found Referenced file 'feed_0/article_5/index.html' not found Referenced file '/toolbar.aspx%3faction%3dprint%26id%3d2265343' not found Referenced file '/toolbar.aspx%3faction%3dprint%26id%3d2265214' not found Referenced file 'feed_0/article_11/index.html' not found Reading TOC from NCX... Merging user specified metadata... Detecting structure... Flattening CSS and remapping font sizes... Property: Invalid value for "CSS Level 2.1" property: 252 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 195 [1:13: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 252 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 338 [1:13: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 252 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 354 [1:13: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 252 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 321 [1:13: height] Property: Invalid value for "CSS Level 2.1" property: normal 1em/1.5em [1:1: font] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 252 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 195 [1:13: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 252 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 338 [1:13: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 252 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 354 [1:13: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 252 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 321 [1:13: height] Property: Invalid value for "CSS Level 2.1" property: normal 1em/1.5em [1:1: font] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 252 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 283 [1:13: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 250 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 89 [1:13: height] Property: Invalid value for "CSS Level 2.1" property: 539 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 305 [1:13: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 252 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 195 [1:13: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 252 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 195 [1:13: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 250 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 216 [1:13: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Property: Invalid value for "CSS Level 2.1" property: 18 [1:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [1:12: height] Source base font size is 12.00000pt Cleaning up manifest... Trimming unused files from manifest... Parsing stylesheet.css ... Property: Invalid value for "CSS Level 2.1" property: 195 [74:1: height] Property: Invalid value for "CSS Level 2.1" property: 252 [75:1: width] Property: Invalid value for "CSS Level 2.1" property: 18 [158:1: height] Property: Invalid value for "CSS Level 2.1" property: 18 [159:1: width] Property: Invalid value for "CSS Level 2.1" property: 338 [169:1: height] Property: Invalid value for "CSS Level 2.1" property: 252 [170:1: width] Property: Invalid value for "CSS Level 2.1" property: 354 [185:1: height] Property: Invalid value for "CSS Level 2.1" property: 252 [186:1: width] Property: Invalid value for "CSS Level 2.1" property: 321 [190:1: height] Property: Invalid value for "CSS Level 2.1" property: 252 [191:1: width] Property: Invalid value for "CSS Level 2.1" property: 283 [250:1: height] Property: Invalid value for "CSS Level 2.1" property: 252 [251:1: width] Property: Invalid value for "CSS Level 2.1" property: 89 [261:1: height] Property: Invalid value for "CSS Level 2.1" property: 250 [262:1: width] Property: Invalid value for "CSS Level 2.1" property: 305 [266:1: height] Property: Invalid value for "CSS Level 2.1" property: 539 [267:1: width] Property: Invalid value for "CSS Level 2.1" property: middle [281:1: text-align] Property: Invalid value for "CSS Level 2.1" property: 216 [286:1: height] Property: Invalid value for "CSS Level 2.1" property: 250 [287:1: width] Trimming 'feed_0/article_5/images/img15.jpg' from manifest Trimming 'feed_0/article_5/images/img1.jpg' from manifest Trimming 'feed_0/article_5/images/img11.jpg' from manifest Trimming 'feed_0/article_5/images/img3.jpg' from manifest Trimming 'feed_0/article_5/images/img8.jpg' from manifest Trimming 'feed_0/article_5/images/img2.jpg' from manifest Trimming 'feed_0/article_5/images/img10.jpg' from manifest Trimming 'feed_0/article_5/images/img4.jpg' from manifest Trimming 'feed_0/article_13/images/img1.jpg' from manifest Trimming 'feed_0/article_5/images/img9.jpg' from manifest Trimming 'feed_0/article_5/images/img6.jpg' from manifest Trimming 'feed_0/article_5/images/img7.jpg' from manifest Trimming 'feed_0/article_5/images/img13.jpg' from manifest Trimming 'feed_0/article_5/images/img12.jpg' from manifest Trimming 'feed_0/article_5/images/img5.jpg' from manifest Trimming 'feed_0/article_5/images/img14.jpg' from manifest Creating MOBI Output... Generating in-line TOC... Applying case-transforming CSS... Parsing manglecase.css ... Parsing tocstyle.css ... Rasterizing SVG images... Converting XHTML to Mobipocket markup... Converting TOC for MOBI periodical indexing... Using mastheadImage supplied in manifest... Serializing markup content... Hyperlink target 'feed_0/article_8/index.html#add-comment' not found Hyperlink target 'feed_0/article_7/index.html#add-comment' not found Hyperlink target 'feed_0/article_3/index.html#add-comment' not found Hyperlink target 'feed_0/article_12/index.html#add-comment' not found Hyperlink target 'feed_0/article_6/index.html#add-comment' not found Hyperlink target 'feed_0/article_0/index.html#add-comment' not found Hyperlink target 'feed_0/article_19/index.html#add-comment' not found Hyperlink target 'feed_0/article_18/index.html#add-comment' not found Hyperlink target 'feed_0/article_4/index.html#add-comment' not found Hyperlink target 'feed_0/article_1/index.html#add-comment' not found Hyperlink target 'feed_0/article_15/index.html#add-comment' not found Hyperlink target 'feed_0/article_2/index.html#add-comment' not found Hyperlink target 'feed_0/article_14/index.html#add-comment' not found Hyperlink target 'feed_0/article_10/index.html#add-comment' not found Hyperlink target 'feed_0/article_17/index.html#add-comment' not found Hyperlink target 'feed_0/article_9/index.html#add-comment' not found Compressing markup content... MOBI periodical specified, evaluating TOC for periodical conformance ... TOC structure conforms Generating structured CTOC ... CNCX utilization: 1 record, 3% full Indexing navPoints ... Generating INDX ... Serializing images... MOBI output written to c:\users\ganesh\appdata\local\temp\calibre_0.7.16_tmp_y2cdkc\calibre_0.7.16__m679j_recipe_out.mobi Last edited by gk_jam; 08-28-2010 at 05:32 PM. |
![]() |
![]() |
![]() |
#2 | |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
Quote:
However, I'll start you off. The 5 duplicate articles appear to result from a typo in the recipe. A line of code appears twice at line 102. Comment out or delete one of the two lines that reads: Code:
self.section_dates.append(self.tag_to_string(todays_section,use_alt=False)) |
|
![]() |
![]() |
![]() |
#3 |
Junior Member
![]() Posts: 6
Karma: 10
Join Date: Aug 2010
Device: Kindle 3
|
Thanks for the tip! I tried the fix you mentioned and that worked to remove the duplication issue!
I'll take your advice and post in the dedicated recipe thread for feedback on the other issue. Also, I assume that the way to alert the maintainers to update the recipe with the fix is to make a bug tracker entry? I'll give that a shot. |
![]() |
![]() |
![]() |
#4 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
This recipe is fairly complex. It would be best to have whoever wrote it fix it. It's likely the slate.com site has changed. Posting in the bug tracker will alert Kovid, and he'll usually alert whoever wrote it. Posting in the recipe thread may attract the attention of the author or someone else who has the skill to fix it.
|
![]() |
![]() |
![]() |
#5 |
Junior Member
![]() Posts: 6
Karma: 10
Join Date: Aug 2010
Device: Kindle 3
|
Looks like both issues have been resolved with that fix! Although they seem unrelated, there may have been some cascading effect. All relevant articles appear to now show up, and only once each. Thanks again!
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Best English News Source? | Gideon | Reading Recommendations | 24 | 11-16-2010 05:14 PM |
News Articles List in 0.7.20 | Ola | Calibre | 3 | 09-27-2010 07:38 AM |
Sharing/saving articles in news downloads for Kindle | f1nkster | Calibre | 4 | 07-28-2010 01:53 PM |
Reversing articles order in a custom news recipe? | retired_anon_25 | Calibre | 5 | 12-12-2009 05:24 PM |
Custom news source | JayCeeEll | Calibre | 2 | 11-14-2009 04:01 AM |