View Single Post
Old 06-20-2025, 06:46 AM   #8
xkcklzn
Junior Member
xkcklzn began at the beginning.
 
Posts: 8
Karma: 10
Join Date: Nov 2024
Device: pc
Code:
Fetch news from The Economist Espresso
Conversion options changed from defaults:
  output_profile: 'generic_eink_hd'
  verbose: 2
Resolved conversion options
calibre version: 8.5.0
{'add_alt_text_to_img': False,
 'asciiize': False,
 'author_sort': None,
 'authors': None,
 'base_font_size': 0,
 'book_producer': None,
 'change_justification': 'original',
 'chapter': None,
 'chapter_mark': 'pagebreak',
 'comments': None,
 'cover': None,
 'debug_pipeline': None,
 'dehyphenate': True,
 'delete_blank_paragraphs': True,
 'disable_font_rescaling': False,
 'dont_download_recipe': False,
 'dont_split_on_page_breaks': True,
 'duplicate_links_in_toc': False,
 'embed_all_fonts': False,
 'embed_font_family': None,
 'enable_heuristics': False,
 'epub_flatten': False,
 'epub_inline_toc': False,
 'epub_max_image_size': 'none',
 'epub_toc_at_end': False,
 'epub_version': '2',
 'expand_css': False,
 'extra_css': None,
 'extract_to': None,
 'filter_css': None,
 'fix_indents': True,
 'flow_size': 260,
 'font_size_mapping': None,
 'format_scene_breaks': True,
 'html_unwrap_factor': 0.4,
 'input_encoding': None,
 'input_profile': <calibre.customize.profiles.InputProfile object at 0x00000227DC5B7A90>,
 'insert_blank_line': False,
 'insert_blank_line_size': 0.5,
 'insert_metadata': False,
 'isbn': None,
 'italicize_common_cases': True,
 'keep_ligatures': False,
 'language': None,
 'level1_toc': None,
 'level2_toc': None,
 'level3_toc': None,
 'line_height': 0,
 'linearize_tables': False,
 'lrf': False,
 'margin_bottom': 5.0,
 'margin_left': 5.0,
 'margin_right': 5.0,
 'margin_top': 5.0,
 'markup_chapter_headings': True,
 'max_toc_links': 50,
 'minimum_line_height': 120.0,
 'no_chapters_in_toc': False,
 'no_default_epub_cover': False,
 'no_inline_navbars': False,
 'no_svg_cover': False,
 'output_profile': <calibre.customize.profiles.GenericEinkHD object at 0x00000227DB7671D0>,
 'page_breaks_before': None,
 'prefer_metadata_cover': False,
 'preserve_cover_aspect_ratio': False,
 'pretty_print': True,
 'pubdate': None,
 'publisher': None,
 'rating': None,
 'read_metadata_from_opf': None,
 'recipe_specific_option': None,
 'remove_fake_margins': True,
 'remove_first_image': False,
 'remove_paragraph_spacing': False,
 'remove_paragraph_spacing_indent_size': 1.5,
 'renumber_headings': True,
 'replace_scene_breaks': '',
 'search_replace': None,
 'series': None,
 'series_index': None,
 'smarten_punctuation': False,
 'sr1_replace': '',
 'sr1_search': '',
 'sr2_replace': '',
 'sr2_search': '',
 'sr3_replace': '',
 'sr3_search': '',
 'start_reading_at': None,
 'subset_embedded_fonts': False,
 'tags': None,
 'test': False,
 'timestamp': None,
 'title': None,
 'title_sort': None,
 'toc_filter': None,
 'toc_threshold': 6,
 'toc_title': None,
 'transform_css_rules': None,
 'transform_html_rules': None,
 'unsmarten_punctuation': False,
 'unwrap_lines': True,
 'use_auto_toc': False,
 'verbose': 2}
InputFormatPlugin: Recipe Input running
Downloading recipe urn: builtin:economist_espresso
Trying to get latest version of recipe: economist_espresso
Using proxies: {'http': '127.0.0.1:10808', 'https': '127.0.0.1:10808', 'ftp': 'http://127.0.0.1:10808'}
Using user agent: Mozilla/5.0 (Linux; Android 14) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/127.0.6533.103 Mobile Safari/537.36 Liskov
Using proxies: {'http': '127.0.0.1:10808', 'https': '127.0.0.1:10808', 'ftp': 'http://127.0.0.1:10808'}
Fetching https://www.economist.com/the-world-in-brief
Could not fetch link https://www.economist.com/the-world-in-brief
Traceback (most recent call last):
  File "calibre\web\fetch\simple.py", line 286, in fetch_url
  File "mechanize\_mechanize.py", line 241, in open_novisit
  File "mechanize\_mechanize.py", line 313, in _mech_open
mechanize._response.get_seek_wrapper_class.<locals>.httperror_seek_wrapper: HTTP Error 403: Forbidden

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "calibre\web\fetch\simple.py", line 544, in process_links
  File "calibre\web\fetch\simple.py", line 291, in fetch_url
calibre.web.fetch.simple.FetchError: Forbidden

https://www.economist.com/the-world-in-brief saved to 
Failed to download article: The World in Brief from https://www.economist.com/the-world-in-brief
Traceback (most recent call last):
  File "calibre\utils\threadpool.py", line 100, in run
  File "calibre\web\feeds\news.py", line 1250, in fetch_article
  File "calibre\web\feeds\news.py", line 1245, in _fetch_article
Exception: Could not fetch article. The debug traceback is available earlier in this log



Failed to download the following articles:
The World in Brief from Espresso
https://www.economist.com/the-world-in-brief
Traceback (most recent call last):
  File "calibre\utils\threadpool.py", line 100, in run
  File "calibre\web\feeds\news.py", line 1250, in fetch_article
  File "calibre\web\feeds\news.py", line 1245, in _fetch_article
Exception: Could not fetch article. The debug traceback is available earlier in this log

Parsing all content...
Parsing feed_0/index.html ...
Forcing feed_0/index.html into XHTML namespace
Parsing index.html ...
Forcing index.html into XHTML namespace
Reading TOC from NCX...
Merging user specified metadata...
Detecting structure...
Flattening CSS and remapping font sizes...
Source base font size is 12.00000pt
Removing fake margins...
Found 3 items of level: div_1
Found 2 items of level: p_2
Found 1 items of level: div_2
Ignoring level p_2
div_1  left margin stats: Counter()
div_1  right margin stats: Counter()
div_2  left margin stats: Counter()
div_2  right margin stats: Counter()
Cleaning up manifest...
Trimming unused files from manifest...
Creating EPUB Output...
Found non-unique filenames, renaming to support broken EPUB readers like FBReader, Aldiko and Stanza...
{'index.html': 'index_u1.html'}
Splitting markup on page breaks and flow limits, if any...
	Looking for large trees in feed_0/index.html...
	No large trees found
	Looking for large trees in index_u1.html...
	No large trees found
This EPUB file has no Table of Contents. Creating a default TOC
The cover image has an id != "cover". Renaming to work around bug in Nook Color
EPUB output written to C:\Users\Joea\AppData\Local\Temp\calibre-gq3aqxka\zdnuyqog_recipe_out.epub
The Economist Espresso recipe fails due to the change of the website's HTML structure. Thanks for fixing!
xkcklzn is offline   Reply With Quote