I fetched news from "PC Magazine" [Wed, 18 Mar, 2015].
Its cover page seems OK, but instead of content, the
pages contain either just headers/titles such as
Failed feed: Tech Commentary from the Editors of PC
Magazine
HTTP Error 403: Forbidden
or some garbage text like this:
"PC Magazine Small Business
<noscript> <img src="http://b.scorecardresearch.com/p?
c1=2&c2=6885615&c3=&c4=&c5=&a
mp;c6=&c15=&cj=1" /> </noscript>
&&&&&&&&
Magazine
Traceback (most recent call last):
File "site-packages\calibre\web\feeds\news.py", line
1599, in parse_feeds
File "site-packages\mechanize-0.2.5-py2.7.egg
\mechanize\_mechanize.py", line 203, in open
File "site-packages\mechanize-0.2.5-py2.7.egg
\mechanize\_mechanize.py", line 255, in _mech_open
httperror_seek_wrapper: HTTP Error 403: Forbidden
Failed feed: PC Magazine Breaking News
Traceback (most recent call last):
File "site-packages\calibre\web\feeds\news.py", line
1599, in parse_feeds
File "site-packages\mechanize-0.2.5-py2.7.egg
\mechanize\_mechanize.py", line 203, in open
File "site-packages\mechanize-0.2.5-py2.7.egg
\mechanize\_mechanize.py", line 255, in _mech_open
httperror_seek_wrapper: HTTP Error 403: Forbidden
Failed feed: PC Magazine Tips and Solutions
Traceback (most recent call last):
File "site-packages\calibre\web\feeds\news.py", line
1599, in parse_feeds
File "site-packages\mechanize-0.2.5-py2.7.egg
\mechanize\_mechanize.py", line 203, in open
File "site-packages\mechanize-0.2.5-py2.7.egg
\mechanize\_mechanize.py", line 255, in _mech_open
httperror_seek_wrapper: HTTP Error 403: Forbidden
Failed feed: PC Magazine: the Official John C. Dvorak RSS
Feed
Traceback (most recent call last):
File "site-packages\calibre\web\feeds\news.py", line
1599, in parse_feeds
File "site-packages\mechanize-0.2.5-py2.7.egg
\mechanize\_mechanize.py", line 203, in open
File "site-packages\mechanize-0.2.5-py2.7.egg
\mechanize\_mechanize.py", line 255, in _mech_open
httperror_seek_wrapper: HTTP Error 403: Forbidden
Failed feed: PC Magazine Editor-in-Chief Lance Ulanoff
Traceback (most recent call last):
File "site-packages\calibre\web\feeds\news.py", line
1599, in parse_feeds
File "site-packages\mechanize-0.2.5-py2.7.egg
\mechanize\_mechanize.py", line 203, in open
File "site-packages\mechanize-0.2.5-py2.7.egg
\mechanize\_mechanize.py", line 255, in _mech_open
httperror_seek_wrapper: HTTP Error 403: Forbidden
Failed feed: Technology News from Ziff Davis
Traceback (most recent call last):
File "site-packages\calibre\web\feeds\news.py", line
1599, in parse_feeds
File "site-packages\mechanize-0.2.5-py2.7.egg
\mechanize\_mechanize.py", line 203, in open
File "site-packages\mechanize-0.2.5-py2.7.egg
\mechanize\_mechanize.py", line 255, in _mech_open
httperror_seek_wrapper: HTTP Error 403: Forbidden
Synthesizing mastheadImage
Parsing all content...
Parsing feed_5/index.html ...
Initial parse failed, using more forgiving parsers
Parsing feed_5/index.html as HTML
Parsing feed_8/index.html ...
Initial parse failed, using more forgiving parsers
Parsing feed_8/index.html as HTML
Parsing feed_3/index.html ...
Initial parse failed, using more forgiving parsers
Parsing feed_3/index.html as HTML
Parsing index.html ...
Forcing index.html into XHTML namespace
Parsing feed_6/index.html ...
Initial parse failed, using more forgiving parsers
Parsing feed_6/index.html as HTML
Parsing feed_1/index.html ...
Initial parse failed, using more forgiving parsers
Parsing feed_1/index.html as HTML
Parsing feed_0/index.html ...
Initial parse failed, using more forgiving parsers
Parsing feed_0/index.html as HTML
Parsing feed_7/index.html ...
Initial parse failed, using more forgiving parsers
Parsing feed_7/index.html as HTML
Parsing feed_4/index.html ...
Initial parse failed, using more forgiving parsers
Parsing feed_4/index.html as HTML
Parsing feed_2/index.html ...
Initial parse failed, using more forgiving parsers
Parsing feed_2/index.html as HTML
Reading TOC from NCX...
Merging user specified metadata...
Detecting structure...
Flattening CSS and remapping font sizes...
Source base font size is 12.00000pt
Removing fake margins...
Found 2 items of level: p_2
Found 19 items of level: div_1
Found 18 items of level: div_2
Ignoring level p_2
div_1 left margin stats: Counter()
div_1 right margin stats: Counter()
div_2 left margin stats: Counter()
div_2 right margin stats: Counter()
Cleaning up manifest...
Trimming unused files from manifest...
Creating EPUB Output...
Found non-unique filenames, renaming to support broken
EPUB readers like FBReader, Aldiko and Stanza...
{u'feed_0/index.html': u'feed_0/index_u6.html',
u'feed_1/index.html': u'feed_1/index_u5.html',
u'feed_2/index.html': u'feed_2/index_u9.html',
u'feed_3/index.html': u'feed_3/index_u2.html',
u'feed_4/index.html': u'feed_4/index_u8.html',
u'feed_6/index.html': u'feed_6/index_u4.html',
u'feed_7/index.html': u'feed_7/index_u7.html',
u'feed_8/index.html': u'feed_8/index_u1.html',
u'index.html': u'index_u3.html'}
Rescaling image from 600x60 to 566x56
mastheadImage.jpg
Splitting markup on page breaks and flow limits, if any...
Looking for large trees in feed_6/index_u4.html...
No large trees found
Looking for large trees in index_u3.html...
No large trees found
Looking for large trees in feed_4/index_u8.html...
No large trees found
Looking for large trees in feed_3/index_u2.html...
No large trees found
Looking for large trees in feed_8/index_u1.html...
No large trees found
Looking for large trees in feed_5/index.html...
No large trees found
Looking for large trees in feed_2/index_u9.html...
No large trees found
Looking for large trees in feed_1/index_u5.html...
No large trees found
Looking for large trees in feed_0/index_u6.html...
No large trees found
Looking for large trees in feed_7/index_u7.html...
No large trees found
The cover image has an id != "cover". Renaming to work
around bug in Nook Color
EPUB output written to C:\Users\Danesh\AppData\Local