| 
			
			 | 
		#1 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 2 
				Karma: 10 
				Join Date: Nov 2014 
				
				
				
				Device: Kindle keyboard 3g 
				
				
				 | 
	
	
	
		
		
			
			 
				
				Problem with The Guardian & The Observer recipe?
			 
			
			
			I've had problems with this recipe since updating from Calibre 2.5 to 2.8, but reverting back to 2.5 doesn't fix it.  At first the recipe returned very little (0.2Mb vs the usual ~10Mb on a weekend), then it stopped running with the error message "NoneType' object has no attribute 'findAll'.  I contacted Calibre with a bug report but they redirected me here. 
		
	
		
		
		
		
		
		
		
		
		
		
	
	Can anyone shed any light?  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#2 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 2 
				Karma: 10 
				Join Date: Nov 2014 
				
				
				
				Device: Calibre 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			[ Reasons right now mean I can only use Calibre 0.7.28, last version, for Mac OS X 10.5.8 ].  
		
	
		
		
		
		
		
		
		
		
		
		
	
	I have downloaded The Guardian/Observer for many weeks with this. Last week, about Oct 27, 28 only some pages became available to read, the remainder the URLs only, those entries always at the end of each page. Thereafter I received only the URLs, no pages at all. I have tried various ways to resolve that, but not being so softwarish, and finding 'python' hard to understand (and apparently anyway requires indents. Hmm?), I was not successful. Oh AND the Guardian web sights are now rather different from what they were up to a week ago, that is ... changed. I can see each subject title being downloaded in sequence. The resulting table of contents set works fine. But click on a page title and get only the URL. I tried to set up the feed[], based on the Daily Telegraph feed [] set, but observe the guardian.recipe is currently rather different from the telegraph.recipe, and the Guardian I tried, in my lack of knowledge and meaning, says 'no!'. This is all perhaps a significant montypython software construct. It has been easy to set the ignore section[] -- in my case 'Sport', although seemed not to work for 'Observer Sport' -- but there is no way I see to ADD other sections I want, presumably not "basic" enough for the aroma of beautiful.soup <s> So please, software pythons wind around these things and squeeze them to rights.  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| Advert | |
| 
         | 
    
| 
			
			 | 
		#3 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 9 
				Karma: 10 
				Join Date: Apr 2012 
				
				
				
				Device: kindle 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			I'm finding a similar problem -- frequently articles in the index are not in the file.. Eg, today the link to the article "DNA Scientist James Watson sells Nobel prize medal" fails -- the article is not there. Sometimes up to 25% of the articles fail in this way....
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#4 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 8 
				Karma: 10 
				Join Date: Dec 2014 
				
				
				
				Device: Kindle 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Working through today's paper a pattern becomes obvious.  This link fails: http://www.theguardian.com/science/2...el-prize-medal  but this does not: http://www.theguardian.com/politics/...-neoliberalism 
		
	
		
		
		
		
		
		
		
		
		
		
	
	It depends on the value of XXX in www.theguardian.com/XXX/. All links are OK if XXX is world, or business, or commentisfree, or us-news, or uk-news, or politics, or society (or a few others). All links fail if XXX is stage, music, science, books, media, film, money, technology (and a few others). Any suggestions on how to fix it? How can I look at any intermediate files that get built and then purged?  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#5 | 
| 
			
			
			
			 Grand Sorcerer 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 13,698 
				Karma: 79983758 
				Join Date: Nov 2007 
				Location: Toronto 
				
				
				Device: Libra H2O, Libra Colour 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			I sense there is a new design coming to the Guardian web site and at some point all content will be like stage / music / ....
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| Advert | |
| 
         | 
    
| 
			
			 | 
		#6 | 
| 
			
			
			
			 creator of calibre 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,609 
				Karma: 28549044 
				Join Date: Oct 2006 
				Location: Mumbai, India 
				
				
				Device: Various 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			This commit, should allow extracting content from the new design, however, you will have to wait till that website stabilizes before this recipe can be updated properly. 
		
	
		
		
		
		
		
		
		
		
		
		
	
	https://github.com/kovidgoyal/calibr...23e1f52a82958e  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#7 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 8 
				Karma: 10 
				Join Date: Dec 2014 
				
				
				
				Device: Kindle 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Thanks, Kovid, for your quick response.  At first sight it works OK with the UK version of the website. 
		
	
		
		
		
		
		
		
		
		
		
		
	
	Alan  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#8 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 9 
				Karma: 10 
				Join Date: Apr 2012 
				
				
				
				Device: kindle 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Yes -- looks good so far! Thanks
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
![]()  | 
            
        
    
            
  | 
    
			 
			Similar Threads
		 | 
	||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| The Guardian and The Observer missing Sport Section | colint | Recipes | 0 | 05-23-2014 07:36 AM | 
| The guardian &Observer | didsbury | Calibre | 1 | 01-26-2013 08:57 AM | 
| The Guardian and Observer Books Power 100 | Ben Thornton | News | 4 | 10-02-2011 12:04 PM | 
| The Guardian/The observer broken recipe ? | wingmongyee | Recipes | 6 | 07-08-2011 11:38 PM | 
| Review of the Kindle 3 from the Observer in the Guardian UK | DMcCunney | News | 18 | 08-29-2010 08:03 PM |