| 
			
			 | 
		#1 | 
| 
			
			
			
			 Zealot 
			
			Posts: 119 
				Karma: 100 
				Join Date: Jan 2011 
				Location: Germany / NRW /Köln 
				
				
				Device: prs-650 / prs-350 /kindle 3 
				
				
				 | 
	
	
	
		
		
			
			 
				
				grab the big-ones (img)
			 
			
			
			Hi,

	I have a small problem fetching the real pictures from the feeds. Fetching only the thumbnails is not the goal, I think. Can I fetch the big ones? Here are the links from inside the HTML.
	The thumbnail:
	Code:
	http://www.ngz-online.de/polopoly_fs/1.1255103.1305792647!/httpImage/1593972352.jpg_gen/derivatives/rpo54_195/1593972352.jpg
	The large image:
	Code:
	http://www.ngz-online.de/polopoly_fs/1.1255103.1305792647!/httpImage/1593972352.jpg_gen/derivatives/rpoPanorama_786/1593972352.jpg
		 | 
| 
| 
			
			 | 
		#2 | 
| 
			
			
			
			 creator of calibre 
			
			Posts: 45,609 
				Karma: 28549044 
				Join Date: Oct 2006 
				Location: Mumbai, India 
				
				
				Device: Various 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Use preprocess_html() and change the links.
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
| 
			
			 | 
		#3 | 
| 
			
			
			
			 Zealot 
			
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			I have been working on it the whole day and I can't get it to work. Could you please help me a bit with the right syntax? This is the HTML that needs to change:
	Code:
	<div class="goodiebox l box_right">
	  <div class="boxframe">
	    <div class="headline_empty"/>
	    <div class="content">
	      <center>
	        <div class="imgtop">
	          <a href="/polopoly_fs/1.248951.1297710265!/httpImage/16361081.jpg_gen/derivatives/rpoPanorama_786/16361081.jpg" class="lightbox" rel="lightbox" title="Erster Gast auf dem Blauen Sofa in Grevenbroich: Bürgermeisterin Ursula Kwasny. Foto: Michael Reuter">
	            <img title="Erster Gast auf dem Blauen Sofa in Grevenbroich: Bürgermeisterin Ursula Kwasny. Foto: Michael Reuter" height="156" style="" alt="" width="195" class="" src="/polopoly_fs/1.248951.1297710265!/httpImage/16361081.jpg_gen/derivatives/rpo54_195/16361081.jpg"/>
	          </a>
	          <a title="Erster Gast auf dem Blauen Sofa in Grevenbroich: Bürgermeisterin Ursula Kwasny. Foto: Michael Reuter" class="iconpic_zoom lightbox" rel="lightbox" href="/polopoly_fs/1.248951.1297710265!/httpImage/16361081.jpg_gen/derivatives/rpoPanorama_786/16361081.jpg"/>
	          <script type="text/javascript" src="http://www.ngz-online.de:80/js/lightbox.js" language="JavaScript"/>
	I want to replace rpo54_195 with rpoPanorama_786.
	 | 
| 
| 
			
			 | 
		#4 | 
| 
			
			
			
			 creator of calibre 
			
				
				
				 | 
	
	
	
		
		
		
		
		 Code: 
	# Point every <img> at the large derivative instead of the thumbnail.
	for img in soup.findAll('img', src=True):
	    img['src'] = img['src'].replace('rpo54_195', 'rpoPanorama_786')
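
For context, here is a minimal sketch of how the loop above might sit inside a recipe's preprocess_html() hook. The class name NGZOnline and the title are hypothetical and do not come from the thread. The fallback to `object` is only there so the snippet can be read and run outside calibre; inside calibre the real BasicNewsRecipe base class is used.

```python
try:
    # Available when the recipe is run by calibre itself.
    from calibre.web.feeds.news import BasicNewsRecipe
except ImportError:
    # Stand-in so the sketch can be imported outside calibre.
    BasicNewsRecipe = object


class NGZOnline(BasicNewsRecipe):  # hypothetical recipe name
    title = 'NGZ Online'  # assumed title, not taken from the thread

    def preprocess_html(self, soup):
        # Swap the thumbnail derivative for the large one in every <img src>.
        for img in soup.findAll('img', src=True):
            img['src'] = img['src'].replace('rpo54_195', 'rpoPanorama_786')
        # The hook hands the (modified) soup back to calibre.
        return soup
```

The feeds list and any other recipe settings are left out here; only the image rewriting step is shown.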
 | 
| 
| 
			
			 | 
		#5 | 
| 
			
			
			
			 Zealot 
			
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			OK, that was the next step: the strings are changed, and now I get this:
	Code:
	Referenced file '/polopoly_fs/1.1255129.1305794277%21/httpImage/2533432514.jpg_gen/derivatives/rpoPanorama_786/2533432514.jpg' not found
	Referenced file '/polopoly_fs/1.1255125.1305793701%21/httpImage/4171936915.jpg_gen/derivatives/rpoPanorama_786/4171936915.jpg' not found
	Referenced file '/polopoly_fs/1.1264249.1305799303%21/image/1299281949.jpg_gen/derivatives/rpoPanorama_786/1299281949.jpg' not found
		 | 
| 
| 
			
			 | 
		#6 | 
| 
			
			
			
			 creator of calibre 
			
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Make the URL absolute.
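
One way this could look, sketched with urljoin from the Python 3 standard library (on the Python 2 builds of calibre the same function lives in the urlparse module). The base URL is the host seen in the image links from post #1; the helper name absolutize is made up for illustration.

```python
from urllib.parse import urljoin

BASE_URL = 'http://www.ngz-online.de/'  # host seen in the links from post #1


def absolutize(src, base=BASE_URL):
    """Return src as an absolute URL; already-absolute inputs pass through."""
    return urljoin(base, src)


relative = ('/polopoly_fs/1.248951.1297710265!/httpImage/16361081.jpg_gen'
            '/derivatives/rpoPanorama_786/16361081.jpg')
print(absolutize(relative))
```

Because urljoin resolves the leading slash against the host, it avoids the doubled slash that plain string concatenation can produce when the src already starts with "/".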
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
| 
			
			 | 
		#7 | 
| 
			
			
			
			 Zealot 
			
				
				
				 | 
	
	
	
		
		
		
		
		OK, that is too difficult for me as a beginner. I want to read books to learn it. Is this the right way to make it absolute?
		soup = BeautifulSoup()
		tag1 = Tag(soup, "mytag")
		tag2 = Tag(soup, "myOtherTag")
		 | 
| 
| 
			
			 | 
		#8 | 
| 
			
			
			
			 creator of calibre 
			
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			I just meant add the http://servername part to the links.
	img['src'] = 'http://servername-whatever/' + img['src'].replace('xx', 'yy')
	 | 
| 
| 
			
			 | 
		#9 | 
| 
			
			
			
			 Zealot 
			
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Ah, OK, I understand now! But it looks as if the target picture is also inside a script container. I believe this is much more difficult. I will let it be and work on some simpler stuff. Thank you for your assistance. Greetings
	 | 
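
Whether the page really needs any script handling is not settled in this thread. In the HTML quoted in post #3, though, the large image URL also appears as the href of the `<a class="lightbox">` element that wraps the thumbnail. The following is only a sketch of that idea, using the standard library's HTMLParser so it runs outside calibre; the class name LightboxLinks and the shortened sample markup are assumptions made for illustration.

```python
from html.parser import HTMLParser


class LightboxLinks(HTMLParser):
    """Collect href values of <a> tags whose class list contains 'lightbox'."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag != 'a':
            return
        attr = dict(attrs)
        classes = (attr.get('class') or '').split()
        href = attr.get('href')
        # Keep each large-image link once, in document order.
        if 'lightbox' in classes and href and href not in self.hrefs:
            self.hrefs.append(href)


# Shortened from the markup quoted in post #3.
sample = (
    '<a href="/polopoly_fs/1.248951.1297710265!/httpImage/16361081.jpg_gen'
    '/derivatives/rpoPanorama_786/16361081.jpg" class="lightbox" rel="lightbox">'
    '<img src="/polopoly_fs/1.248951.1297710265!/httpImage/16361081.jpg_gen'
    '/derivatives/rpo54_195/16361081.jpg"/></a>'
)

parser = LightboxLinks()
parser.feed(sample)
print(parser.hrefs)
```

The links collected this way are still relative, so they would need the server name added before calibre can download them.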
| 