|  01-31-2008, 11:49 PM | #166 | |
| Addict     Posts: 274 Karma: 332 Join Date: Nov 2003 Location: San Francisco, USA Device: Sage, Elipsa, Oasis, Galaxy Tab 8U, S22U | Quote: 
 | |
|   |   | 
|  02-01-2008, 12:05 AM | #167 | 
| creator of calibre            Posts: 45,604 Karma: 28548974 Join Date: Oct 2006 Location: Mumbai, India Device: Various | 
			
			Probably a difference in the two profiles. I just tested newsweek; commandline and GUI took 113s and 116s. I should probably update the wsj profile. EDIT: oldest_article is 3 vs. 7, which probably explains it. Also, JTravers, is that the correct print url mapping? Last edited by kovidgoyal; 02-01-2008 at 12:08 AM. | 
|   |   | 
|  02-01-2008, 02:05 AM | #168 | |
| Groupie            Posts: 182 Karma: 1078201 Join Date: Sep 2007 Device: iPad Air 2 | Quote: 
 That print url mapping has always worked for me. You could probably clean up the end of the url too, but I've never found that to be necessary. | |
|   |   | 
|  02-01-2008, 02:11 AM | #169 | 
| Groupie            Posts: 182 Karma: 1078201 Join Date: Sep 2007 Device: iPad Air 2 | |
|   |   | 
|  02-05-2008, 06:12 PM | #170 | 
| Junior Member  Posts: 4 Karma: 10 Join Date: Feb 2008 Device: Sony | 
				
				The Nation -- subscriber info
			 
			
			I've downloaded thenation.py and run web2lrf with it. It sort of works, but I can't quite get it right. The first problem is that I'm not sure about the dates that need to be inserted (one short and one long). The second (and bigger) problem is that I can't figure out where to put my login and password. Without that, I receive notices about needing to subscribe to download some content. And most of the articles seem to come from web postings, not the actual issue. Any help would be appreciated. | 
|   |   | 
|  02-11-2008, 11:31 AM | #171 | 
| Created Sigil, FlightCrew            Posts: 1,982 Karma: 350515 Join Date: Feb 2008 Device: Kobo Clara HD | 
			
			I posted user profiles for Jutarni.hr (the online version of Croatia's most popular newspaper) and USATODAY to the ticket system. I apologize if the ticket system was not the correct way of informing you about them, but it just seemed like it was the right way to do it. I saw that ticket with all those different requests for news feeds, and if I have the time, I'll try to work through the list. I'm currently working on The New Yorker. Will add it when it's done. If I mess up a profile, please tell me about it and I'll try to fix it. | 
|   |   | 
|  02-11-2008, 12:48 PM | #172 | 
| creator of calibre            Posts: 45,604 Karma: 28548974 Join Date: Oct 2006 Location: Mumbai, India Device: Various | 
			
			Cool, I'll add them in the next release.
		 | 
|   |   | 
|  02-11-2008, 02:45 PM | #173 | 
| Created Sigil, FlightCrew            Posts: 1,982 Karma: 350515 Join Date: Feb 2008 Device: Kobo Clara HD | 
			
			Kovid, you have a terrible little bug in web2lrf... maybe not so much a bug as a design oversight... For the last 5 hours I have been attempting to create a user profile for The New Yorker, and no matter what I did, the code only retrieved TWO articles from the site... I tried everything... and then I realized what the problem was. Your code that checks the oldest_article variable... It starts at the top of the feed and continues down, checking each article's date. When it finds an article older than the number of days in oldest_article, it stops checking subsequent articles. WELL! The RSS feeds on the TNY website are not sorted by date, but by some quasi-alphabetical sort, so when this code finds an old article at the very top of the feed (very, very likely), it doesn't grab the newer ones which are lower in the listing. Please fix this so it checks each and every article in the list. I have uploaded the New Yorker profile with its oldest_article variable set to 90; it was the only way I could get the newer articles. When you fix the bug, fix the profile accordingly. Everything else about it works fine. | 
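The difference described in the post above can be sketched in a few lines. This is only an illustration, not web2lrf's actual code: the function names, the dict-based article records, and the sample feed are all made up for the example. It contrasts stopping at the first article that is too old with checking every article against the cutoff.

```python
from datetime import datetime, timedelta


def recent_articles_early_stop(articles, oldest_article_days, now=None):
    """Hypothetical version of the reported behaviour: stop at the first old article."""
    now = now or datetime.now()
    cutoff = now - timedelta(days=oldest_article_days)
    kept = []
    for article in articles:
        if article["date"] < cutoff:
            break  # everything below this point is never looked at
        kept.append(article)
    return kept


def recent_articles(articles, oldest_article_days, now=None):
    """Hypothetical fix: test every article, so feed order does not matter."""
    now = now or datetime.now()
    cutoff = now - timedelta(days=oldest_article_days)
    return [article for article in articles if article["date"] >= cutoff]


# Invented sample feed, sorted roughly alphabetically rather than by date.
NOW = datetime(2008, 2, 11)
FEED = [
    {"title": "A new piece", "date": datetime(2008, 2, 10)},
    {"title": "An old piece", "date": datetime(2007, 12, 1)},
    {"title": "Zebra, also new", "date": datetime(2008, 2, 9)},
]

if __name__ == "__main__":
    print([a["title"] for a in recent_articles_early_stop(FEED, 7, NOW)])
    print([a["title"] for a in recent_articles(FEED, 7, NOW)])
```

With a feed that is not sorted by date, the early-stop version drops the newer article listed below the old one, while the filtering version keeps both recent articles.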
|   |   | 
|  02-17-2008, 12:16 PM | #174 | 
| Junior Member  Posts: 4 Karma: 10 Join Date: Jul 2007 Device: Sony PRS-500 | 
			
			Kovid, I've said this before, but with the Economist profile I feel the need to say it again. This program is phenomenal, particularly given its OS independence and the .deb packages and ebuilds. I'm a subscriber to the Economist, and I imagine I will use your service rather than reading the paper edition more often than not. Thanks again. | 
|   |   | 
|  02-17-2008, 01:36 PM | #175 | 
| creator of calibre            Posts: 45,604 Karma: 28548974 Join Date: Oct 2006 Location: Mumbai, India Device: Various | 
			
			Interesting, I know about the ebuilds, but are the deb packages being maintained as well?
		 | 
|   |   | 
|  02-17-2008, 06:22 PM | #176 | |
| Resident Curmudgeon            Posts: 80,746 Karma: 150249619 Join Date: Nov 2006 Location: Roslindale, Massachusetts Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 | Quote: 
 | |
|   |   | 
|  02-21-2008, 06:09 PM | #177 | 
| Addict     Posts: 274 Karma: 332 Join Date: Nov 2003 Location: San Francisco, USA Device: Sage, Elipsa, Oasis, Galaxy Tab 8U, S22U | 
				
				Get Full WSJ?
			 
			
			Hi guys. I'm using the WSJ profile and it works very well (thanks to JTravers for the profile). I have a quick question: is it possible to get all the articles from a page, not from a feed? The RSS feed for "Today's Newspaper" has only 5 articles from the front page plus a few more from other sections. I'd like to get as many articles from the printed edition ("http://online.wsj.com/page/2_0133.html") as possible. I replaced an existing link with this one, but got a blank page:

    def get_feeds(self):
        return [
            (' Today\'s Newspaper - All', 'http://online.wsj.com/page/2_0133.html'),
            ## (' Today\'s Newspaper - Page One', 'http://online.wsj.com/xml/rss/3_7205.xml'),
        ]

Any advice? I want all the links from the "http://online.wsj.com/page/2_0133.html" page that have "article" in their address. I don't think I need to change the clean-up part; the current profile does all the work. This must be a simple question for Kovid, JTravers and others who have created their profiles. Thanks in advance, David | 
|   |   | 
|  02-21-2008, 06:13 PM | #178 | 
| creator of calibre            Posts: 45,604 Karma: 28548974 Join Date: Oct 2006 Location: Mumbai, India Device: Various | 
			
			It's certainly doable, but in order to do it, you have to parse the HTML from that page; see, for example, the feed for The Atlantic.
		 | 
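As a rough idea of what "parse the HTML from that page" could look like, here is a minimal sketch using only the Python standard library. It is not the Atlantic profile and it does not use any web2lrf API; the class name, the function name, and the sample HTML are assumptions made for the example. It collects (title, url) pairs for every link whose address contains "article", which is the filter David asked for.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class ArticleLinkParser(HTMLParser):
    """Hypothetical helper: collect links whose href contains a given substring."""

    def __init__(self, base_url, needle="article"):
        super().__init__()
        self.base_url = base_url
        self.needle = needle
        self.links = []      # list of (title, absolute_url)
        self._href = None    # set while inside a matching <a> element
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href and self.needle in href:
                self._href = urljoin(self.base_url, href)
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            title = " ".join("".join(self._text).split())
            seen = [url for _, url in self.links]
            if self._href not in seen:  # skip repeated links to the same article
                self.links.append((title or self._href, self._href))
            self._href = None


def extract_article_links(html, base_url):
    parser = ArticleLinkParser(base_url)
    parser.feed(html)
    parser.close()
    return parser.links


# Invented stand-in for the real page, which would have to be downloaded first.
SAMPLE = """
<html><body>
<a href="/article/SB1234.html">Markets  rally</a>
<a href="/page/2_0133.html">Index</a>
<a href="http://online.wsj.com/article/SB5678.html">Fed <b>cuts</b> rates</a>
<a href="/article/SB1234.html">Markets rally (again)</a>
</body></html>
"""

if __name__ == "__main__":
    base = "http://online.wsj.com/page/2_0133.html"
    for title, url in extract_article_links(SAMPLE, base):
        print(title, url)
```

The resulting list would still have to be handed to the profile in whatever form it expects, and fetching the real page (including any login it needs) is left out here.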
|   |   | 
|  02-21-2008, 06:17 PM | #179 | 
| Addict     Posts: 274 Karma: 332 Join Date: Nov 2003 Location: San Francisco, USA Device: Sage, Elipsa, Oasis, Galaxy Tab 8U, S22U | 
			
			Do you live here? :-) I didn't see Atlantic under UserProfiles. Where can I find it? Thanks, David | 
|   |   | 
|  02-21-2008, 06:31 PM | #180 | 
| Addict     Posts: 274 Karma: 332 Join Date: Nov 2003 Location: San Francisco, USA Device: Sage, Elipsa, Oasis, Galaxy Tab 8U, S22U | 
			
			Kovid, ignore my previous message. A quick search and I found the thread about Atlantic. Have to search first. | 
|   |   | 
|  | 
| Tags | 
| libprs500, web2lrf | 