01-31-2008, 11:49 PM | #166 | |
Addict
Posts: 274
Karma: 332
Join Date: Nov 2003
Location: San Francisco, USA
Device: Sage, Elipsa, Oasis, Galaxy Tab 8U, S22U
|
Quote:
|
|
02-01-2008, 12:05 AM | #167 |
creator of calibre
Posts: 44,356
Karma: 23661992
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Probably a difference in the two profiles. I just tested newsweek, commandline and GUI were 113s and 116s
I should probably update the wsj profile EDIT: Oldest_article is 3 vs. 7 which probably explains it. Also JTravers, is that the correct print url mapping? Last edited by kovidgoyal; 02-01-2008 at 12:08 AM. |
Advert | |
|
02-01-2008, 02:05 AM | #168 | |
Groupie
Posts: 182
Karma: 1078201
Join Date: Sep 2007
Device: iPad Air 2
|
Quote:
That print url mapping has always worked for me. You could probably clean up the end of the url too, but I've never found that to be necessary. |
|
02-01-2008, 02:11 AM | #169 |
Groupie
Posts: 182
Karma: 1078201
Join Date: Sep 2007
Device: iPad Air 2
|
|
02-05-2008, 06:12 PM | #170 |
Junior Member
Posts: 4
Karma: 10
Join Date: Feb 2008
Device: Sony
|
The Nation -- subcriber info
I've downloaded thenation.py and run web2lrf with it. Sort of works, but I can't quite get it. First problem is that I'm not sure about the dates that need to be inserted (one short and one long). And second (and bigger) problem is that I can't figure where to put my login and password.
Without that, I receive notices about needing to subscribe to download some content. And most of the articles seem to come from web postings, not the actual issue. Any help would be appreciated. |
Advert | |
|
02-11-2008, 11:31 AM | #171 |
Created Sigil, FlightCrew
Posts: 1,982
Karma: 350515
Join Date: Feb 2008
Device: Kobo Clara HD
|
I posted user profiles for Jutarni.hr (the online version of Croatia's most popular newspaper) and USATODAY to the ticket system. I apologize if the ticket system was not the correct way of informing you about them, but it just seemed like it was the right way to do it.
I saw that ticket with all those different requests for news feeds, and if I have the time, I'll try to work through the list. I'm currently working on The New Yorker. Will add it when it's done. If I mess up a profile, please tell me about it and I'll try to fix it. |
02-11-2008, 12:48 PM | #172 |
creator of calibre
Posts: 44,356
Karma: 23661992
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Cool, I'll add them in the next release.
|
02-11-2008, 02:45 PM | #173 |
Created Sigil, FlightCrew
Posts: 1,982
Karma: 350515
Join Date: Feb 2008
Device: Kobo Clara HD
|
Kovid, you have a terrible little bug in web2lrf... maybe not so a bug as a design oversight...
For the last 5 hours I have been attempting to create a The New Yorker user profile, and no matter what I did, the code only retrieved TWO articles from the site... I tried everything... and then I realized what was the problem. Your code that checks the oldest_article variable... It starts at the top of the feed and continues down, checking each article's date. When it finds an article older than the number in oldest_article, it stops checking subsequent articles. WELL! The RSS feeds on TNY website are not sorted by date, but by some quasi-alphabetical sort, so when this code finds an old article at the very top of the feed (very very likely), it doesn't grab the newer ones which are lower in the listing. Please fix this so it checks each and every article in the list. I have uploaded the The New Yorker profile with its oldest_article variable set to 90, it was the only way I could get the newer articles. When you fix the bug, fix the profile accordingly. Everything else about it works fine. |
02-17-2008, 12:16 PM | #174 |
Junior Member
Posts: 4
Karma: 10
Join Date: Jul 2007
Device: Sony PRS-500
|
Kovid, I've said this before but with the Economist profile feel the need to say this again. This program is phenomenal, particularly given its OS independence and the .deb packages and ebuilds. I'm a subscriber to the Economist and will I imagine often use your service rather than reading the paper edition more often than not.
Thanks again. |
02-17-2008, 01:36 PM | #175 |
creator of calibre
Posts: 44,356
Karma: 23661992
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Interesting, I know about the ebuilds, but are the deb packages being maintained as well?
|
02-17-2008, 06:22 PM | #176 | |
Resident Curmudgeon
Posts: 75,917
Karma: 134368292
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Quote:
|
|
02-21-2008, 06:09 PM | #177 |
Addict
Posts: 274
Karma: 332
Join Date: Nov 2003
Location: San Francisco, USA
Device: Sage, Elipsa, Oasis, Galaxy Tab 8U, S22U
|
Get Full WSJ?
Hi guys.
I'm using the WSJ profile and it works very well (thanks to JTravers for the profile). I have a quick question: is is possible to get all the articles from a page, not from a feed? RSS feed for "Today's Newspaper" has only 5 articles from front page plus few more from other sections. I'd like to get as many articles from printed edition ("http://online.wsj.com/page/2_0133.html") as possible. I replaced an existing link with this one, but got a blank page: def get_feeds(self): return [ (' Today\'s Newspaper - All', 'http://online.wsj.com/page/2_0133.html'), ## (' Today\'s Newspaper - Page One', 'http://online.wsj.com/xml/rss/3_7205.xml'), ] Any advise? I want all the links from "http://online.wsj.com/page/2_0133.html" page that have "article" in their address. I don't think I need to change the clean-up part, current profile all the work. This must be a simple question for Kovid, JTravers and others who have created their profiles. Thanks in advance, David |
02-21-2008, 06:13 PM | #178 |
creator of calibre
Posts: 44,356
Karma: 23661992
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
It's certainly doable, but in irder to do it, you have to parse the HTML from that page, see for example the feed for The Atlantic.
|
02-21-2008, 06:17 PM | #179 |
Addict
Posts: 274
Karma: 332
Join Date: Nov 2003
Location: San Francisco, USA
Device: Sage, Elipsa, Oasis, Galaxy Tab 8U, S22U
|
Do you live here? :-)
I didn't see Atlantic under UserProfiles. Where can I find it? Thanks, David |
02-21-2008, 06:31 PM | #180 |
Addict
Posts: 274
Karma: 332
Join Date: Nov 2003
Location: San Francisco, USA
Device: Sage, Elipsa, Oasis, Galaxy Tab 8U, S22U
|
Kovid, ignore my previous message. A quick search and I found the thread about Atlantic.
Have to search first. |
Tags |
libprs500, web2lrf |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
web2lrf to capture blog archive? | Deputy-Dawg | Sony Reader Dev Corner | 1 | 02-14-2008 11:41 PM |
web2lrf: La Repubblica | alexxxm | Sony Reader | 1 | 11-13-2007 12:27 PM |