Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Formats > Other formats > LRF

Notices

Reply
 
Thread Tools Search this Thread
Old 04-10-2007, 11:19 PM   #181
fritz_the_blank
Member
fritz_the_blank began at the beginning.
 
Posts: 20
Karma: 10
Join Date: Jan 2007
Device: Sony PRS-500
Hosting options

Quote:
Originally Posted by geekraver
The basic requirement of a publish/subscribe server is WebDAV filesystem support; I believe IIS has this and Apache certainly does via the DAV module. If anyone has a reliable server with that then I can make the switch.
My server runs windows with IIS. If you would like to try using that, and it works, you are more than welcome to use me as a mirror or as your primary hosting. It is the least that I can do after all of your work.

FtB
fritz_the_blank is offline   Reply With Quote
Old 04-11-2007, 01:52 AM   #182
geekraver
Addict
geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.
 
Posts: 364
Karma: 1035291
Join Date: Jul 2006
Location: Redmond, WA
Device: iPad Mini,Kindle Paperwhite
It seems that the problem may have been with the router I use (WRT54G) and the way it was configured (long idle timeouts and small connection table size). I have updated to the latest DD-WRT firmware and reconfigured it, so I'd be interested to hear if people start finding the connectivity issues improve.
geekraver is offline   Reply With Quote
Advert
Old 06-12-2007, 01:40 PM   #183
geekraver
Addict
geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.
 
Posts: 364
Karma: 1035291
Join Date: Jul 2006
Location: Redmond, WA
Device: iPad Mini,Kindle Paperwhite
I have just finished migrating my server at home from Linux to FreeBSD. I switched from about 12 years of using FreeBSD to (Gentoo) Linux about 4 years ago, and the server worked well, until a couple of months back when I did my 6-monthly system update. Those of you who use web2book have probably found the publish/subscribe feature almost totally unusable in this period. I never managed to figure out why the machine had become so unreliable, and decided to flatten it, and also go back to FreeBSD, as I have many years experience with the latter and think it will cause me less work. I'd be interested to hear if people find the server more reliable now (if not then I have router trouble I guess).
geekraver is offline   Reply With Quote
Old 06-12-2007, 07:30 PM   #184
Hadrien
Feedbooks.com Co-Founder
Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.Hadrien understands the importance of being earnest.
 
Hadrien's Avatar
 
Posts: 2,263
Karma: 145123
Join Date: Nov 2006
Location: Paris, France
Device: Sony PRS-t-1/350/300/500/505/600/700, Nexus S, iPad
Quote:
Originally Posted by geekraver View Post
I have just finished migrating my server at home from Linux to FreeBSD. I switched from about 12 years of using FreeBSD to (Gentoo) Linux about 4 years ago, and the server worked well, until a couple of months back when I did my 6-monthly system update. Those of you who use web2book have probably found the publish/subscribe feature almost totally unusable in this period. I never managed to figure out why the machine had become so unreliable, and decided to flatten it, and also go back to FreeBSD, as I have many years experience with the latter and think it will cause me less work. I'd be interested to hear if people find the server more reliable now (if not then I have router trouble I guess).
FreeBSD is a kickass OS. We're running Feedbooks on FreeBSD+Lighttpd. Better and faster than the Linux+Apache combo.
Hadrien is offline   Reply With Quote
Old 06-26-2007, 01:54 PM   #185
squeezebag
Junior Member
squeezebag began at the beginning.
 
Posts: 7
Karma: 10
Join Date: Jun 2007
Device: Sony Reader
Help with Web2Book

All:

My Wife and I decided to sell the house, the car, and move to Mexico. One of the things that I would really miss is my subscription to the NewYorker (the surface mail there is useless). Someone told me that I could get a Sony Reader and pick up the weekly subscription via RSS. Imagine my dissapointment when I brought it home and discovered the out-of-the-box RSS support - weak.

Then I stumbled upon GeekRavers excellent software - and thought that my problems were solved. I'm trying to work my way through the examples given in the post but without a lot of luck. I'm wondering if someone could show me what the feed settings should be to achieve my goal.

What I'd like is to retrieve, on a weekly basis, is the full printable versions of the content at this URL: http://www.newyorker.com/services/rss/summary.

If I could end up with a weekly, sony-reader-friendly, version of the NewYorker, I'd be a happy, happy, camper. Any assistance with the settings would be greatly appreciated.
squeezebag is offline   Reply With Quote
Advert
Old 06-29-2007, 03:43 AM   #186
geekraver
Addict
geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.
 
Posts: 364
Karma: 1035291
Join Date: Jul 2006
Location: Redmond, WA
Device: iPad Mini,Kindle Paperwhite
I published an entry for the New Yorker; you should be able to use Subscribe to add it.

It doesn't include the leading pictures; if you want those modify the content extraction pattern to say "start article rail" instead of "start article body".

BTW it doesn't work well with the built-in PDF and RTF converters (due to unprocessed 16-bit characters); seems to work fine with LRF though. I haven't tried it with the htmldoc PDF conversion option.

Last edited by geekraver; 06-29-2007 at 03:46 AM.
geekraver is offline   Reply With Quote
Old 06-29-2007, 04:02 PM   #187
geekraver
Addict
geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.
 
Posts: 364
Karma: 1035291
Join Date: Jul 2006
Location: Redmond, WA
Device: iPad Mini,Kindle Paperwhite
BTW you may have trouble hitting the server. I'm now convinced the issue is with my router (Linksys WRT54G with DD-WRT firmware), and not the server. I can access the server just fine from behind the router, and from outside I have no problem hitting the server on IMAP and SMTP ports, but for some reason HTTP isn't being forwarded even though it is configured just the same.

I think I'll try different firmware on the router tonight.
geekraver is offline   Reply With Quote
Old 07-01-2007, 01:41 AM   #188
adinb
RSS & Gadget Addict!
adinb is on a distinguished road
 
adinb's Avatar
 
Posts: 82
Karma: 67
Join Date: May 2005
Location: Albuquerque, NM
Device: Sony PRS-500, iPod Touch, iPhone
I published a feed for the New Yorker as well.

The issue seems to be that the New Yorker won't allow the robot fetch to go directly to the printable version of the page/article. (cookies, spoofing referring pages, and/or spoofing the user agent string might fix that..I hope to see something like that in future versions of web2book)

One thing that I can't seem to remember how to do--how to get the link reformatter to reference the original link element *and* the regex string fetched via the link extractactor pattern when the "apply extractor to linked content instead of link text" option is selected. From my dim memory, I don't remember being able to really use the link reformatter if the follow option is checked, but I could be *totally* wrong.

Oh, BTW, publish appeared to work for me tonight.

-adin
adinb is offline   Reply With Quote
Old 07-02-2007, 12:31 AM   #189
squeezebag
Junior Member
squeezebag began at the beginning.
 
Posts: 7
Karma: 10
Join Date: Jun 2007
Device: Sony Reader
Regarding the NewYorker feeds,

Thanks a ton. I'm now able to pick up the full articles from the print links (including the pictures and captions). I used the following settings:

URL: http://feeds.newyorker.com/services/...everything.xml
Link Element: Link
Apply extractor to linked contentchecked)
Link Reformatter: {0}?printable=true
Content Extraction pattern: <!-- start article rail -->(.*) <!-- end article body -->

Converts to LRF perfectly. I have two remaining questions.

-I've been able to filter out most of the garbage with the Content Extraction Pattern but I'm still picking up a "keywords" section that I'd like to exclude. Does the Content Extraction thing allow me to extract from A to B, and then from C to D? In other words, there is stuff in the beginning and stuff on the end that I'd like to exclude. There is also a block of stuff in the middle that I'd like to filter out. What's the format for this?

-Also, is there any way to build a table of contents? I can pick up the section summaries from: http://feeds.newyorker.com/services/...everything.xml but is there any way that I can prepend the full extraction with this file? A perfect world would allow me to link from the TOC to the full articles but I'll live with whatever I can get.

Thanks again for your help.

Also, the subscribe function works flawlessly now!
squeezebag is offline   Reply With Quote
Old 07-02-2007, 08:44 PM   #190
geekraver
Addict
geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.
 
Posts: 364
Karma: 1035291
Join Date: Jul 2006
Location: Redmond, WA
Device: iPad Mini,Kindle Paperwhite
For TOC, you have a couple of options: using htmldoc for PDF, or writing your own output plugin that pre-massages the HTML. I may add this as a feature later.

For content extraction, in the regular expression pattern you need to group the various parts you want in parentheses; you then use {0}, {1}, {2}, etc in the formatter to represent the matched blocks. So you might use a pattern like:

<!-- start article rail -->(.*)<foo>.*<bar>(.*)<!-- end article body -->

assuming <foo> started the tag section you wanted to skip and <bar> ended it (".*" represent any sequence of characters, in case you don't know that already) .
geekraver is offline   Reply With Quote
Old 07-08-2007, 12:40 AM   #191
ddavtian
Addict
ddavtian has a complete set of Star Wars action figures.ddavtian has a complete set of Star Wars action figures.ddavtian has a complete set of Star Wars action figures.ddavtian has a complete set of Star Wars action figures.
 
Posts: 271
Karma: 332
Join Date: Nov 2003
Location: San Francisco, USA
Device: Sage, Elipsa, Oasis, Galaxy Tab 8U, S22U
TOC possible?

Asked a question that was answered right in front of my post.
Sorry for stupid post.

Waiting for GeekRaver to add a ToC feature.

Last edited by ddavtian; 07-16-2007 at 07:36 PM. Reason: Already answered
ddavtian is offline   Reply With Quote
Old 07-20-2007, 02:33 PM   #192
_underzcore_
Junior Member
_underzcore_ began at the beginning.
 
Posts: 1
Karma: 10
Join Date: Jul 2007
Device: Sony Reader
Web2book

geekraver, your app is tantalizingly good . . . but I'm getting killed on the feed I'm trying to save (The Economist print edition). There seem to be two big hurdles the way they have the feed set up:

1) It's hosted by a second party, so there's an intermediary link through pheedo.com that then points the browser back to the article at economist.com.

2) The articles at economist.com are stored in different directories (e.g. ".../opinion/," ".../world/la/"). This seems to be fouling up my efforts to change a "displaystory.cfm?story_id=" into a "PrinterFriendly.cfm?story_id=" with just one set of regular expressions. And it's hard to tell if I'm properly sidestepping the pheedo.com blind alley.

Help?
_underzcore_ is offline   Reply With Quote
Old 07-27-2007, 02:53 PM   #193
flamaest
Groupie
flamaest ought to be getting tired of karma fortunes by now.flamaest ought to be getting tired of karma fortunes by now.flamaest ought to be getting tired of karma fortunes by now.flamaest ought to be getting tired of karma fortunes by now.flamaest ought to be getting tired of karma fortunes by now.flamaest ought to be getting tired of karma fortunes by now.flamaest ought to be getting tired of karma fortunes by now.flamaest ought to be getting tired of karma fortunes by now.flamaest ought to be getting tired of karma fortunes by now.flamaest ought to be getting tired of karma fortunes by now.flamaest ought to be getting tired of karma fortunes by now.
 
Posts: 155
Karma: 1044459
Join Date: Jul 2007
Device: prs-500
Full RSS feed links anyone..?

Does someone have a list of RSS feeds which are FULL article feeds..?

from any news sources, I don't care...

Most of the one's i find are intro-snippet only..

POST YOUR FULL RSS URLs.. please..?

Help??
F.
flamaest is offline   Reply With Quote
Old 07-27-2007, 03:15 PM   #194
geekraver
Addict
geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.
 
Posts: 364
Karma: 1035291
Join Date: Jul 2006
Location: Redmond, WA
Device: iPad Mini,Kindle Paperwhite
Quote:
Originally Posted by flamaest View Post
Does someone have a list of RSS feeds which are FULL article feeds..?

from any news sources, I don't care...

Most of the one's i find are intro-snippet only..

POST YOUR FULL RSS URLs.. please..?

Help??
F.
Much of the usefulness of web2book is that it turns partial feeds into full feeds. Just try use the subscribe feature (on the file menu). My server might be a bit slow as it is doing a on-line backup but be patient and you should get a list.
geekraver is offline   Reply With Quote
Old 07-27-2007, 03:17 PM   #195
geekraver
Addict
geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.geekraver ought to be getting tired of karma fortunes by now.
 
Posts: 364
Karma: 1035291
Join Date: Jul 2006
Location: Redmond, WA
Device: iPad Mini,Kindle Paperwhite
I'm heading out on vacation; I'll respond when I get back.

Quote:
Originally Posted by _underzcore_ View Post
geekraver, your app is tantalizingly good . . . but I'm getting killed on the feed I'm trying to save (The Economist print edition). There seem to be two big hurdles the way they have the feed set up:

1) It's hosted by a second party, so there's an intermediary link through pheedo.com that then points the browser back to the article at economist.com.

2) The articles at economist.com are stored in different directories (e.g. ".../opinion/," ".../world/la/"). This seems to be fouling up my efforts to change a "displaystory.cfm?story_id=" into a "PrinterFriendly.cfm?story_id=" with just one set of regular expressions. And it's hard to tell if I'm properly sidestepping the pheedo.com blind alley.

Help?
geekraver is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
rss2book release 20 now available geekraver Sony Reader 4 01-26-2007 01:36 PM
rss2book release 19 geekraver Sony Reader 2 12-30-2006 10:51 AM
rss2book release 18 geekraver Sony Reader 0 12-22-2006 03:57 AM
rss2book release 16 geekraver Sony Reader 1 12-13-2006 05:56 AM
rss2book release 13 geekraver Sony Reader 0 11-13-2006 02:41 AM


All times are GMT -4. The time now is 05:45 AM.


MobileRead.com is a privately owned, operated and funded community.