Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Readers > Sony Reader

Notices

Reply
 
Thread Tools Search this Thread
Old 11-13-2007, 10:02 AM   #1
alexxxm
Addict
alexxxm has a complete set of Star Wars action figures.alexxxm has a complete set of Star Wars action figures.alexxxm has a complete set of Star Wars action figures.alexxxm has a complete set of Star Wars action figures.
 
Posts: 223
Karma: 356
Join Date: Aug 2007
Device: Rocket; Hiebook; N700; Sony 505; Kindle DX ...
web2lrf: La Repubblica

Hi everybody (hi Kovid!),
I read a bit about web2lrf, and came up with a profile for downloading the news from the italian newspaper "La Repubblica".
It is not perfect of course, until this morning I knew everything about Perl and nothing at all about Python - any feedback is welcome.
I hope I did the right thing putting the code here.

Let me know if it works for you too...

Alessandro


Code:
import re

from libprs500.ebooks.lrf.web.profiles import DefaultProfile


class LaRepubblica(DefaultProfile):

        title = 'La Repubblica Feed'
        max_recursions = 2

        preprocess_regexps = \
                    [ (re.compile(i[0], re.IGNORECASE | re.DOTALL), i[1]) for i in 
                      [
                        (r'<div id="ge-network-top">.*?</div>', lambda match : ''),
                        (r'<div id="ge-network-middle">.*?</div>', lambda match : ''),
                        (r'<div id="ge-network-bottom">.*?</div>', lambda match : ''),
                        (r'<div id="cerca">.*?</div>', lambda match : ''),
                        (r'<div id="topmenu">.*?</div>', lambda match : ''),
                        (r'<div id="menu">.*?</div>', lambda match : ''),
                        (r'<div id="stripa">.*?</div>', lambda match : ''),
                        (r'<div id="stripb">.*?</div>', lambda match : ''),
                        (r'<div id="gee-contA">.*?</div>', lambda match : ''),
                        (r'<div id="addons">.*?</div>', lambda match : ''),
                        (r'<div id="menu">.*?</div>', lambda match : ''),
                        (r'<div id="newprefooter">.*?</div>', lambda match : ''),
                        (r'<div id="newfooter">.*?</div>', lambda match : ''),
                        (r'<div id="update">.*?</div>', lambda match : ''),
                        (r'<div id="menu">.*?</div>', lambda match : ''),
                        (r'<div id="menu">.*?</div>', lambda match : ''),
                        (r'<div id="menu">.*?</div>', lambda match : ''),
                        (r'<div class="wikipedia">.*?</div>', lambda match : ''),
                        (r'<div class="contselect">.*?</div>', lambda match : ''),
                        (r'<div class="generalbox gen">.*?</div>', lambda match : ''),
                       ]
                ]

        def get_feeds(self):
                return [ ('Feed 1', 'http://www.repubblica.it/rss/homepage/rss2.0.xml') ]
alexxxm is offline   Reply With Quote
Old 11-13-2007, 12:27 PM   #2
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 43,779
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Looks fine to me. If you email/PM me with your account name at libprs500, I'll give you write access to the wiki and then you can upload the file to the UserProfiles section so other people who come there can use it.
kovidgoyal is online now   Reply With Quote
Advert
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
web2lrf kovidgoyal LRF 353 09-10-2008 07:41 AM
web2lrf to capture blog archive? Deputy-Dawg Sony Reader Dev Corner 1 02-14-2008 11:41 PM


All times are GMT -4. The time now is 10:58 AM.


MobileRead.com is a privately owned, operated and funded community.