#1
Junior Member
Posts: 1
Karma: 10
Join Date: Oct 2011
Device: Kindle
Recipe for "Robb Wolf"
Trying to play with Calibre instead of fighting with the browser on the Kindle, just for giggles.
I'm going through my various blogs and started with http://www.robbwolf.com. Here's the recipe so far: PHP Code:
Is it possible to use regexps in the keep/remove/etc. tags lines? This is a standard WordPress blog, but only the abstracts are presented. I tried adapting the recipe for Mish's Global Economic Analysis, but I still end up with only the abstracts and no full articles. Any suggestions?
#2
Wizard
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Yes.
Here's how I used it in the Skeptic recipe to remove div tags with an id that started with "follow": Code:
# needs "import re" at the top of the recipe
remove_tags = [
    dict(name='div', attrs={'class':['Introduction','divider']}),
    dict(name='div', attrs={'id':['feature', 'podcast']}),
    # regex match on the id attribute
    dict(name='div', attrs={'id':re.compile(r'follow.*', re.DOTALL|re.IGNORECASE)}),
    dict(name='hr'),
]
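To see which ids a pattern like that would catch, here is a small standalone sketch that can be run outside calibre. The helper `attr_matches` and the sample ids are made up for illustration. It assumes the parser tests attribute values with `search()` semantics (a match anywhere in the value), which I believe is how BeautifulSoup treats compiled patterns, but that is worth checking against your calibre version. Under that assumption `follow.*` also catches ids that merely contain "follow", so an anchored `^follow` is the safer way to say "starts with".

```python
import re

# Same pattern as in the remove_tags entry above.
follow_any = re.compile(r'follow.*', re.DOTALL | re.IGNORECASE)
# Anchored variant: only ids that begin with "follow".
follow_start = re.compile(r'^follow', re.IGNORECASE)


def attr_matches(pattern, value):
    # Hypothetical stand-in for the parser's attribute test,
    # assuming search() semantics (match anywhere in the value).
    return pattern.search(value) is not None


# Made-up ids, not taken from any real page.
sample_ids = ['follow-us', 'Follow_Twitter', 'please-follow', 'sidebar']

matched_any = [i for i in sample_ids if attr_matches(follow_any, i)]
matched_start = [i for i in sample_ids if attr_matches(follow_start, i)]

print(matched_any)
print(matched_start)
```

The same compiled pattern can be dropped into an attrs dict in place of a plain string, as in the Skeptic example.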