#1 |
Connoisseur
Posts: 85
Karma: 10
Join Date: Dec 2015
Device: Kindle
|
New York Post recipe extraneous sections
Downloads from the standard New York Post recipe contain several extraneous sections that appear in many of the articles. The attached text file shows an example of these sections as they appear in the downloaded mobi (Social Links, View Author Archive, Contact The Author, More On, More From).
Is there a way to remove these? I've tried several remove_tags commands but have been unable to translate all of the nested class definitions involved into a proper command. Thanks in advance.
#2 |
creator of calibre
Posts: 45,256
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
#3 |
Connoisseur
Posts: 85
Karma: 10
Join Date: Dec 2015
Device: Kindle
|
Looks great! I then added this to keep_only_tags to include the byline, author, and date. The last three strings are what I added (note the trailing space inside each string, since Python joins adjacent string literals without one).
classes('byline byline-date source article-info entry-content entry-content-read-more featured-image ' 'headline headline--single ' 'date meta meta--byline ' 'Date published'),
Thanks so much.
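For anyone adapting this, here is a minimal standalone sketch of how a class-matching entry like the one above behaves. The `classes` function here is a reimplementation modeled on calibre's helper of the same name so that it runs without calibre installed; treat it as an assumption about the behavior, not the library source. The `matches` function is a hypothetical helper added only for illustration. The sketch also shows a Python detail that matters when a class list is split across several string literals: adjacent literals are joined with no space between them.

```python
def classes(names):
    """Return a tag spec matching any tag that carries at least one of the
    space-separated class names (standalone stand-in for calibre's helper)."""
    wanted = frozenset(names.split())
    return dict(attrs={
        'class': lambda value: bool(value) and bool(frozenset(value.split()) & wanted)
    })


def matches(spec, class_attr):
    """Hypothetical helper: apply a spec to the value of a tag's class attribute."""
    return spec['attrs']['class'](class_attr)


# Adjacent string literals are concatenated with no separator, so these two
# pieces fuse 'featured-image' and 'headline' into a single bogus class name.
joined = 'featured-image' 'headline headline--single'

# Keeping a trailing space inside each piece preserves every class name.
keep_only_tags = [
    classes('byline byline-date source article-info entry-content '
            'entry-content-read-more featured-image '
            'headline headline--single '
            'date meta meta--byline '
            'Date published'),
]
```

With the spaces in place, `matches(keep_only_tags[0], 'headline headline--single')` is true, while a spec built from `joined` no longer matches a tag whose class is `featured-image`.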