11-09-2021, 10:09 AM   #1
jma1
New York Post recipe extraneous sections

Downloads from the standard New York Post recipe contain several extraneous sections that appear in many of the articles. The attached text file shows an example of these sections as they appear in the downloaded MOBI (Social Links, View Author Archive, Contact The Author, More On, More From).

Is there a way to remove these? I've tried several remove_tags commands but am unable to translate all of the nested class definitions involved into a proper command. Thanks in advance.
Attached Files
File Type: txt New York Post Article extraneous sections.txt (883 Bytes, 131 views)
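
For anyone with the same question, here is a minimal sketch of how class-based matching in `remove_tags` can be written. It assumes the `classes` helper that many built-in recipes import from `calibre.web.feeds.news`; the function below is only a stand-in that mimics it so the snippet runs without calibre, and the class names are hypothetical placeholders, not the real New York Post markup.

```python
# Stand-in that mimics calibre's `classes` helper so this runs without calibre.
# In an actual recipe you would instead write:
#     from calibre.web.feeds.news import classes
def classes(names):
    wanted = frozenset(names.split())
    # A tag matches when its class attribute shares at least one name with `wanted`.
    return dict(attrs={
        'class': lambda value: bool(value) and bool(frozenset(value.split()) & wanted)
    })


# Hypothetical class names, for illustration only.
# Inspect the page source to find the real ones.
remove_tags = [classes('social-links author-archive more-on')]

matches = remove_tags[0]['attrs']['class']
print(matches('widget more-on sidebar'))  # one shared class name is enough
print(matches('entry-content'))           # no shared class name
print(matches(None))                      # tag without a class attribute
```

The point is that a tag carrying several classes does not need its full class list spelled out. Naming any one of its classes is enough for the matcher to select it.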
11-09-2021, 10:39 AM   #2
kovidgoyal
https://github.com/kovidgoyal/calibr...4cfc66a83cb155
11-09-2021, 02:11 PM   #3
jma1
Looks great! I then added the following to keep_only_tags to include the byline author and date. The last three strings are the ones I added.

classes('byline byline-date source article-info entry-content entry-content-read-more featured-image '
        'headline headline--single '
        'date meta meta--byline '
        'Date published'),

Thanks so much.
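
A note for anyone copying a multi-line `classes(...)` call like the one in this post: Python joins adjacent string literals with nothing in between, so each piece needs a space at the boundary or the class names on either side fuse into one. A small sketch using two of the strings from the post (the variable names are just for illustration):

```python
# Adjacent string literals are concatenated with no separator.
without_spaces = ('featured-image'
                  'headline headline--single')
with_spaces = ('featured-image '
               'headline headline--single')

print(without_spaces.split())  # ['featured-imageheadline', 'headline--single']
print(with_spaces.split())     # ['featured-image', 'headline', 'headline--single']
```

Without the space, `featured-image` and `headline` are no longer separate names in the list, so tags with either class would presumably stop matching.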