#1 |
Member
Posts: 14
Karma: 10
Join Date: Dec 2016
Device: Tolino
|
How to save each news article to a separate file
Hi all,
for scientific purposes I need to split the downloaded news into a bunch of files, one per article. For each article in the newspaper I would get something like this:
[title]_[language]_[date]_[time].txt
which I could archive every day. The txt should contain only the title and the text of the article. I don't want any images, table of contents, or anything else. For the NY Times from today, for instance:
NyTimesSub_en_20170112_2045.txt
NyTimesSub_en_20170113_2159.txt
. . .
NyTimesSub_en_20170113_1041.txt
NyTimesSub_en_20170113_1045.txt
NyTimesSub_en_20170113_1053.txt
What would be the easiest way to achieve this? Thanks for any hint!
Last edited by Idefix; 01-13-2017 at 04:59 AM. |
#2 |
Member
Posts: 14
Karma: 10
Join Date: Dec 2016
Device: Tolino
|
No Idea? No hint?
It would be a great help if anybody with experience in exporting/converting ebooks had an approach to this. Should I, for example, learn how to write an output plugin for calibre, or rather use an existing output format and parse it afterwards? |
#3 |
Well trained by Cats
Posts: 31,174
Karma: 60406498
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
You did not say Separate Book
Editor: Split, breaks up a (page) file.
EPUB Split Plugin to break into Books. |
#4 |
Member
Posts: 14
Karma: 10
Join Date: Dec 2016
Device: Tolino
|
Thanks for your reply.
But what do you mean by "You did not say Separate Book"? I downloaded news. Calibre converts it to a book, right? So I want to split the news into articles. Using an editor is not really a solution that can be automated, and doing this by hand for thousands of articles is not a good idea. Can you explain a little further what the EPUB Split plugin can do, and how I can apply it? Is it an option of the ebook-convert command? |
#5 | |
null operator (he/him)
Posts: 21,902
Karma: 30277270
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
BR Last edited by BetterRed; 01-13-2017 at 05:19 PM. Reason: add quote |
|
#6 |
creator of calibre
Posts: 45,527
Karma: 28548962
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
An EPUB is just a zip file. Unzip it and you will have all your articles in individual HTML files; then do whatever you want with them.
|
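The unzip-and-process approach from post #6 could be sketched roughly as follows, using only the Python standard library. This is a minimal sketch under several assumptions, not calibre's own API: `split_epub` and `html_to_text` are hypothetical helper names, the date and time in the file name are taken from each ZIP entry's modification time as a stand-in for the article time, and a running index is appended so that two articles with the same title and minute do not overwrite each other. A calibre news EPUB may also contain index and navigation pages, which this sketch does not filter out.

```python
import re
import zipfile
from html.parser import HTMLParser
from pathlib import Path


class _TextExtractor(HTMLParser):
    """Collects the <title> and the visible text, skipping script/style."""

    BLOCK = {"p", "div", "br", "h1", "h2", "h3", "h4", "h5", "h6", "li", "tr"}
    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.title = ""
        self.parts = []
        self._in_title = False
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag in self.SKIP:
            self._skip_depth += 1
        elif tag in self.BLOCK:
            self.parts.append("\n")  # block elements start a new line

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1
        elif tag in self.BLOCK:
            self.parts.append("\n")

    def handle_data(self, data):
        if self._in_title:
            self.title += data
        elif not self._skip_depth:
            self.parts.append(data)


def html_to_text(markup):
    """Return (title, body_text) for one HTML document."""
    parser = _TextExtractor()
    parser.feed(markup)
    parser.close()
    lines = (" ".join(line.split()) for line in "".join(parser.parts).splitlines())
    return parser.title.strip(), "\n".join(line for line in lines if line)


def split_epub(epub_path, out_dir, lang="en"):
    """Write one .txt per HTML file found inside the EPUB (hypothetical helper)."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    with zipfile.ZipFile(epub_path) as epub:
        for info in epub.infolist():
            if not info.filename.lower().endswith((".html", ".xhtml", ".htm")):
                continue  # skip images, stylesheets, metadata
            title, body = html_to_text(epub.read(info).decode("utf-8", "replace"))
            if not body:
                continue
            name = title or Path(info.filename).stem
            safe = re.sub(r"[^\w-]+", "_", name).strip("_")
            # Assumption: the ZIP entry timestamp stands in for the article time.
            stamp = "%04d%02d%02d_%02d%02d" % info.date_time[:5]
            target = out_dir / f"{safe}_{lang}_{stamp}_{len(written):04d}.txt"
            target.write_text(f"{title}\n\n{body}\n", encoding="utf-8")
            written.append(target)
    return written
```

Calling `split_epub("news.epub", "articles")` on a downloaded news EPUB would then leave one text file per HTML page in the `articles` folder; the file names here are placeholders.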