|  01-13-2017, 04:56 AM | #1 | 
| Member  Posts: 14 Karma: 10 Join Date: Dec 2016 Device: Tolino | 
				
				How to save each news article to a separate file
			 
			
			Hi all, for sientific purposes I need to split the downloaded news in a bunch of files. So for each article in the newspaper I would get something like this: [title]_[language]_[date]_[time].txt Which I could archive every day. The txt should only contain the title and the text of the article. I don't want any images or table of contens or anything. For NY-Times from today for instance NyTimesSub_en_20170112_2045.txt NyTimesSub_en_20170113_2159.txt . . . NyTimesSub_en_20170113_1041.txt NyTimesSub_en_20170113_1045.txt NyTimesSub_en_20170113_1053.txt What would be the most easy way to achieve this? Thanks for any hint! Last edited by Idefix; 01-13-2017 at 04:59 AM. | 
|   |   | 
|  01-13-2017, 01:23 PM | #2 | 
| Member  Posts: 14 Karma: 10 Join Date: Dec 2016 Device: Tolino | 
			
			No Idea? No hint? It would be a great help if anybody with experience in how to export/convert ebooks would have an approach to this. Should I for example rather learn how to write an output plugin for calibre, or use an existing output format and parse it later on in a certain way? | 
|   |   | 
| Advert | |
|  | 
|  01-13-2017, 01:38 PM | #3 | 
| Well trained by Cats            Posts: 31,249 Karma: 61360164 Join Date: Aug 2009 Location: The Central Coast of California Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A | 
			
			You did not say Separate Book Editor: Split , breaks up a (page) file EPUB Split Plugin to break into Books | 
|   |   | 
|  01-13-2017, 04:02 PM | #4 | 
| Member  Posts: 14 Karma: 10 Join Date: Dec 2016 Device: Tolino | 
			
			thanks for your reply. But what do you mean with "You did not say separate books". I downloaded news. Calibe converts it to a book, right? So I want to split news in articles. Using a editor is not really a solution that can be automated, as doing this for thousands of articles is not a good idea. Can you explain a little further what the epub split plugin can do? And how I can apply it? Is it an option the ebook-convert command? | 
|   |   | 
|  01-13-2017, 04:25 PM | #5 | |
| null operator (he/him)            Posts: 22,010 Karma: 30277294 Join Date: Mar 2012 Location: Sydney Australia Device: none | Quote: 
 BR Last edited by BetterRed; 01-13-2017 at 05:19 PM. Reason: add quote | |
|   |   | 
| Advert | |
|  | 
|  01-13-2017, 10:57 PM | #6 | 
| creator of calibre            Posts: 45,604 Karma: 28548974 Join Date: Oct 2006 Location: Mumbai, India Device: Various | 
			
			EPUB is just a zip file unzip it and you will have all your articles in individual html files -- then do whatever you want with them.
		 | 
|   |   | 
|  | 
| 
 | 
|  Similar Threads | ||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| Seperate news lists by device | Drumm | Recipes | 4 | 10-03-2011 11:39 AM | 
| How to retain a news article? | wilsonch | Calibre | 3 | 04-20-2010 11:12 AM | 
| Unutterably Silly Should we band together to save the liseuse wiki article? | pshrynk | Lounge | 31 | 09-16-2009 10:20 PM | 
| News Article | AJ Starr | News | 0 | 08-14-2009 01:13 PM |