|  03-03-2020, 11:17 AM | #1 | 
| Member  Posts: 18 Karma: 10 Join Date: Feb 2020 Device: Kindle Paperwhite 4 | 
				
				Clear metadata more completely
			 
			
			I'd like to clear the metadata from many ebooks (.EPUB) except for certain fields, namely title, author, and possibly the identifier and source that corresponds to the Project Gutenberg book number. I am currently using Edit metadata in bulk to do this, i.e. Spoiler: 
 However, when I use Edit book, I still find unwanted bits in the content.opf file between the <metadata> tags, including <dc:rights> and <dc:date opf:event="conversion">. Moreover, when I convert from .EPUB to .AZW3, I lose the <dc:identifier opf:scheme="URI" id="id"> and <dc:source> tags, which may not be desirable should I decide to keep/modify these. Here is an example of metadata from A Tale of Two Cities by Charles Dickens found at https://www.gutenberg.org/ebooks/98.epub.images : Original Spoiler: 
 After Edit metadata in bulk Spoiler: 
 Subsequently using Polish books and Convert books (from .EPUB to .AZW3) Spoiler: 
 Is there a more thorough way of clearing metadata that is faster than individually opening and modifying the metadata in Edit books? | 
|   |   | 
|  03-03-2020, 11:25 AM | #2 | 
| Well trained by Cats            Posts: 31,240 Karma: 61360164 Join Date: Aug 2009 Location: The Central Coast of California Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A | 
			
			certain DC metadata tags is mandatory  I never figured why folk feel the need to strip out a few bytes. I can see having those, that exist, Correct   | 
|   |   | 
|  03-03-2020, 11:30 AM | #3 | 
| Member  Posts: 18 Karma: 10 Join Date: Feb 2020 Device: Kindle Paperwhite 4 | 
			
			Yea, it is not an existential crisis lol. I thought I would ask. From https://wiki.mobileread.com/wiki/Metadata , I understand that title, language, and identifier are required.
		 | 
|   |   | 
|  03-03-2020, 11:46 AM | #4 | |
| null operator (he/him)            Posts: 22,006 Karma: 30277294 Join Date: Mar 2012 Location: Sydney Australia Device: none | Quote: 
 BR | |
|   |   | 
|  03-03-2020, 12:18 PM | #5 | 
| Member  Posts: 18 Karma: 10 Join Date: Feb 2020 Device: Kindle Paperwhite 4 | 
			
			@BetterRed Thanks again for the great information. This time, unfortunately, I tested it on A Tale of Two Cities, both the original and after Edit metadata in bulk, and got the following error message: Spoiler: 
 On another note, I noticed some neat-sounding options in the Modify ePub plugin, including Remove calibre bookmark files. I was bothered that calibre remembered the last page viewed without knowing where it saved this data or how to clear it. I suppose calibre is writing to the individual ebooks, or perhaps their separate metadata.opf, when you view them. However, I haven't done a hash check. | 
|   |   | 
|  03-03-2020, 01:29 PM | #6 | |
| Well trained by Cats            Posts: 31,240 Karma: 61360164 Join Date: Aug 2009 Location: The Central Coast of California Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A | Quote: 
  That is controlled from Preferences INSIDE the viewer. | |
|   |   | 
|  03-03-2020, 01:35 PM | #7 | 
| Grand Sorcerer            Posts: 7,155 Karma: 92500001 Join Date: Nov 2011 Location: Charlottesville, VA Device: Kindles | 
			
			The mandatory metadata in EPUB varies by version. Newer versions require a modification date. Metadata in AZW3 format is stored in an EXTH data structure. Calibre fakes the existence of an OPF for editing purposes, but some metadata fields do not translate exactly. | 
|   |   | 
|  03-03-2020, 03:09 PM | #8 | 
| Member  Posts: 18 Karma: 10 Join Date: Feb 2020 Device: Kindle Paperwhite 4 | |
|   |   | 
|  03-03-2020, 03:12 PM | #9 | |
| Member  Posts: 18 Karma: 10 Join Date: Feb 2020 Device: Kindle Paperwhite 4 | Quote: 
 | |
|   |   | 
|  03-03-2020, 03:46 PM | #10 | |
| null operator (he/him)            Posts: 22,006 Karma: 30277294 Join Date: Mar 2012 Location: Sydney Australia Device: none | Quote: 
 BR | |
|   |   | 
|  03-03-2020, 04:20 PM | #11 | 
| Member  Posts: 18 Karma: 10 Join Date: Feb 2020 Device: Kindle Paperwhite 4 | 
			
			I'm using Fedora 31 inside of a virtual machine. I'm not using the latest version anymore, as I found calibre in my repository. It is version 4.10.1.
		 | 
|   |   | 
|  03-03-2020, 06:29 PM | #12 | |
| null operator (he/him)            Posts: 22,006 Karma: 30277294 Join Date: Mar 2012 Location: Sydney Australia Device: none | Quote: 
 Could you try installing the attached version of Modify I wonder what your ePub has that causes it to follow that code path. Let me know if works (it works for me but I wasn't running up against the error), I have asked the person who changed the code last (Rev.Bob) to take a took at this thread, he is the person most familiar with code. Interpreted languages that use code indentation to define scope ain't one of my favourite things - I'm more of a schnitzel with noodles sort of guy   BR Last edited by BetterRed; 03-03-2020 at 06:53 PM. | |
|   |   | 
|  03-03-2020, 07:16 PM | #13 | 
| Member  Posts: 18 Karma: 10 Join Date: Feb 2020 Device: Kindle Paperwhite 4 | 
			
			(Downloading plugins from a forum   ) Thanks for taking the time to review the code, though I regret that I have a similar--if not an identical--error message: Spoiler: 
 If I can offer more about the system that would cause such a headache, feel free to ask. And shoutouts to Python :/ Last edited by gol8erl8; 03-03-2020 at 07:19 PM. | 
|   |   | 
|  03-03-2020, 07:44 PM | #14 | |
| null operator (he/him)            Posts: 22,006 Karma: 30277294 Join Date: Mar 2012 Location: Sydney Australia Device: none | Quote: 
 What options did you check in Modify - did you check Deindent? if so try unchecking it and we might dodge that bullet for now  BTW all the plugins you can install from Preferences plugins originate from here. BR Last edited by BetterRed; 03-03-2020 at 07:50 PM. | |
|   |   | 
|  03-03-2020, 07:59 PM | #15 | 
| null operator (he/him)            Posts: 22,006 Karma: 30277294 Join Date: Mar 2012 Location: Sydney Australia Device: none | 
			
			Please install the latest 'official' release of calibre from here ==>> calibre - Download for Linux I was going to ask earlier but… maybe... who knows. Can't understand why these errors haven't showed up before now - the last update was in Oct 2019. BR Last edited by BetterRed; 03-03-2020 at 08:03 PM. | 
|   |   | 
|  | 
| 
 | 
|  Similar Threads | ||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| How to clear cache? | odamizu | Kindle Developer's Corner | 8 | 04-11-2017 11:15 AM | 
| Completely Automatic Download of Metadata? | saefuob | Library Management | 3 | 03-07-2012 03:41 PM | 
| Did I clear out Best Buy ? | carpetmojo | Sony Reader | 8 | 11-13-2011 04:20 AM | 
| Way to Clear Library? | Lobonca | Calibre | 5 | 01-05-2010 04:24 PM |