View Single Post
Old 12-21-2020, 04:30 PM   #21
jackie_w
Grand Sorcerer
jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.
 
Posts: 6,252
Karma: 16544692
Join Date: Sep 2009
Location: UK
Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3
Quote:
Originally Posted by mirage View Post
I notice that some of the metadata in the opf file of an epub does not get scrambled.

Code:
<dc:creator id="id">... ...</dc:creator>
    <dc:identifier>isbn:...</dc:identifier>
    <dc:identifier>google:...</dc:identifier>
    <dc:identifier>goodreads:...</dc:identifier>
    <dc:identifier>amazon:...</dc:identifier>
    <dc:identifier>kobo:...</dc:identifier>
    <dc:identifier>calibre:...</dc:identifier>
    <dc:identifier>uuid:...-...-...-...-...</dc:identifier>
    <dc:identifier id="uid">...</dc:identifier>
    <dc:source id="src-id">urn:isbn:...</dc:source>
    <dc:description>*removed*</dc:description>
    <dc:publisher>... </dc:publisher>
I replaced the unscrambled data with periods myself. But post-scramble, the original data was still there. Maybe this is because it was info that had been inserted by Calibre actions generated by me, at least in some cases? More specifically, the identifiers came when I used the "Edit metadata" window to download the book's metadata

Anyway, I thought I'd point it out. Had I not looked within the file, I'd have unintentionally posted unscrambled identifying data.

I'd vote for scrambling the <dc:description> content instead of removing it, at least if the amount of work is not disproportional to the value of preserving that data.

No need for anyone to do any of this on my account, of course. Mainly I wanted to alert people that opf content may not get scrambled, so be aware.
The MR Moderator team decided it would not be necessary to scramble OPF Title, Author or any identifier codes e.g. ISBN, MOBI-ASIN.

They did request that some calibre metadata was removed including calibre custom metadata and description to be on the safe side.

Here are a couple of relevant Mod posts:
https://www.mobileread.com/forums/sh...5&postcount=63
https://www.mobileread.com/forums/sh...&postcount=120

If you run the plugin from the commandline there is an option to set stricter rules for metadata removal. In all honesty I'm not sure how many users ever do this.

Last edited by jackie_w; 12-21-2020 at 04:33 PM.
jackie_w is offline   Reply With Quote