Bulk-check ebook collection for edited XHTML files
I have a large collection of EPUB ebooks, numbering in the thousands. Most of them are retail (original) copies, but some were surreptitiously edited after purchase to insert text into the flow of the book advertising some website or service (this is obviously very annoying when reading).
One way to tell whether a book has been tampered with is to open it with WinRAR and check whether all the XHTML files in the "OEBPS/text" folder have the same last-modified date and time. The files that show a different date and time are the ones I'm targeting.
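For a single book, the manual check described above amounts to roughly the following. This is only a minimal sketch in Python, assuming the chapters sit under "OEBPS/text" with an .xhtml extension as in my books, and assuming the most common timestamp is the original one; the function name `odd_timestamp_files` is made up for illustration.

```python
import zipfile
from collections import Counter


def odd_timestamp_files(epub_path, folder="OEBPS/text/"):
    """Return the XHTML entries whose stored timestamp differs from the
    most common timestamp among the XHTML entries in `folder`."""
    # An EPUB is a ZIP archive, so the per-entry timestamps that WinRAR
    # shows can be read directly with the standard zipfile module.
    with zipfile.ZipFile(epub_path) as zf:
        entries = [
            info for info in zf.infolist()
            if info.filename.startswith(folder)
            and info.filename.lower().endswith(".xhtml")
        ]
    if not entries:
        return []
    # date_time is a (year, month, day, hour, minute, second) tuple.
    # Assumption: the timestamp shared by most files is the retail one.
    usual, _ = Counter(info.date_time for info in entries).most_common(1)[0]
    return [info.filename for info in entries if info.date_time != usual]
```

An empty list means every XHTML file in that folder carries the same timestamp. This is only a heuristic: an editor that rewrites every entry in the archive would leave all timestamps identical again.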
I'm looking for a way to bulk-check the whole collection, without having to open and inspect every single book by hand. Is it possible?
Thank you.