I'd like to suggest/request three new features:
1. Limited HTML cleanup, specifically dealing with <span> bloat:
a. Remove unstyled <span> elements. Pretty self-explanatory; these add nothing to the book's formatting and can be safely removed.
b. Combine adjacent <span> tags that surround the same content. For instance, <span class="a"><span class="b">blah blah</span></span> is exactly equivalent to <span class="a b">blah blah</span>.
2. Unwrap paragraphs. Pretty basic rule (replace linefeeds inside paragraphs with single spaces), but automating it would be so nice...
3. For lack of a better word, de-Kobo-ify:
On the occasions when one must grab an ebook from the installed Kobo software's library, such as when Kobo doesn't provide a download button, that book includes a reference to a Kobo CSS and JavaScript file, as well as several Kobo-specific <span> elements. All of those can be removed without affecting the book itself; it's just a matter of doing so. If software like [deleted] was used to make the extraction, there may also be a rights.xml file that can be similarly removed.
Also, a tweak to the "remove ADEPT" feature that removes the linefeed at the end of that meta tag would be nice, just for aesthetic reasons.
Last edited by pdurrant; 10-28-2014 at 05:09 AM.
Reason: Clarification.
|