Old 11-05-2021, 08:53 PM   #5
Karellen
Wizard
Posts: 1,615
Karma: 9500498
Join Date: Sep 2021
Location: Australia
Device: Kobo Libra 2
Hi all, thank you for the responses.

Quote:
Originally Posted by jhowell View Post
The calibre editor is primarily an EPUB 2 editor. KF8 (azw3) format has basically the same content as EPUB 2 but packaged differently so it is not a big stretch to support that also.

MOBI is very similar to KF8 but is based on the ancient HTML 3 standard. I don't know why it isn't supported, perhaps because it is such an outdated format.

KFX on the other hand isn't based on HTML, but is a very proprietary Amazon format. Editing that would be next to impossible due to a lack of any documentation of how it works.
Thanks for the explanation. So if MOBI is such an outdated format, and Amazon is also dropping support for it, why does DeDRM default to MOBI output? I suppose that is a question better asked of the developer.


Quote:
Originally Posted by jhowell View Post
Calibre's conversion process is designed to be able to modify the formatting of books. As part of that it regenerates CSS and flattens class definitions. Personally, I would prefer to have an option to disable that and leave the original formatting intact as much as possible.
Yeah, it would be good if it were left alone. A couple of times I have tried backing up an EPUB novel while I do a bit more editing. I used Convert books > EPUB, and the converted book had its CSS files changed to "calibre1" etc. Instead, I have opted to just do a manual copy/paste to create a temporary backup.
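
For anyone who would rather script that copy/paste step, here is a minimal sketch using only the Python standard library. It makes a byte-for-byte copy, so nothing inside the book (CSS included) is touched. The function name `backup_book` and the timestamped file naming are my own assumptions for illustration, not anything calibre provides.

```python
import shutil
from datetime import datetime
from pathlib import Path


def backup_book(book_path, backup_dir=None):
    """Copy an ebook file unchanged and return the path of the copy.

    Hypothetical helper: the name and the naming scheme are illustrative.
    """
    src = Path(book_path)
    if not src.is_file():
        raise FileNotFoundError(src)

    # Default to the same folder as the original book.
    dest_dir = Path(backup_dir) if backup_dir else src.parent
    dest_dir.mkdir(parents=True, exist_ok=True)

    # A timestamp in the name keeps repeated backups from overwriting each other.
    stamp = datetime.now().strftime("%Y%m%d-%H%M%S")
    dest = dest_dir / f"{src.stem}.backup-{stamp}{src.suffix}"

    # copy2 copies the bytes as-is and preserves file timestamps where it can.
    shutil.copy2(src, dest)
    return dest
```

Calling `backup_book("novel.epub")` would leave a file such as `novel.backup-20211105-205300.epub` next to the original, which can be deleted once the editing is done.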



Quote:
Originally Posted by jhowell View Post
If you are getting books from Kindle for PC/Mac it tends to use the extension .azw regardless of the actual format of the book. DeDRM detects and sets the actual book format. You may be seeing MOBI because that was the actual format of the book.

There is an option in calibre to automatically convert books to another format upon import. (Preferences, Adding Books, Adding actions, Automatically convert added books to the preferred output format.) You may want to check that you have not enabled this by mistake.
I purchase from Amazon, then transfer to Calibre from my Kindle Gen 2 (with the broken screen) and I end up with MOBI.
I wanted to avoid the double conversion, as I thought that was what added all the extra tags: azw(?) > mobi > epub
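
One rough way to check what a file really is, regardless of its extension, is to peek at its header. The sketch below assumes the standard Palm database layout, where the 8-byte type/creator field sits at offset 60 and reads `BOOKMOBI` for Mobipocket-family files. Both MOBI and KF8 (azw3) use that same marker, so this cannot tell those two apart. The function name and the returned labels are made up for illustration.

```python
def sniff_book_container(path):
    """Return a coarse guess at the container type of an ebook file.

    Hypothetical helper: only checks two well-known signatures.
    """
    with open(path, "rb") as f:
        header = f.read(68)

    # Palm database header: type + creator occupy bytes 60..67.
    if header[60:68] == b"BOOKMOBI":
        return "mobipocket-family"  # MOBI or KF8, not distinguished here

    # ZIP local file header signature; EPUB files are ZIP archives.
    if header[:4] == b"PK\x03\x04":
        return "zip-based"

    return "unknown"
```

A result of `mobipocket-family` for a `.azw` file would at least confirm the book was never an EPUB-style package to begin with. Telling MOBI from KF8 needs a deeper look at the MOBI header, which this sketch does not attempt.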


Quote:
Originally Posted by jhowell View Post
The KindleUnpack plugin can unpack from KF8 (azw3) format to EPUB without going through the calibre conversion process. This results in an EPUB that matches the original source file provided by the publisher as closely as possible.
Great. I'll give it a try.


Quote:
Originally Posted by theducks View Post
Conversion does not have an AI (but it does quite well in cases).
GIGO applies to ebooks given to Calibre to convert.

I've seen phrases inside spans, with punctuation outside, then another span.
Not sure what they were after, a non-italic comma or quote????????
The problem is that I could never open the original format to inspect, so I had nothing to compare to the converted ebook I end up with. But after the above comments I now suspect the problem is GIGO just as you state.

I have a small library of 240 books. Most of the Amazon books are decent, but could do with some cleanup because of the excess code. Other books from elsewhere are just so poorly formatted. I have 6 novels where all the paragraphs have been stripped out and each chapter is just one paragraph. Grrrrr. They belong in the "elsewhere" group.