![]() |
#1051 | ||
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,099
Karma: 92190113
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
|
Quote:
Quote:
|
||
![]() |
![]() |
![]() |
#1052 | |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 6,955
Karma: 27060153
Join Date: Apr 2009
Location: USA
Device: iPhone 15PM, Kindle Scribe, iPad mini 6, PocketBook InkPad Color 3
|
Quote:
That said, calibre doesn't convert it to a functioning AZW3 (and never said that it would, but at least it does not get stuck generating something) and Kindle Previewer still thinks there's something wrong with it (without giving any details). As you suggest, it's possible the conversion to KFX does this mangling -- after all KF8 fixed layout never supported text search or hyperlinks or text selection, so why carry forward all the position data and make sure hyperlinks work? Even in ePub universe fixed layout is like an unwanted stepchild. Years after it was specified, they are only now getting around to addressing ePub Fixed Layout Accessibility: https://www.w3.org/news/2024/group-n...accessibility/ https://epubsecrets.com/the-accessiv...ble-comics.php There seems no hope that ePub platforms much less publishers will do anything to implement these practices. Much less Amazon doing so. Last edited by tomsem; 08-03-2025 at 10:18 PM. |
|
![]() |
![]() |
![]() |
#1053 | |
JCL Punch-Card Collector
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 81
Karma: 34468
Join Date: Jun 2014
Location: Antarctica
Device: Aggressively Device Independent
|
Quote:
|
|
![]() |
![]() |
![]() |
#1054 | |||||
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,099
Karma: 92190113
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
|
Quote:
Quote:
In that book the background images have everything the reader sees so the invisible text was possibly intended for annotation, dictionary lookup, and to provide links to other pages in the table of contents. Quote:
The background image for each page becomes a recompressed 1608x1920 JPEG image encapsulated within a single page PDF file. The invisible text from each page is only provided as alt-text to the background image with no links or formatting retained. Quote:
Trying to turn them into something useful would be hit-or-miss so I am not going to attempt it. Quote:
That is the best I can do. The conversion to KFX by Amazon does not leave enough information to properly reconstruct the overlay text and links from the original EPUB that was provided by the publisher. |
|||||
![]() |
![]() |
![]() |
#1055 |
JCL Punch-Card Collector
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 81
Karma: 34468
Join Date: Jun 2014
Location: Antarctica
Device: Aggressively Device Independent
|
Why does what the Grand Sorceror just described sound like "Amazon is trying to do DJVU but proprietarily," especially for the first book described? We've seen this before (remember the origin of .mobi?)...
|
![]() |
![]() |
![]() |
#1056 | |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 6,955
Karma: 27060153
Join Date: Apr 2009
Location: USA
Device: iPhone 15PM, Kindle Scribe, iPad mini 6, PocketBook InkPad Color 3
|
Quote:
The conversion from KFX to PDF is still useful enough: I can OCR the PDF to add text objects and fix the links manually. For this type of book, it seems to me, PDF is more functional than KFX or fixed layout ePub, to extent one wants to annotate or search. There are a couple of GitHub projects that I'm planning to investigate more (both as tools to have around, and maybe creating a calibre plugin...): - https://github.com/mashu3/epub2pdf/ - https://github.com/aourednik/pdf2epub3fixed There is a current Humble Bundle offer for some of the DK books, they would in ePub format (from Kobo.com): https://www.humblebundle.com/books/h...sdk_bookbundle Last edited by tomsem; 08-09-2025 at 12:47 PM. |
|
![]() |
![]() |
![]() |
#1057 | |
eBook Enthusiast
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 85,557
Karma: 93980341
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
|
Quote:
|
|
![]() |
![]() |
![]() |
#1058 |
Resident Curmudgeon
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 79,882
Karma: 146918083
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
|
![]() |
![]() |
![]() |
#1059 |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,099
Karma: 92190113
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
|
Version 2.26.0 - 12 Aug 2025
The "From KFX" toolbar action will now perform conversion using a background job to prevent the user interface from being blocked during conversion. A plugin configuration option controls whether or not a popup message will occur when conversion is complete. The "From KFX" toolbar action will now allow multiple books to be converted in one action. KFX format from all selected books will be converted to the same chosen format if possible. Extract page images from PDF when possible when converting Print Replica and Comic books that contain PDF content to either EPUB or CBZ format in order to improve the quality of the result. Otherwise PDF pages are rendered as JPEG images at 300 DPI. (150 DPI was used previously.) Handle unexpected formatting found in "The Nicomachean Ethics (Penguin Classics)", ASIN B07DF9CLGB. Fixes "box-align right with float right in ...". |
![]() |
![]() |
![]() |
#1060 |
The Dank Side of the Moon
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 35,918
Karma: 119747553
Join Date: Sep 2009
Location: Denver, CO
Device: Kindle2; Kindle Fire
|
Thank You!!
|
![]() |
![]() |
![]() |
#1061 | |
PC Dev
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 56
Karma: 35532
Join Date: Sep 2024
Device: Kindle Paperwhite (11th Gen), Lenovo Tab M10 (3rd Gen)
|
Quote:
instead of just extracting the images (which should take like a second or two, apart from the epub generation) its re-encoding all of them, which takes quite a bit of time and the file size is smaller than the original. i tested 3 books that ive confirmed can be just extracted (which then would be around the same size as the source +/- 1mb). instead i got the following with the newest update of the plugin: book 1: kfx/pdf (88.6mb) -> epub (54.9mb) book 2: kfx/pdf (287mb) -> epub (81.3mb) book 3: kfx/pdf (135mb) -> epub (82.6mb) |
|
![]() |
![]() |
![]() |
#1062 | |
Resident Curmudgeon
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 79,882
Karma: 146918083
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Quote:
|
|
![]() |
![]() |
![]() |
#1063 | |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,099
Karma: 92190113
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
|
Quote:
The plugin renders each PDF page and then checks that the extracted image closely conforms it before that image will be accepted as a substitute. Right now that comparison process is fairly slow. I intend to optimize it in the future. It turns out that the image extraction function in the PDF library I use (pypdf) recompresses the extracted image resulting in lowered image quality. I am working on an improved method that will avoid this. |
|
![]() |
![]() |
![]() |
#1064 |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,099
Karma: 92190113
Join Date: Nov 2011
Location: Charlottesville, VA
Device: Kindles
|
Version 2.26.1 - 14 Aug 2025
Fix reduction in image quality that would occur in the previous plugin release when JPEG images are extracted from PDF pages. |
![]() |
![]() |
![]() |
#1065 | |
PC Dev
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 56
Karma: 35532
Join Date: Sep 2024
Device: Kindle Paperwhite (11th Gen), Lenovo Tab M10 (3rd Gen)
|
Quote:
![]() its the same size as the cover in the calibre library. do kfx files have a separate cover than the included pdf? |
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
KFX conversion, transfer back to library issue. | shoelesshunter | Conversion | 9 | 04-13-2025 11:15 AM |
[Conversion Input] Microsoft Doc Input Plugin | igi | Plugins | 77 | 03-08-2025 04:04 AM |
[Conversion Input] LaTeX Formulas Input Conversion Plugin | sevyls | Plugins | 0 | 03-23-2015 05:52 AM |
[Input Plugin] DOCX Input | SauliusP. | Plugins | 42 | 06-05-2013 04:01 AM |
Looking For MHT Input Conversion Plugin | FlooseMan Dave | Plugins | 4 | 03-30-2010 05:52 PM |