Device: Kindle App (Mostly on a Samsung Galaxy Smartphone)
PDF goes from 38 MB to 2.5 GB when converted to Epub?
Hi
I was trying to convert a book from a PDF to Epub so I could upload it to the kindle app, but when I converted it it ballooned up to 2.5 GB, which is far above the limit for send to kindle, or the email version of it to send it to my kindle library.
Any ideas why or what can be done to resolve this?
Please see the attached image of one of the pages (they are all images of scanned pages in the PDF),
Page-464
Retrieving document metadata...
Generating manifest...
Rendering manifest...
Parsing all content...
Parsing index.html ...
Generating default TOC from spine...
Merging user specified metadata...
Detecting structure...
Auto generated TOC with 0 entries.
Flattening CSS and remapping font sizes...
Source base font size is 12.00000pt
Removing fake margins...
Found 928 items of level: p_1
p_1 left margin stats: Counter({'0': 928})
p_1 right margin stats: Counter({'0': 928})
Cleaning up manifest...
Trimming unused files from manifest...
Trimming 'index.xml' from manifest
Creating EPUB Output...
Splitting markup on page breaks and flow limits, if any...
Looking for large trees in index.html...
No large trees found
This EPUB file has no Table of Contents. Creating a default TOC
EPUB output written to C:\Users\vuther316\AppData\Local\Temp\calibre-0xv_ea_s\5p61ksc8.epub
Last edited by BetterRed; Today at 03:51 AM.
Reason: SPOILER LOG files (@td) Thumbnail image (@BR)
Presumably this is a scan pdf i.e. all the pages are images. It will be because in EPUB the images are stored in a less space efficient format (EPUB and PDF dont support the same set of image formats). Open the EPUB in the edit book tool and see for yourself.
Device: iPhone 15PM, Kindle Scribe, iPad mini 6, PocketBook InkPad Color 3
Check the page size of the PDF pages.
These should be exactly the same as the physical copy. You will need some PDF editor to change them, or maybe some command line tool can do this. I sometimes discover page size is huge, like 32in x 24in, and conversion can expand whatever the pixel resolution to match. The PDF image objects themselves do not have PPI property, and scale according to the page dimensions.
Kindle Create, for example, can import PDF as image container for comic format. It exhibits exactly this bloating behavior.