![]() |
#1 |
Addict
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 294
Karma: 107414
Join Date: May 2013
Device: Kobo Glo
|
Bloated epub file sizes?
I am curious, and a quick search on the forum didn't yield an answer:
Why is the so little correlation between the length of a book and its file size? I am not talking about books containing lots of pictures here, just formatted text in a regular novel. One book of average length might weigh in at 400kB, while another, of similar length and also having no images in there, might be 2,5 MB. What is it that bloats these file sizes so? |
![]() |
![]() |
![]() |
#2 | |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,058
Karma: 54671821
Join Date: Feb 2012
Location: New England
Device: PW 1, 2, 3, Voyage, Oasis 2 & 3, Fires, Aura HD, iPad
|
Quote:
I'm sure someone who DOES edit epubs will come along soon and either confirm what I said or ridicule me for not knowing what I'm talking about ![]() Shari |
|
![]() |
![]() |
![]() |
#3 |
eBook Enthusiast
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 85,544
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
|
Please post in the correct forum. Moved to the ePub file format forum.
|
![]() |
![]() |
![]() |
#4 |
eBook Enthusiast
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 85,544
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
|
You need to edit the book in an ePub editor such as Sigil or Calibre and see what's causing the bloat. It's probably caused by an over-proliferation of CSS styles.
|
![]() |
![]() |
![]() |
#5 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
If there is a cover, than I should look at the cover size for sure. An average book with only text can not be 2.5MB. Using a different compression level will have some influence.
Bad formatting will have impact for sure, but for an average book to be 2.5MB it must be seriously bad formatted... |
![]() |
![]() |
![]() |
#6 |
Connoisseur
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 80
Karma: 1184732
Join Date: Nov 2013
Device: Kobo Glo
|
Embedded font bloat books.
|
![]() |
![]() |
![]() |
#7 |
Unicycle Daredevil
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 13,944
Karma: 185432100
Join Date: Jan 2011
Location: Planet of the Pudding Brains
Device: Aura HD (R.I.P. After six years the USB socket died.) tolino shine 3
|
Yep. Especially CharisSIL, which is embedded in many retail epubs.
|
![]() |
![]() |
![]() |
#8 |
Resident Curmudgeon
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 79,756
Karma: 145864619
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
But using Calibre to subset embedded fonts really does cut down the file size. Also, if the cover is either very large and/or not all that compressed, it can be a large file size. So reducing the cover image size and recompressing can help.
|
![]() |
![]() |
![]() |
#9 |
Unicycle Daredevil
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 13,944
Karma: 185432100
Join Date: Jan 2011
Location: Planet of the Pudding Brains
Device: Aura HD (R.I.P. After six years the USB socket died.) tolino shine 3
|
|
![]() |
![]() |
![]() |
#10 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 2,306
Karma: 13057279
Join Date: Jul 2012
Device: Kobo Forma, Nook
|
|
![]() |
![]() |
![]() |
#11 | |
Resident Curmudgeon
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 79,756
Karma: 145864619
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Quote:
Thumbnails are a common waste of space and Modify ePub deletes those no problem. |
|
![]() |
![]() |
![]() |
#12 | |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 31,057
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
![]() I will add: Retailer demanded Bloat. Thumbnail covers, Hi-Def covers. The best gain on sub-setting a font would be for those 'Display' fonts that might be only used for Chapter titles, Initial Letters. ![]() |
|
![]() |
![]() |
![]() |
#13 | |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,613
Karma: 6718541
Join Date: Dec 2004
Location: Paradise (Key West, FL)
Device: Current:Surface Go & Kindle 3 - Retired: DellV8p, Clie UX50, ...
|
Quote:
|
|
![]() |
![]() |
![]() |
#14 |
Resident Curmudgeon
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 79,756
Karma: 145864619
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
CharisSIL is a large font because of how many extended characters is contains. Most of that is unused. So when you subset, you get rid of the fonts not used and the characters not used. Most of the time, you will get rid of the bold italic version of a font. You can cut down the size of CharisSIL (on average) to between 200-300K verses about 1.2-1.3MB per font file.
As for the large graphics, this is because people read tablets with high resolution screens and tiny 800x600 sized graphics just don't cut it all that well. |
![]() |
![]() |
![]() |
#15 |
Addict
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 294
Karma: 107414
Join Date: May 2013
Device: Kobo Glo
|
Thanks everybody for the responses! I am sorry I posted in the wrong forum - but it was because though I personally use epubs, I assumed this was a universal problem that plagued all formats.
I _do_ try to find the highest-res cover art that I can for all my books, but when converted to B/W they don't take up all that much space. I'll experiment a little with removing embedded fonts with Calibre, but cleaning up bloated CSS styles for 900 books is not going to happen :-) |
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Calibre ebook-viewer.exe changes EPUB file sizes? | avid01 | Calibre | 23 | 04-11-2018 04:24 AM |
File sizes - why the difference? | Araucaria | Sigil | 4 | 11-22-2011 07:52 PM |
Book/file sizes in Calibre | cavgirl | Calibre | 2 | 11-12-2010 08:15 PM |
Epub file sizes | jerryleejr | Sony Reader | 6 | 07-28-2008 03:09 PM |