|  07-17-2014, 06:42 AM | #1 | 
| Addict            Posts: 294 Karma: 107414 Join Date: May 2013 Device: Kobo Glo | 
				
				Bloated epub file sizes?
			 
			
			I am curious, and a quick search on the forum didn't yield an answer: Why is the so little correlation between the length of a book and its file size? I am not talking about books containing lots of pictures here, just formatted text in a regular novel. One book of average length might weigh in at 400kB, while another, of similar length and also having no images in there, might be 2,5 MB. What is it that bloats these file sizes so? | 
|   |   | 
|  07-17-2014, 06:47 AM | #2 | |
| Wizard            Posts: 3,068 Karma: 54671821 Join Date: Feb 2012 Location: New England Device: PW 1, 2, 3, Voyage, Oasis 2 & 3, Fires, Aura HD, iPad | Quote: 
 I'm sure someone who DOES edit epubs will come along soon and either confirm what I said or ridicule me for not knowing what I'm talking about  Shari | |
|   |   | 
| Advert | |
|  | 
|  07-17-2014, 06:56 AM | #3 | 
| eBook Enthusiast            Posts: 85,560 Karma: 93980341 Join Date: Nov 2006 Location: UK Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6 | 
			
			Please post in the correct forum. Moved to the ePub file format forum.
		 | 
|   |   | 
|  07-17-2014, 07:01 AM | #4 | 
| eBook Enthusiast            Posts: 85,560 Karma: 93980341 Join Date: Nov 2006 Location: UK Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6 | 
			
			You need to edit the book in an ePub editor such as Sigil or Calibre and see what's causing the bloat. It's probably caused by an over-proliferation of CSS styles.
		 | 
|   |   | 
|  07-17-2014, 07:45 AM | #5 | 
| Wizard            Posts: 4,520 Karma: 121692313 Join Date: Oct 2009 Location: Heemskerk, NL Device: PRS-T1, Kobo Touch, Kobo Aura | 
			
			If there is a cover, than I should look at the cover size for sure. An average book with only text can not be 2.5MB. Using a different compression level will have some influence. Bad formatting will have impact for sure, but for an average book to be 2.5MB it must be seriously bad formatted... | 
|   |   | 
| Advert | |
|  | 
|  07-17-2014, 08:15 AM | #6 | 
| Connoisseur            Posts: 80 Karma: 1184732 Join Date: Nov 2013 Device: Kobo Glo | 
			
			Embedded font bloat books.
		 | 
|   |   | 
|  07-17-2014, 08:18 AM | #7 | 
| Unicycle Daredevil            Posts: 13,944 Karma: 185432100 Join Date: Jan 2011 Location: Planet of the Pudding Brains Device: Aura HD (R.I.P. After six years the USB socket died.) tolino shine 3 | 
			
			Yep. Especially CharisSIL, which is embedded in many retail epubs.
		 | 
|   |   | 
|  07-17-2014, 08:21 AM | #8 | 
| Resident Curmudgeon            Posts: 80,740 Karma: 150249619 Join Date: Nov 2006 Location: Roslindale, Massachusetts Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 | 
			
			But using Calibre to subset embedded fonts really does cut down the file size. Also, if the cover is either very large and/or not all that compressed, it can be a large file size. So reducing the cover image size and recompressing can help.
		 | 
|   |   | 
|  07-17-2014, 08:29 AM | #9 | 
| Unicycle Daredevil            Posts: 13,944 Karma: 185432100 Join Date: Jan 2011 Location: Planet of the Pudding Brains Device: Aura HD (R.I.P. After six years the USB socket died.) tolino shine 3 | |
|   |   | 
|  07-17-2014, 08:37 AM | #10 | 
| Wizard            Posts: 2,306 Karma: 13057279 Join Date: Jul 2012 Device: Kobo Forma, Nook | 
 | 
|   |   | 
|  07-17-2014, 09:16 AM | #11 | |
| Resident Curmudgeon            Posts: 80,740 Karma: 150249619 Join Date: Nov 2006 Location: Roslindale, Massachusetts Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 | Quote: 
 Thumbnails are a common waste of space and Modify ePub deletes those no problem. | |
|   |   | 
|  07-17-2014, 11:28 AM | #12 | |
| Well trained by Cats            Posts: 31,249 Karma: 61360164 Join Date: Aug 2009 Location: The Central Coast of California Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A | Quote: 
  I will add: Retailer demanded Bloat. Thumbnail covers, Hi-Def covers. The best gain on sub-setting a font would be for those 'Display' fonts that might be only used for Chapter titles, Initial Letters.  Is there a EASY way to later determine that the books font file has been sub-setted  ( I was thinking of a Quality check PI type test)? | |
|   |   | 
|  07-17-2014, 02:21 PM | #13 | |
| Wizard            Posts: 1,613 Karma: 6718541 Join Date: Dec 2004 Location: Paradise (Key West, FL) Device: Current:Surface Go & Kindle 3 - Retired: DellV8p, Clie UX50, ... | Quote: 
 | |
|   |   | 
|  07-17-2014, 02:49 PM | #14 | 
| Resident Curmudgeon            Posts: 80,740 Karma: 150249619 Join Date: Nov 2006 Location: Roslindale, Massachusetts Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 | 
			
			CharisSIL is a large font because of how many extended characters is contains. Most of that is unused. So when you subset, you get rid of the fonts not used and the characters not used. Most of the time, you will get rid of the bold italic version of a font. You can cut down the size of CharisSIL (on average) to between 200-300K verses about 1.2-1.3MB per font file. As for the large graphics, this is because people read tablets with high resolution screens and tiny 800x600 sized graphics just don't cut it all that well. | 
|   |   | 
|  07-18-2014, 03:07 AM | #15 | 
| Addict            Posts: 294 Karma: 107414 Join Date: May 2013 Device: Kobo Glo | 
			
			Thanks everybody for the responses! I am sorry I posted in the wrong forum - but it was because though I personally use epubs, I assumed this was a universal problem that plagued all formats. I _do_ try to find the highest-res cover art that I can for all my books, but when converted to B/W they don't take up all that much space. I'll experiment a little with removing embedded fonts with Calibre, but cleaning up bloated CSS styles for 900 books is not going to happen :-) | 
|   |   | 
|  | 
| 
 | 
|  Similar Threads | ||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| Calibre ebook-viewer.exe changes EPUB file sizes? | avid01 | Calibre | 23 | 04-11-2018 04:24 AM | 
| File sizes - why the difference? | Araucaria | Sigil | 4 | 11-22-2011 07:52 PM | 
| Book/file sizes in Calibre | cavgirl | Calibre | 2 | 11-12-2010 08:15 PM | 
| Epub file sizes | jerryleejr | Sony Reader | 6 | 07-28-2008 03:09 PM |