Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Software > Sigil

Notices

Reply
 
Thread Tools Search this Thread
Old 11-15-2013, 03:45 PM   #1
CaptainTenacity
Junior Member
CaptainTenacity began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Nov 2013
Device: Kindle
Epub file far too large, advice?

Hi all!

I'm currently working on a ebook project with Sigil. Basically, I'm transcribing the contents of a wiki to make it into a epub for offline reading. The wiki has a large amount of content. Not only are there over 1000 pages in this praticualr iteration, there are also a lot of picture files.

My problem is that my file is now hopelessly bloated to the extent that it takes Sigil 10-15 minutes to open it. It's current size is 113mb. Even accounting for all the pictures files, that's still far to large. A friend has suggested this bloating could be down redundant CSS and code. However, I know little about such things!

My methodology has been very straightforward, literally to the extent that I just copy and paste the web page and rearrange it so that it all sits well.

Is there any advice you could give a Sigil-newbie on helping slim down this file? Any and all are appreciated!
CaptainTenacity is offline   Reply With Quote
Old 11-15-2013, 03:55 PM   #2
Toxaris
Wizard
Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.Toxaris ought to be getting tired of karma fortunes by now.
 
Toxaris's Avatar
 
Posts: 3,183
Karma: 7180223
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-300, PRS-T1
The issue is probably not the CSS and the code. Sure, it will help but don't expect miracles. I think the biggest part are images. See if you can live without or resize them to a smaller size.
Toxaris is offline   Reply With Quote
Old 11-15-2013, 03:58 PM   #3
CaptainTenacity
Junior Member
CaptainTenacity began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Nov 2013
Device: Kindle
Quote:
Originally Posted by Toxaris View Post
The issue is probably not the CSS and the code. Sure, it will help but don't expect miracles. I think the biggest part are images. See if you can live without or resize them to a smaller size.
I originally though this, so I deleted around 300 pages as a test. It only reduced the file size by 4 mb, with is surely not enough to account for the sizes of the image files included therein?
CaptainTenacity is offline   Reply With Quote
Old 11-15-2013, 03:58 PM   #4
theducks
Grand Sorcerer
theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.
 
theducks's Avatar
 
Posts: 15,270
Karma: 6022733
Join Date: Aug 2009
Location: (The original) Silicon Valley, USA
Device: Galaxy Tab 2, Astak Pocket Pro, K4NT
Calibre has a wiki (scraper) PI http://www.mobileread.com/forums/sho...d.php?t=183333

Maybe starting with a EPUB will make for a slight tighter code.
Not much you can do with images except reduce size or resolution
theducks is offline   Reply With Quote
Old 11-15-2013, 04:06 PM   #5
Hitch
Bookmaker & Cat Slave
Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.
 
Hitch's Avatar
 
Posts: 2,678
Karma: 15513561
Join Date: Apr 2010
Location: Phoenix, AZ
Device: Kindle2, iPad, KindleFire and NookColor
Quote:
Originally Posted by CaptainTenacity View Post
Hi all!

I'm currently working on a ebook project with Sigil. Basically, I'm transcribing the contents of a wiki to make it into a epub for offline reading. The wiki has a large amount of content. Not only are there over 1000 pages in this praticualr iteration, there are also a lot of picture files.

My problem is that my file is now hopelessly bloated to the extent that it takes Sigil 10-15 minutes to open it. It's current size is 113mb. Even accounting for all the pictures files, that's still far to large. A friend has suggested this bloating could be down redundant CSS and code. However, I know little about such things!

My methodology has been very straightforward, literally to the extent that I just copy and paste the web page and rearrange it so that it all sits well.

Is there any advice you could give a Sigil-newbie on helping slim down this file? Any and all are appreciated!
Well, that's pretty damn bloated. First, you probably have images in either at a high-rez, or simply too large (are you using display techniques to downsize images)? However, more importantly, are you trying to put everything in one HTML (or XHTML) "file?" In other words, do you have the whole damn wiki in one single file, instead of breaking it up into, say, topics? Or a range of topics, in a "chapter?"

If you break the topics up, and put the wiki in "chapters" (files) inside the ePUB, it will certainly open faster and more efficiently. As it is now, it'll never work in any ereader I know of, if the single file is 113mb. Many older readers have a 256K limit, if memory serves, per file, which means, per chapter, or file inside the ePUB.

Have you broken it up into workable chapters?

Hitch
Hitch is online now   Reply With Quote
Old 11-15-2013, 04:16 PM   #6
CaptainTenacity
Junior Member
CaptainTenacity began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Nov 2013
Device: Kindle
Quote:
Originally Posted by Hitch View Post
Well, that's pretty damn bloated. First, you probably have images in either at a high-rez, or simply too large (are you using display techniques to downsize images)? However, more importantly, are you trying to put everything in one HTML (or XHTML) "file?" In other words, do you have the whole damn wiki in one single file, instead of breaking it up into, say, topics? Or a range of topics, in a "chapter?"

If you break the topics up, and put the wiki in "chapters" (files) inside the ePUB, it will certainly open faster and more efficiently. As it is now, it'll never work in any ereader I know of, if the single file is 113mb. Many older readers have a 256K limit, if memory serves, per file, which means, per chapter, or file inside the ePUB.

Have you broken it up into workable chapters?

Hitch
Thanks Hitch.

No, each page/article is one single html file, so that each can get it's separate chapter.

I decided to split the file in half, and remove the unused media file from each. It cut the first 1000 pages down to 77mb, and the other 350 from the second part of the book down to 37mb. It would seem that the picture files are indeed the issue. I'll probably have to go back and resize each individually, unless Sigil or another program has a batch resizer that could do it for me?
CaptainTenacity is offline   Reply With Quote
Old 11-15-2013, 04:43 PM   #7
DaleDe
Grand Sorcerer
DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.
 
DaleDe's Avatar
 
Posts: 9,782
Karma: 5137308
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2
Quote:
Originally Posted by CaptainTenacity View Post
Thanks Hitch.

No, each page/article is one single html file, so that each can get it's separate chapter.

I decided to split the file in half, and remove the unused media file from each. It cut the first 1000 pages down to 77mb, and the other 350 from the second part of the book down to 37mb. It would seem that the picture files are indeed the issue. I'll probably have to go back and resize each individually, unless Sigil or another program has a batch resizer that could do it for me?
IrfanView can do batch resizing. It is a free download.

Dale
DaleDe is offline   Reply With Quote
Old 11-15-2013, 04:48 PM   #8
Hitch
Bookmaker & Cat Slave
Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.
 
Hitch's Avatar
 
Posts: 2,678
Karma: 15513561
Join Date: Apr 2010
Location: Phoenix, AZ
Device: Kindle2, iPad, KindleFire and NookColor
Quote:
Originally Posted by CaptainTenacity View Post
Thanks Hitch.

No, each page/article is one single html file, so that each can get it's separate chapter.

I decided to split the file in half, and remove the unused media file from each. It cut the first 1000 pages down to 77mb, and the other 350 from the second part of the book down to 37mb. It would seem that the picture files are indeed the issue. I'll probably have to go back and resize each individually, unless Sigil or another program has a batch resizer that could do it for me?
Y'know, I'm sorry, but that still seems a bit hinky. You copy-and-pasted Internet pages into Sigil, is that right? Maybe you would be willing to paste the HTML of a given page (try to make it a short one) here, so we could all see it, in a code block? I don't know if we can help; thousands of pages with images may just well be 118mb. But we can always give it a go.

Hitch
Hitch is online now   Reply With Quote
Old 11-15-2013, 05:05 PM   #9
DiapDealer
Grand Sorcerer
DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.
 
DiapDealer's Avatar
 
Posts: 9,540
Karma: 44104176
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
Far be it from me to interfere in a personal project, but I would think an "offline" wiki sort of defeats the whole wiki purpose. A wiki is a constantly evolving collaboration. How will the edits/additions/deletions propogate to this static epub version of the real thing? Or is it a one-time snapshot?

Carry on.
DiapDealer is offline   Reply With Quote
Old 11-15-2013, 06:33 PM   #10
dwig
Guru
dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.dwig ought to be getting tired of karma fortunes by now.
 
dwig's Avatar
 
Posts: 996
Karma: 1843322
Join Date: Dec 2004
Location: Paradise (Key West, FL)
Device: Current:Dell Venue 8 Pro - Retired:Kindle 3, Clie UX50, T415, ...
I would suggest that before you slap this ePub with a batch resize you should take a look at the image sizes and see if there is a set, possibly small in number, of "bad apples".

Since an ePub is a ZIP container you could do the following:
  1. Create a copy of the ePub
  2. Change the copy's extension to ".zip"
  3. either Extract the contents or simply view the contents of the ZIP (depends on the abilities of your OS)
  4. Sort the "/images" folder by size
  5. View the sizes

With some OSs you'll have to Extract the files to get full control of how the folders are displayed. You might find that there are only a limited number of images that need most of the work.
dwig is offline   Reply With Quote
Old 11-16-2013, 10:25 PM   #11
phossler
Addict
phossler can understand the language of future parallel dimensionsphossler can understand the language of future parallel dimensionsphossler can understand the language of future parallel dimensionsphossler can understand the language of future parallel dimensionsphossler can understand the language of future parallel dimensionsphossler can understand the language of future parallel dimensionsphossler can understand the language of future parallel dimensionsphossler can understand the language of future parallel dimensionsphossler can understand the language of future parallel dimensionsphossler can understand the language of future parallel dimensionsphossler can understand the language of future parallel dimensions
 
Posts: 371
Karma: 51406
Join Date: Jan 2009
Location: Valley Forge, PA, USA
Device: kindle
@Captain

Long shot, but when you open the big ePub in Sigil, are there any files in the Misc part in the Folder tree on the left?

How about Fonts, or Audio or Video files that might be embedded in the epub

Paul
phossler is offline   Reply With Quote
Old 11-19-2013, 07:00 AM   #12
exaltedwombat
Evangelist
exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.
 
Posts: 462
Karma: 1703930
Join Date: Nov 2011
Device: none
Quote:
Originally Posted by CaptainTenacity View Post
I originally though this, so I deleted around 300 pages as a test. It only reduced the file size by 4 mb, with is surely not enough to account for the sizes of the image files included therein?
Pictures live in the Images folder, the HTML files contain references to them. Deleting pages will not remove the picture files.
exaltedwombat is offline   Reply With Quote
Old 11-19-2013, 02:11 PM   #13
Divingduck
Fanatic
Divingduck can talk all four legs off a donkey... then persuade it to go for a walk.Divingduck can talk all four legs off a donkey... then persuade it to go for a walk.Divingduck can talk all four legs off a donkey... then persuade it to go for a walk.Divingduck can talk all four legs off a donkey... then persuade it to go for a walk.Divingduck can talk all four legs off a donkey... then persuade it to go for a walk.Divingduck can talk all four legs off a donkey... then persuade it to go for a walk.Divingduck can talk all four legs off a donkey... then persuade it to go for a walk.Divingduck can talk all four legs off a donkey... then persuade it to go for a walk.Divingduck can talk all four legs off a donkey... then persuade it to go for a walk.Divingduck can talk all four legs off a donkey... then persuade it to go for a walk.Divingduck can talk all four legs off a donkey... then persuade it to go for a walk.
 
Posts: 562
Karma: 124000
Join Date: Nov 2010
Location: Germany
Device: Sony PRS-650
Why not using the reporting function in Sigil? There you can see where you have your big parts.
Attached Thumbnails
Click image for larger version

Name:	Aufzeichnen.JPG
Views:	96
Size:	60.4 KB
ID:	115434   Click image for larger version

Name:	Aufzeichnen1.JPG
Views:	90
Size:	59.2 KB
ID:	115435  
Divingduck is offline   Reply With Quote
Old 11-19-2013, 03:27 PM   #14
Hitch
Bookmaker & Cat Slave
Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.Hitch ought to be getting tired of karma fortunes by now.
 
Hitch's Avatar
 
Posts: 2,678
Karma: 15513561
Join Date: Apr 2010
Location: Phoenix, AZ
Device: Kindle2, iPad, KindleFire and NookColor
Quote:
Originally Posted by Divingduck View Post
Why not using the reporting function in Sigil? There you can see where you have your big parts.
Of course. Excellent idea.

Hitch
Hitch is online now   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Choose reader application per file - handling of very large epub files Clark G. Flipper Onyx Boox 2 06-17-2012 08:34 AM
Large File Conversion MOBI -> ePub Hangs at 67% Snauzoo Conversion 3 06-07-2011 02:03 PM
Large file convert ejacevich Calibre 2 09-29-2010 09:51 PM
LARGE pdf file taildragger-j3 Sony Reader 3 03-12-2010 09:48 AM
JBL- Calibre ePub sometimes = "File too Large" Error Ken Maltby Ectaco jetBook 13 01-09-2010 02:27 PM


All times are GMT -4. The time now is 10:44 PM.


MobileRead.com is a privately owned, operated and funded community.