Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Formats > Other formats

Notices

Reply
 
Thread Tools Search this Thread
Old 04-07-2009, 09:01 PM   #1
pruss
Evangelist
pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.
 
Posts: 461
Karma: 819417
Join Date: Nov 2004
explode - plucker to html

I've finally updated the explode utility for compatibility with newer versions of the Plucker format, and made it work a bit smarter. Here are Windows binaries, and source code:
http://www.1src.com/freeware/fileinfo.php?id=1916

To run, do:
explode --directory=outdir filename.pdb

Then your home page is outdir\default.html

You can adjust the jpeg compression with --jpeg-quality=x (where x ranges from 0 to 100).

No direct epub support yet. I might add it one day. But I don't have an epub capable device, so my motivation is low.

This is a kind of sad thing for me. I've been involved in the Plucker project for five or six years, and now it's looking like the end of the format is in sight--it's time to release converters to other formats so people can migrate their converters. :-( Oh well. I still do all my ebook reading on my TX, and 95% of that with Plucker.
pruss is offline   Reply With Quote
Old 04-08-2009, 11:38 AM   #2
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Thanks a lot! It worked like a charm!

Also, I had to login (thanks to my Sony Clie TH-55 days) to get your upload at 1src.com. Most won't be so lucky.
So with the author's permission (pruss), I attach explode v0.11 to this thread for others to have access to it.
For years, everyone kept telling me that plucker to .html was not possible; and thanks to you, they are ALL wrong now!

I converted the 43MB Wikipedia 2006.pdb in plucker format quickly as follows:
Code:
E:\ebooks> explode --directory=Wikipedia Wikipedia.pdb
I'm converting it now to .imp and from there to .mobi/.prc and .epub, so thanks again for your (much appreciated) update to explode.c.
Attached Files
File Type: zip explode.zip (1.78 MB, 1033 views)

Last edited by nrapallo; 04-09-2009 at 11:57 PM. Reason: added link
nrapallo is offline   Reply With Quote
Old 04-08-2009, 11:50 AM   #3
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Quote:
Originally Posted by nrapallo
I'm converting it now to .imp and from there to .mobi/.prc and .epub, so thanks again for your (much appreciated) update to explode.c.
WOW!!!!

It converted on it's first run to my REB1200 .imp format, but the resulting file is TOO huge to be used on my ebook reader. After about 4.5 hours of processing, my conversion software (eBook Publisher) created a 151MB file!

The PC viewer can load it, so I attach some screenshots as the "proof" of concept. Hey, it was fun to even be able to try this.

Now onto round two; decrease the resulting .imp file size by eliminating some or all of the images and/or repetitive text. I would need a 10 fold reduction to make this useable and fear that that may not be feasible. But I WILL try.... — (to boldly go where no man has gone before)
Attached Thumbnails
Click image for larger version

Name:	Wikipedia_2006_REB1200-1.jpg
Views:	914
Size:	158.3 KB
ID:	27301   Click image for larger version

Name:	Wikipedia_2006_REB1200-2.jpg
Views:	927
Size:	66.1 KB
ID:	27302   Click image for larger version

Name:	Wikipedia_2006_REB1200-3.jpg
Views:	895
Size:	77.4 KB
ID:	27303   Click image for larger version

Name:	Wikipedia_2006_REB1200-4.jpg
Views:	924
Size:	140.6 KB
ID:	27304   Click image for larger version

Name:	Wikipedia_2006_REB1200-5.jpg
Views:	943
Size:	110.4 KB
ID:	27305  
nrapallo is offline   Reply With Quote
Old 04-08-2009, 11:52 AM   #4
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Quote:
Originally Posted by nrapallo
I'm converting it now to .imp and from there to .mobi/.prc and .epub, so thanks again for your (much appreciated) update to explode.c.
OK, I've completed the round one conversions of this to .mobi/.prc and .epub and the results were similar to my previous conversion to .imp.

Using Mobipocket Creator with standard compression resulted in a 138MB .prc whereas using calibre resulted in a 141MB .epub (see some sample ADE screenshots). In all three conversions, the images seem to be stored in full 16M color resolution (as extracted from explode.exe) and the culprit for the tremendous filesize increase. The (compressed) text occupied only about 10% (12 to 15MB) of the resulting ebooks.

Time to focus on image resolution reduction, but I may ( will! ) not probably get ANY better compression result than Plucker's 43MB .pdb, so this is an exercise in futility, but I don't mind...
Attached Thumbnails
Click image for larger version

Name:	Wikipedia_epub1.jpg
Views:	869
Size:	163.0 KB
ID:	27306   Click image for larger version

Name:	Wikipedia_epub2.jpg
Views:	890
Size:	51.0 KB
ID:	27307   Click image for larger version

Name:	Wikipedia_epub3.jpg
Views:	867
Size:	69.0 KB
ID:	27308   Click image for larger version

Name:	Wikipedia_epub4.jpg
Views:	885
Size:	156.9 KB
ID:	27309   Click image for larger version

Name:	Wikipedia_epub5.jpg
Views:	896
Size:	132.4 KB
ID:	27310  
nrapallo is offline   Reply With Quote
Old 04-08-2009, 11:53 AM   #5
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Quote:
Originally Posted by pruss
You could also reduce the compression accuracy of the jpeg files. I've just posted a new version of explode with a --jpeg-quality=xxx switch (untested). I noticed that the previous version always set the quality to 100 (!). If you set it to 85 or so, you might find a nice improvement, unless of course your further conversion tools themselves reduce the jpeg quality.
Actually, I was thinking of using instead (in place) .gif images with 256-colors (yields about 50% filesize savings) and will see if using a 16-color/grayscale .gif will reduce this any further. Worth a shot!

However, I will also try your new --jpeg-quality switch. "Whatever works best" is my motto...

Thanks again!
nrapallo is offline   Reply With Quote
Old 04-08-2009, 12:00 PM   #6
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Using 4-bit .gif improved the filesize of the final ebook a lot. The .imp now is 50 MB, .epub now 43 MB and .prc (stnd) is 44 MB & .prc (high compression) is 37 MB. There was very little image quality loss reducing from 16M colors to 16 colors.

Still too big to be useful.

If I exclude images, then the resulting filesizes are: 23 MB .imp, 14 MB .epub and 16 MB .prc (stnd).
much more useable...

I'll experiment a bit more and see if using the Wikipedia 2006 CD source directly instead of the plucker extracted .html/images works just as good or better.

Stay tuned...

Last edited by nrapallo; 04-08-2009 at 01:44 PM. Reason: typo
nrapallo is offline   Reply With Quote
Old 04-09-2009, 09:20 AM   #7
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Quote:
Originally Posted by nrapallo View Post
Using 4-bit .gif improved the filesize of the final ebook a lot. The .imp now is 50 MB, .epub now 43 MB and .prc (stnd) is 44 MB & .prc (high compression) is 37 MB. There was very little image quality loss reducing from 16M colors to 16 colors.

Still too big to be useful.
WOW!! WOW!! WOW!! I just tried that 50 MB Wikipedia 2006 .imp file on my REB1200 ebook reader and it works flawlessly!!!! All the links, images and text (though needs to be better formatted) were all there and useable.

I had to use Impserve to load it and that took several minutes, but WOW!! WOW!! WOW!!

I'm a bit excited...

p.s. doesn't work on my EBW1150, though!
nrapallo is offline   Reply With Quote
Old 04-10-2009, 11:04 AM   #8
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Quote:
Originally Posted by pruss View Post
I've finally updated the explode utility for compatibility with newer versions of the Plucker format, and made it work a bit smarter. Here are Windows binaries, and source code:
http://www.1src.com/freeware/fileinfo.php?id=1916
Amazing, this software has already been downloaded over 5,000 times!!! Check it out.

I guess it sure filled a void!
nrapallo is offline   Reply With Quote
Old 08-15-2009, 12:34 PM   #9
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Quote:
Originally Posted by nrapallo View Post
Using 4-bit .gif improved the filesize of the final ebook a lot. The .imp now is 50 MB, .epub now 43 MB and .prc (stnd) is 44 MB & .prc (high compression) is 37 MB. There was very little image quality loss reducing from 16M colors to 16 colors.

Still too big to be useful.

If I exclude images, then the resulting filesizes are: 23 MB .imp, 14 MB .epub and 16 MB .prc (stnd).
much more useable...

I'll experiment a bit more and see if using the Wikipedia 2006 CD source directly instead of the plucker extracted .html/images works just as good or better.

Stay tuned...
Some HUGE .prc, .epub and REB1200 .imp ebooks made from that 2006 Wikipedia CD Selection from the SOS Children website.

If I exclude images, then the resulting ebook filesizes are: 22 MB REB1200 .imp, 12 MB .epub and 9 MB .prc (high-compression) and may be much more useful. A 15 MB .prc (standard compression) is also available for those readers that cannot handle high (Huff-Dic) compression.

Just a test... ...Try this for more previews of my 2006 Wikipedia work-in-progress ebooks ...

Last edited by nrapallo; 08-20-2009 at 04:33 PM.
nrapallo is offline   Reply With Quote
Old 08-17-2009, 04:17 AM   #10
Blue Tyson
Blue Captain
Blue Tyson ought to be getting tired of karma fortunes by now.Blue Tyson ought to be getting tired of karma fortunes by now.Blue Tyson ought to be getting tired of karma fortunes by now.Blue Tyson ought to be getting tired of karma fortunes by now.Blue Tyson ought to be getting tired of karma fortunes by now.Blue Tyson ought to be getting tired of karma fortunes by now.Blue Tyson ought to be getting tired of karma fortunes by now.Blue Tyson ought to be getting tired of karma fortunes by now.Blue Tyson ought to be getting tired of karma fortunes by now.Blue Tyson ought to be getting tired of karma fortunes by now.Blue Tyson ought to be getting tired of karma fortunes by now.
 
Blue Tyson's Avatar
 
Posts: 1,595
Karma: 5000236
Join Date: Feb 2007
Location: Australia
Device: Kindle Keyboard 3G,Huawei Ideos X3,Kobo Mini
Quote:
Originally Posted by pruss View Post
I've finally updated the explode utility for compatibility with newer versions of the Plucker format, and made it work a bit smarter. Here are Windows binaries, and source code:
http://www.1src.com/freeware/fileinfo.php?id=1916

To run, do:
explode --directory=outdir filename.pdb

Then your home page is outdir\default.html

You can adjust the jpeg compression with --jpeg-quality=x (where x ranges from 0 to 100).

No direct epub support yet. I might add it one day. But I don't have an epub capable device, so my motivation is low.

This is a kind of sad thing for me. I've been involved in the Plucker project for five or six years, and now it's looking like the end of the format is in sight--it's time to release converters to other formats so people can migrate their converters. :-( Oh well. I still do all my ebook reading on my TX, and 95% of that with Plucker.
I use Plucker all the time, so thanks very much. This is a great idea.
Blue Tyson is offline   Reply With Quote
Old 08-20-2009, 04:40 PM   #11
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
NEW 2006 Wikipedia CD Selection ebook (Letter A entries only)

EDIT: I've stopped hijacking this thread and now have a new thread called
Creating HUGE ebooks from the 2006 Wikipedia CD Selection to continue any 2006 Wikipedia posts.


Please only discuss here any issues you may have with explode.c issues...

Last edited by nrapallo; 08-20-2009 at 11:05 PM. Reason: see new thread for any 2006 Wikipedia discusssions...
nrapallo is offline   Reply With Quote
Old 02-23-2013, 10:55 PM   #12
pruss
Evangelist
pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.pruss ought to be getting tired of karma fortunes by now.
 
Posts: 461
Karma: 819417
Join Date: Nov 2004
I've made some more updates to explode, and it's now available here.
pruss is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Plucker: can't right-click on html file to convert... jplowman Reading and Management 1 08-08-2009 11:21 PM
Plucker Fails to convert HTML docs via Word evwool Reading and Management 8 05-10-2009 01:23 PM
9/11 Commission Report in Plucker, iSilo and HTML format hacker Deals and Resources (No Self-Promotion or Affiliate Links) 3 05-04-2009 04:04 AM
Explode and Implode an ePub? wallcraft ePub 5 09-12-2008 09:47 AM
Books in HTML/Plucker format? AceHarddrive Reading Recommendations 4 12-16-2006 05:21 PM


All times are GMT -4. The time now is 07:32 AM.


MobileRead.com is a privately owned, operated and funded community.