07-08-2011, 05:01 AM | #1 |
.~^пиратка^~.
Posts: 238
Karma: 14000
Join Date: Sep 2009
Location: Ask NSA...
Device: Onyx Boox M92
|
400 HTML files with images; how do I make a book?
I extracted the HTML from a CHM file. So far, so good.
Now I want to turn the 400 individual HTML pages that I've got in a folder, into an open source ebook format, like epub, alternatively just a pdf. My reader is an IREX 800. Really grateful for advice! |
07-08-2011, 11:41 AM | #2 |
Evangelist
Posts: 450
Karma: 343115
Join Date: Nov 2009
Location: Romania
Device: PW2 2014
|
If it's HTML you're halfway there. You can use Sigil to turn it into an ePub.
|
Advert | |
|
07-08-2011, 01:48 PM | #3 |
Wizard
Posts: 3,388
Karma: 14190103
Join Date: Jun 2009
Location: Berlin
Device: Cybook, iRex, PB, Onyx
|
Martienne, have you tried Mobipocket Creator? It can convert a chm-file into a mobi-file (which you could further convert into epub).
|
07-08-2011, 04:31 PM | #4 |
Connoisseur
Posts: 61
Karma: 12096
Join Date: Sep 2010
Location: Tasmania
Device: Sony PRS 650
|
Try Calibre which lets you input a CHM file and outputs in a format of your choosing. Suggest an epub format then go into Sigil to edit it as required.
|
07-09-2011, 04:34 AM | #5 |
.~^пиратка^~.
Posts: 238
Karma: 14000
Join Date: Sep 2009
Location: Ask NSA...
Device: Onyx Boox M92
|
Thanks for the tips about Sigill, I'll check that out!
While trying to convert directly from CHM to PDF or DOC I had some problems with crashes, freezes etc, or the the program was only converting the FIRST chm page, or only chapter by chapter. That's why I decided to do it in two steps. Just couldn't find anything that did a conversion straight from CHM and didn't quit in the middle. Although I am doing this on my laptop, it's very good spec and I was really surprised it cut out. I had an older version of Calibre though, and now I have upgraded, so maybe it would work better. Anyway I found a way to convert to Word, and did some editing there (took out the HTML style navigation bars.) Then I saved as PDF and all seems well. Last edited by martienne; 07-09-2011 at 04:42 AM. |
Advert | |
|
07-09-2011, 02:15 PM | #6 |
Connoisseur
Posts: 61
Karma: 12096
Join Date: Sep 2010
Location: Tasmania
Device: Sony PRS 650
|
"Anyway I found a way to convert to Word..."
Could you tell us what that was? Did you find something that would batch merge HTML files into one Word doc? |
10-16-2011, 11:39 AM | #7 |
Zealot
Posts: 133
Karma: 2142
Join Date: Oct 2011
Location: Spain
Device: I'm an iRex man: 8x DR1000S, 4x DR800SG, 4x DR800S
|
HTML concatenator (command-line)
I've just run into the same problem and I've coded a simple command line HTML concatenator, so you simply issue
HTMLCat book.htm part1.htm part2.htm part3.htm ... at a command prompt and it merges part1, part2, part3, etc. in the specified order into book.htm. Alternatively, if all your filenames are in alphabetical order you can simplify and do things like HTMLCat book.htm cover.htm chapter*.htm The thing is still a bit crude (it simply keeps the HTML <head> of the first file and concatenates all of the <body> contents of every file after that, so all files must use the same encoding, no ID attributes are checked for duplicity, etc.) but otherwise fully functional. Anyone interested? It's writen in REXX, so it can be run on Windows, Linux and pretty much everything else right away with Regina REXX Last edited by MrWarper; 10-16-2011 at 11:43 AM. Reason: add title 0:) |
10-19-2011, 12:51 PM | #8 |
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
There are already such tools. Check our wiki under HTML.
Dale |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Any way to make images exact size using HTML? | khutch | Calibre | 4 | 10-03-2010 11:59 AM |
condense multible .mobi files to make one readable book on Kindle 2 | 18Coaster | Kindle Formats | 3 | 09-10-2010 06:48 PM |
Extracting html/images from within .imp files! | nrapallo | IMP | 12 | 03-10-2009 10:22 PM |
Html files with images, is it possible? | megaman | Bookeen | 4 | 10-27-2008 06:28 AM |
Problems with Book Designer and some HTML-Files | SteffenH | Sony Reader | 19 | 09-17-2007 06:39 PM |