03-28-2010, 01:24 PM | #31 |
Junior Member
Posts: 5
Karma: 10
Join Date: Mar 2010
Device: Kindle
|
I'm reviving this old thread, because I have the same need to merge/join many HTML files of a non-DRM protected e-Book into one readable format, such as RTF.
I first downloaded the prog "Merger", but it too crashed on me each time I ran it (as I read it had happened in another thread on MobileRead). Neither Txtcollector nor BookDesigner, which incidentally has a very unuser-friendly interface, helped me out. Finally, I decided to follow the detailed instructions left by Calibre's creator, Kovidgoyal. Not being very computer-savvy, it took me a while to figure out his no-doubt simple instructions. I even made a TOC, without needing to, as I have an html start page which presumably acts as one. It points to the other pages. I uploaded the "start_here" unto the GUI and saved the zip file with the OPF unto my desktop. I used Mobipockets to open the OPF, but it just doesn't do anything. Nothing loads. I'm stumped. Can anyone help me out? Thanks so much for any help. P.S.: I have since downloaded iterati's VHtmlMerger, and though it was the easiest of all the progs, I still don't have a good end result. The output file has no format, so I tried naming it "One.html", also .txt, .rtf, etc. It simply has code, not the e-Book contents. Please note that I also have the files to be joined in PDF, as well as HTML, in case that sparks any ideas. -- WinXP S3 Calibre 0.6 45 MobiPocket Creator Publisher 4.2 B41 EDIT: Solution BELOW. https://www.mobileread.com/forums/sho...2&postcount=38 Last edited by Bookeee; 03-29-2010 at 04:53 PM. |
03-28-2010, 04:46 PM | #32 | |
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
Quote:
Dale |
|
Advert | |
|
03-28-2010, 07:33 PM | #33 |
Junior Member
Posts: 5
Karma: 10
Join Date: Mar 2010
Device: Kindle
|
Thanks for the attempted cure, Dale. That was one of the first things I tried to do, before posting -- I replicated it just now, but no luck.
WARNING: Could not convert some books: Could not convert 1 of 1 books, because no suitable source format was found. |
03-28-2010, 08:44 PM | #34 |
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
Were the source files in html format?
|
03-28-2010, 11:25 PM | #35 |
Junior Member
Posts: 5
Karma: 10
Join Date: Mar 2010
Device: Kindle
|
|
Advert | |
|
03-29-2010, 11:12 AM | #36 | |
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
Quote:
Dale |
|
03-29-2010, 04:02 PM | #37 | |
Junior Member
Posts: 5
Karma: 10
Join Date: Mar 2010
Device: Kindle
|
Quote:
It created a 37MB file in zip format, as suggested, but when converted to any other format, only the frontispiece (in writing) showed up. The subsequent pages didn't load. So I'm holding out on someone giving me info/a fix using something other than Calibre. TIA. |
|
03-29-2010, 04:49 PM | #38 |
Junior Member
Posts: 5
Karma: 10
Join Date: Mar 2010
Device: Kindle
|
SUCCESS!
I leave instructions here, for the use of others. 1- Use Adobe Acrobat Pro (mine is 9.3.1). 2- Go to "File" 3- Scroll to "Create PDF" 4- Tab to "Merge Files into a Single PDF" 5- My default settings on that pop-up have "Single PDF" bubbled on top, and medium-sized output (bottom right) 6- Click on "+AddFiles". Again below that 7- Click around until you see folder with the PDF files you want merged 8- NB: If you have HTML files, you will have to order them up/down numerically, as they have that unfortunate tendency to go by all 1s/2s/3s. Those with iPods will understand. For best results, use other PDFs 9- Click on (bottom right) "Combine Files" when ready 10- Acrobat will check security settings, then combine/merge the PDFs into one 11- Default name is "Binder1", but can be renamed 12- Perfect PDF copy of (non-DRMed) ebook results -- even Next and Previous Pages are gone! (I then used Mobipocket Creator Publisher Edition for a PRC/Kindle formatted ebook. Flawless) Good luck, and thanks again to Dale for the help. Last edited by Bookeee; 03-29-2010 at 04:55 PM. |
05-03-2010, 09:21 AM | #39 |
Enthusiast
Posts: 23
Karma: 66956
Join Date: Feb 2010
Location: Conn. USA
Device: Kindle 3, Kindle PW
|
Quick way to create index.html for multiple files
|
08-23-2010, 05:04 PM | #40 |
ZCD BombShel
Posts: 4,793
Karma: 8293322
Join Date: Jan 2009
Location: The Frozen North (aka Illinois, USA)
Device: iPad, STB Kindle Oasis
|
I'm resurrecting this again - and if this is a really stupid question, then forgive me. Is there a way to do something similar with XHTML files?
|
08-24-2010, 12:50 PM | #41 | |
Grand Sorcerer
Posts: 11,470
Karma: 13095790
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2 & Air 2, iPhone 7
|
Quote:
Dale |
|
10-07-2011, 04:53 AM | #42 | |
Addict
Posts: 352
Karma: 103850
Join Date: Apr 2011
Device: Kindle NT
|
Quote:
|
|
10-16-2011, 11:43 AM | #43 |
Zealot
Posts: 133
Karma: 2142
Join Date: Oct 2011
Location: Spain
Device: I'm an iRex man: 8x DR1000S, 4x DR800SG, 4x DR800S
|
I've just run into the same problem and I've coded a simple command line HTML concatenator, so you simply issue
HTMLCat book.htm part1.htm part2.htm part3.htm ... at a command prompt and it merges part1, part2, part3, etc. in the specified order into book.htm. Alternatively, if all your filenames are in alphabetical order you can simplify and do things like HTMLCat book.htm cover.htm chapter*.htm The thing is still a bit crude (it simply keeps the HTML <head> of the first file and concatenates all of the <body> contents of every file after that, so all files must use the same encoding, no ID attributes are checked for duplicity, etc.) but otherwise fully functional. Anyone interested? It's writen in REXX, so it can be run on Windows, Linux and pretty much everything else right away with Regina REXX |
10-16-2011, 07:08 PM | #44 |
Fool
Posts: 380
Karma: 3557934
Join Date: Feb 2003
Device: Kindle Voyage, Kindle PW1, Kobo Glo HD, Nook Glowlight Plus ...
|
Another great tool for just merging html files (windows only) is vHtmlMerger.
Small, simple, self-explanatory, free. I've used it on appropriate occasion for years. http://iterati.org/ebookTools/vHtmlMerger/Default.aspx |
01-01-2012, 01:43 PM | #45 |
Junior Member
Posts: 1
Karma: 10
Join Date: Jan 2012
Device: Kindle
|
I was searching for how to merge html files and found some very useful info on this forum. I thought I would share my experiences on making this all work.
I have a kindle, so it is annoying to transfer multiple html chapters as the filing system is cumbersome. First things first - DownThemAll is very handy to download all chapters on a website using firefox. I first tried using the vhtmlmerger software to merge. This worked, but had encoding problems - I think it saves the merged file with different encoding, so the apostrophes etc. come out as symbols, which is irritating when reading. I then used TXTCollector - this did not have the encoding issues. However, it introduces some bugs into the code, which prevented either Calibre or the Kindle document service from converting to .mobi or .azw (respectively). So I copied and pasted the merged htm file into Word and then deleted the buggy bits. I then re-saved as htm. I was then able to convert to .mobi fine using Calibre. Took me a while to get it right, but I now have a very simple process: 1) DownThemAll 2) TXTCollector 3) Delete buggy bits in word and re-save 4) Calibre |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
[Old Thread] Joining multiple html files | RosanaE | Calibre | 4 | 04-22-2011 06:56 PM |
TOC filter and Multiple HTML Files | Beedrew | Calibre | 1 | 07-20-2010 10:32 PM |
Converting multiple HTML files into a single hyperlinked PDF? | Jürgen Hubert | Reading and Management | 6 | 01-11-2010 07:44 AM |
Merging several Html files into one file | nesseainie | Calibre | 8 | 06-03-2009 02:06 PM |
Multiple HTML Files | JJH1947 | Calibre | 4 | 04-07-2009 10:24 AM |