12-01-2009, 09:34 AM | #1 |
Enthusiast
Posts: 34
Karma: 548
Join Date: Jul 2009
Device: iRex 1000S
|
Converting multiple HTML files into a single hyperlinked PDF?
Last year, I created an ebook version of the TV Tropes Wiki for the Kindle with the help of the MobiPocket Creator Professional by using the downloaded HTML files of the wiki as the base and loading them into the Creator (after using scripts to cut them in size).
This worked reasonably well, but the nested structure of individual entries wasn't really ideal for reading on my Kindle. So now that I've got a iRex DR1000S, I want to use a different format instead - PDF. Is there any program or script out there that takes multiple HTML files as input and turns them into a single PDF file? |
12-01-2009, 09:37 AM | #2 |
Wizard
Posts: 1,244
Karma: 3439432
Join Date: Feb 2008
Device: Amazon Kindle Paperwhite (300ppi), Samsung Galaxy Book 12
|
Adobe Acrobat has a feature for doing this --- the European Union even (used to?) uses it to make their style manual available as a .pdf
William |
Advert | |
|
12-01-2009, 09:41 AM | #3 | |
Enthusiast
Posts: 34
Karma: 548
Join Date: Jul 2009
Device: iRex 1000S
|
Quote:
Free software which also does this would be highly appreciated, although if nothing else helps I'll have to get something commercial instead. (Oh, and it must be possible to upload the component HTML files as batches - that wiki consists of more than 10,000 individual pages, and I have no interest in loading them into the program one at a time...) |
|
12-01-2009, 11:48 PM | #4 |
Wizard
Posts: 1,213
Karma: 12890
Join Date: Feb 2009
Location: Amherst, Massachusetts, USA
Device: Sony PRS-505
|
I would use Prince XML for this personally. I think it would do all you want. It's free for non-commercial use. What makes it particularly attractive is that, with the right settings, you'll get some relatively advanced typographical features like kerning/hinting, TeX-style end of line hyphenation, ligatures, etc.
|
01-11-2010, 07:30 AM | #5 | |
Enthusiast
Posts: 34
Karma: 548
Join Date: Jul 2009
Device: iRex 1000S
|
Quote:
Unfortunately, the conversion process stopped when the program ran out of memory when I used the command line interface to create a PDF based on the TV Tropes Wiki. I guess 10,000+ files were a few too many... |
|
Advert | |
|
01-11-2010, 07:38 AM | #6 |
Warrior Princess
Posts: 5,038
Karma: 9724231
Join Date: Sep 2009
Device: PRS-505; PRS-350, PRS-T1, iPad, Aura HD
|
Jurgen, this is an awesome project. T.V. tropes is like crack to me. I hope it works out!
|
01-11-2010, 07:44 AM | #7 | |
Enthusiast
Posts: 34
Karma: 548
Join Date: Jul 2009
Device: iRex 1000S
|
Quote:
I have managed to do the same for smaller wikis, so I know the basic principle is sound. It's just that the memory on my current computer is apparently insufficient to allow Prince run uninterrupted until it finishes, and I don't have the money to upgrade my computer sufficiently at the moment... (Though if anyone is interested, I can upload the Perl script I cobbled together and explain how it should work in practice...) |
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Merging multiple HTML files into one HTML file | skoobwoman | Workshop | 45 | 07-11-2014 10:46 AM |
[Old Thread] Joining multiple html files | RosanaE | Calibre | 4 | 04-22-2011 06:56 PM |
TOC filter and Multiple HTML Files | Beedrew | Calibre | 1 | 07-20-2010 10:32 PM |
multiple repeat error converting HTML to MOBI | moog | Calibre | 0 | 02-05-2010 01:03 PM |
Multiple HTML Files | JJH1947 | Calibre | 4 | 04-07-2009 10:24 AM |