I was searching for how to merge html files and found some very useful info on this forum. I thought I would share my experiences on making this all work.
I have a kindle, so it is annoying to transfer multiple html chapters as the filing system is cumbersome.
First things first - DownThemAll is very handy to download all chapters on a website using firefox.
I first tried using the vhtmlmerger software to merge. This worked, but had encoding problems - I think it saves the merged file with different encoding, so the apostrophes etc. come out as symbols, which is irritating when reading.
I then used TXTCollector - this did not have the encoding issues. However, it introduces some bugs into the code, which prevented either Calibre or the Kindle document service from converting to .mobi or .azw (respectively).
So I copied and pasted the merged htm file into Word and then deleted the buggy bits. I then re-saved as htm. I was then able to convert to .mobi fine using Calibre.
Took me a while to get it right, but I now have a very simple process:
1) DownThemAll
2) TXTCollector
3) Delete buggy bits in word and re-save
4) Calibre
|