10-28-2013, 10:32 PM   #1
slammerkin
Star-gazer
Posts: 20
Karma: 16700
Join Date: Jun 2012
Location: Tír na nÓg
Device: Samsung Galaxy Tab3 (Retired Pandigital)
Save/Convert entire website for epub

Is there a way to save an entire website, links and all, and then convert it to an epub?

Is this even possible?

To clarify: I'm off grid a lot, and in the middle of a book series. It has literally hundreds of characters, regions, etc., and I found a Wiki portal dedicated to it. Because it's a portal, and not a regular wiki page, the wiki's export-as-epub option isn't available.

I would like to save the whole portal. For example, there's a character list page, and each name links to its own page in the portal, so I'd like to save it and keep the links working.

If this weren't such a huge site, I'd just manually open each page and save it, but that would take days. I could save the website to my laptop with WinHTTrack, but I'm not sure what I would need to do once I have the website downloaded, to then send it to Calibre to convert. Is it even possible to do that? Without killing the links? (I'm referring to links that lead to other pages on the same website, not outside links).

Any information or advice would be hugely appreciated, but I know nothing about code, CSS sheets, or anything like that, so please bear in mind my technical ineptitude.

Thanks so much

Last edited by slammerkin; 10-28-2013 at 10:50 PM.
10-29-2013, 02:07 AM   #2
Tex2002ans
Wizard
Posts: 2,306
Karma: 13057279
Join Date: Jul 2012
Device: Kobo Forma, Nook
Quote:
Originally Posted by slammerkin
Is there a way to save an entire website, links and all, and then convert it to an epub?

You would need a web crawler program to pull all the HTML files. This is the one that I use:

http://www.httrack.com/

While it is pulling all of the files for you, it also rewrites all of the internal links to point to their local equivalents.

Quote:
Originally Posted by slammerkin
[...] to then send it to Calibre to convert. Is it even possible to do that? Without killing the links? (I'm referring to links that lead to other pages on the same website, not outside links).

... I haven't had good luck at all with feeding an entire site-rip through Calibre. I would recommend feeding the XHTML files into Sigil, and doing some Regex to fix up a few of the links and do other cleanup.
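
As a rough illustration of the kind of regex cleanup meant here, the Python sketch below flattens relative page links down to a bare file name. It assumes that, once imported into Sigil, the pages all sit side by side in one folder, so a link like ../chars/Bob.html should become Bob.html. The function name flatten_internal_links and that folder layout are assumptions made for the example, not details taken from the wiki in question. External links, same-page anchors, and non-page files such as stylesheets are left alone.

```python
import posixpath
import re

# Matches href="..." or href='...'; group 1 is the quote, group 2 the target.
HREF_RE = re.compile(r'''href=(["'])(.*?)\1''', re.IGNORECASE)

# Targets that are not local files: anything with a scheme (http:, mailto:),
# protocol-relative URLs (//host/...), and same-page anchors (#top).
SKIP_RE = re.compile(r"(?:[a-z][a-z0-9+.-]*:|//|#)", re.IGNORECASE)

# Only links to pages are flattened (assumed extensions).
PAGE_SUFFIXES = (".html", ".htm", ".xhtml")


def flatten_internal_links(html: str) -> str:
    """Rewrite relative page links so they point at a bare file name."""

    def fix(match):
        quote, target = match.group(1), match.group(2)
        if SKIP_RE.match(target):
            return match.group(0)  # external link or anchor: keep as-is
        path, sep, fragment = target.partition("#")
        name = posixpath.basename(path)
        if not name.lower().endswith(PAGE_SUFFIXES):
            return match.group(0)  # not a page (CSS, image, folder): keep
        return f"href={quote}{name}{sep}{fragment}{quote}"

    return HREF_RE.sub(fix, html)


if __name__ == "__main__":
    print(flatten_internal_links('<a href="../chars/Bob.html#bio">Bob</a>'))
```

In Sigil itself the same idea would be a single Find & Replace in regex mode; the script form is only handy if there are too many files to fix by hand.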

Depending on the difficulty of the site itself, this can be easy/fast, or it can be time-consuming. As long as you aren't aiming for 100% EPUB compliance (epubcheck will probably complain about a lot of stuff), you should be OK.

Quote:
Originally Posted by slammerkin
Any information or advice would be hugely appreciated, but I know nothing about code, CSS sheets, or anything like that, so please bear in mind my technical ineptitude.

Give a link to the Wiki. Perhaps someone here can be kind enough to help.
10-29-2013, 07:35 AM   #3
mrmikel
Color me gone
Posts: 2,089
Karma: 1445295
Join Date: Apr 2008
Location: Central Oregon Coast
Device: PRS-300
Unless you have to have it portable, it might be best to leave it as a stored web site on your computer.

HTTrack will, in theory, bring it all down into a directory on your computer. You will want to inspect that downloaded directory closely to make sure it got everything. I have found it does not always follow every link and can leave out some images.
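
Checking a big download by eye is tedious, so here is a minimal Python sketch of one way to automate the inspection. It walks the downloaded folder, reads every HTML page, and reports any local link or image whose file is not actually on disk. The function name find_missing_files and the folder name in the usage lines are made up for the example, and it assumes the pages use ordinary relative links of the kind HTTrack writes.

```python
import os
from html.parser import HTMLParser
from urllib.parse import unquote, urlsplit


class _RefCollector(HTMLParser):
    """Collects the href/src values found in one HTML page."""

    def __init__(self):
        super().__init__()
        self.refs = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in ("href", "src") and value:
                self.refs.append(value)


def find_missing_files(mirror_dir):
    """Return (page, reference) pairs whose local target does not exist."""
    missing = []
    for folder, _dirs, files in os.walk(mirror_dir):
        for name in files:
            if not name.lower().endswith((".html", ".htm")):
                continue
            page = os.path.join(folder, name)
            collector = _RefCollector()
            with open(page, encoding="utf-8", errors="replace") as fh:
                collector.feed(fh.read())
            for ref in collector.refs:
                try:
                    parts = urlsplit(ref)
                except ValueError:
                    continue  # malformed URL: nothing sensible to check
                # Skip external links (http:, mailto:, //host) and bare anchors.
                if parts.scheme or parts.netloc or not parts.path:
                    continue
                target = os.path.normpath(
                    os.path.join(folder, unquote(parts.path))
                )
                if not os.path.exists(target):
                    missing.append((page, ref))
    return missing


if __name__ == "__main__":
    # "my_wiki_mirror" is a placeholder for wherever the site was saved.
    for page, ref in find_missing_files("my_wiki_mirror"):
        print(f"{page}: missing {ref}")
```

An empty result only means every local reference resolves; it cannot tell you about pages the crawler never discovered in the first place.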

Some things the pages rely on, like links that open in target windows, or even links that jump back out to the internet, might not work for you in an epub, especially if you are off the grid. You might want to download the site, turn off your internet connection, and play with the wiki to see what bombs.

You can use Sigil to import all the files and see how well it works. You can also search the files for www references, which won't work off the grid.
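
If searching by hand gets old, the hunt for www references could be scripted along these lines. This is only a sketch: it looks for href and src values that start with http://, https://, // or www., which is an assumed definition of "points back to the internet", and the names find_external_refs and scan_folder are invented for the example.

```python
import os
import re

# href/src values that leave the local copy: http(s) URLs,
# protocol-relative //host URLs, and bare www. addresses.
EXTERNAL_RE = re.compile(
    r'''(?:href|src)=["']((?:https?:)?//[^"']+|www\.[^"']+)["']''',
    re.IGNORECASE,
)


def find_external_refs(html):
    """Return every link or image target in html that points off-site."""
    return EXTERNAL_RE.findall(html)


def scan_folder(mirror_dir):
    """Map each HTML page under mirror_dir to its off-site references."""
    report = {}
    for folder, _dirs, files in os.walk(mirror_dir):
        for name in files:
            if name.lower().endswith((".html", ".htm", ".xhtml")):
                page = os.path.join(folder, name)
                with open(page, encoding="utf-8", errors="replace") as fh:
                    refs = find_external_refs(fh.read())
                if refs:
                    report[page] = refs
    return report


if __name__ == "__main__":
    # "my_wiki_mirror" is a placeholder for wherever the site was saved.
    for page, refs in scan_folder("my_wiki_mirror").items():
        print(page)
        for ref in refs:
            print("   ", ref)
```

Links it reports are not necessarily a problem; they just will not open without a connection, so it is worth knowing where they are before relying on the book offline.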

As Tex pretty much said, it might be easy, but I wouldn't count on it, particularly if the authors get clever and give you hyperlinks to everything. Great if you are on the net, a great annoyance if you are not.

All times are GMT -4.