|12-01-2009, 04:10 PM||#1|
Join Date: Nov 2009
HTML with external links
I hope I've chosen the right subforum.
I am not really familiar with all these ebook standards...
I would like to write a little Python script that takes a given Wikipedia article and downloads all the Wikipedia entries it mentions and links to, down to a recursion depth of, let's say, 1 or 2.
The output would be a set of HTML files.
How can I convert them to e.g. LRF, so that I can click on a link in one LRF file and get the related article in another LRF file?
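To make the idea concrete, here is a rough sketch of the download part, using only the standard library. The names (`WikiLinkParser`, `extract_article_links`, `crawl`), the User-Agent string and the one-second delay are placeholders I made up, not anything prescribed by Wikipedia. It only collects the pages in memory; saving them to disk and rewriting the links to point at the local files is left out.

```python
import time
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag
from urllib.request import Request, urlopen


class WikiLinkParser(HTMLParser):
    """Collect href values of <a> tags that point at /wiki/ articles."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        # Skip namespaced pages (File:, Help:, Special:, ...), which contain a colon.
        if href and href.startswith("/wiki/") and ":" not in href:
            self.links.append(href)


def extract_article_links(html, base_url):
    """Return absolute article URLs found in html, without fragments or duplicates."""
    parser = WikiLinkParser()
    parser.feed(html)
    seen, result = set(), []
    for href in parser.links:
        url, _fragment = urldefrag(urljoin(base_url, href))
        if url not in seen:
            seen.add(url)
            result.append(url)
    return result


def fetch(url):
    """Download one page; the User-Agent value is an arbitrary placeholder."""
    req = Request(url, headers={"User-Agent": "wiki-offline-sketch/0.1 (personal use)"})
    with urlopen(req, timeout=30) as resp:
        return resp.read().decode("utf-8", errors="replace")


def crawl(start_url, max_depth=1, fetch_page=fetch, delay=1.0):
    """Fetch start_url plus everything it links to, level by level, up to max_depth."""
    pages = {}
    frontier = [start_url]
    for depth in range(max_depth + 1):
        next_frontier = []
        for url in frontier:
            if url in pages:
                continue  # already downloaded via another article
            pages[url] = fetch_page(url)
            if depth < max_depth:
                next_frontier.extend(extract_article_links(pages[url], url))
            time.sleep(delay)  # pause between requests to go easy on the servers
        frontier = next_frontier
    return pages
```

Calling `crawl("https://en.wikipedia.org/wiki/Solar_System", max_depth=1)` would return a dict mapping each URL to its HTML. Even at depth 1 a long article can link to several hundred others, so depth 2 gets large quickly.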
|02-03-2010, 04:48 PM||#2|
Join Date: Jun 2009
Device: prs700, i-mate JAMin, smartq v7, GeeksPhone Zero, iPad 3rd Gen
calibre (http://calibre-ebook.com/) relies on Python, I think, and already has the web2disk and ebook-convert command-line utilities, which should do what you want.
The bad thing is that I tried that with mixed results; I blame Wikipedia's layout, not calibre (I would have trashed my prs700 without that marvelous software). To be fair to Wikipedia, I think recursive downloading of articles is discouraged in the TOS or something similar. And I can understand that: it overloads the servers, and things like that.
If the Plucker format is fine for you, you can try Plucker or SunriseXP; these two work very well. I was able to read the whole Solar System article (60-70MB) on my PPC.
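If you want to drive the conversion from your own Python script, something along these lines should work. This is only a sketch: it assumes `ebook-convert` is on your PATH and that your downloaded pages hang off a local `index.html` (a file name I made up). As far as I know, `--max-levels` controls how deep links in the HTML are followed, and the result is one LRF with the linked pages folded in, not a separate LRF per article.

```python
import subprocess


def build_convert_command(index_html, output_file, max_levels=2):
    """Build the ebook-convert argument list for a local HTML file."""
    return [
        "ebook-convert",
        index_html,
        output_file,  # the output format is picked from the extension, e.g. .lrf
        "--max-levels", str(max_levels),  # how many levels of links to follow
    ]


def convert(index_html, output_file, max_levels=2):
    """Run ebook-convert; raises CalledProcessError if the conversion fails."""
    subprocess.run(build_convert_command(index_html, output_file, max_levels), check=True)
```

For example, `convert("index.html", "wiki.lrf", max_levels=1)` would convert the start page plus the pages it links to directly.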
|02-07-2010, 08:27 AM||#3|
Join Date: Nov 2006
Device: Kindle Voyage, iPad Mini, iPhone 4, MS Surface Pro, N7