07-06-2009, 06:39 PM | #1 |
Connoisseur
Posts: 57
Karma: 30
Join Date: Jul 2009
Location: Netherlands
Device: PW2
|
multi-page HTML with images to ePub or LRF
I'm trying to convert a multi-page html book (http://www.hq.nasa.gov/office/pao/Hi.../contents.html) to something I can read on my PRS-700. I've tried copying and pasting the text into an RTF file, and then using Calibre to convert to LRF or ePUB. This works, however, the images dissapear. The same thing happens when I just toss the RTF file on my reader. When I open the RTF file with MS Word (2007), the images are there and visible.
Any tips? |
07-06-2009, 07:01 PM | #2 |
Sir Penguin of Edinburgh
Posts: 12,375
Karma: 23555235
Join Date: Apr 2007
Location: DC Metro area
Device: Shake a stick plus 1
|
I don't think RTF support on the Sony Reader includes images. The best quick way to do it is copy it into calibre and convert to either LRF or Epub.
|
Advert | |
|
07-06-2009, 07:11 PM | #3 |
Connoisseur
Posts: 57
Karma: 30
Join Date: Jul 2009
Location: Netherlands
Device: PW2
|
The problem is, when I import the 'contents.html' into calibre, it thinks that that file is everything, obviously not what I want. When I import the RTF that I made with MS Word (with the pictures) and then convert to LRF or epub it converts but again misses the images.
|
07-06-2009, 07:27 PM | #4 |
Sir Penguin of Edinburgh
Posts: 12,375
Karma: 23555235
Join Date: Apr 2007
Location: DC Metro area
Device: Shake a stick plus 1
|
Well shoot. Is it possible to open the RTF in MSWord? You could save it as DOC, and then use calibre to convert that (I think).
Thanks for pointing this out. I spidered it, and I will place it on top of my TBC pile. If I get a chance, I'll throw up a Q&D conversion tonight. |
07-06-2009, 07:33 PM | #5 | |
Connoisseur
Posts: 57
Karma: 30
Join Date: Jul 2009
Location: Netherlands
Device: PW2
|
Quote:
I'd be VERY happy with a Q&D conversion (especially if you tell me how you did it). I don't care about non-working links to footnotes etc, if they are at the end of a chapter I can find 'm easily enough, the chapters are short anyway. I tried copying & pasting the text to the Atlantis editor and using it's epub export option. That does seem to work better, however I'll have to copy & paste the images one at a time. Selecting all of the html and pasting it in will not put the images in. Also, the right side of the images is cut off on the reader. At least it's progress |
|
Advert | |
|
07-06-2009, 07:44 PM | #6 |
reader
Posts: 6,975
Karma: 5183568
Join Date: Mar 2006
Location: Mississippi, USA
Device: Kindle 3, Kobo Glo HD
|
Try importing the RTF into OpenOffice and exporting it as an ODT file. This should be readable (with images I think) by Calibre. Anoother possibility is save as "web page filtered" from Word.
|
07-06-2009, 07:55 PM | #7 |
Connoisseur
Posts: 57
Karma: 30
Join Date: Jul 2009
Location: Netherlands
Device: PW2
|
Fixing the links to point to local pages (wget -k) did the trick. Calibre correctly read in all the html files and made a decent LRF out of it. Only problem I have is that for some reason it put some chapters in front of of others when they should not be. Not sure what's going on with that, I opened the 'contents.html', which has all the chapters/pages linked, in the proper order.
|
07-06-2009, 09:29 PM | #8 |
Sir Penguin of Edinburgh
Posts: 12,375
Karma: 23555235
Join Date: Apr 2007
Location: DC Metro area
Device: Shake a stick plus 1
|
Here is the Q&D edition in Epub and Mobipocket.
I haven't done anything to the formatting, and I make no claims about the quality becuase the original html is horrible. But I will say that the links _should_ work correctly, the files _should_ be in the correct order, and all the important images _should_ have been included. Enjoy. EDIT: Having looked at the ebooks I must say that they're a lot better than I expected. SECOND EDIT: I moved the files to the book upload section so others can find them. Epub: https://www.mobileread.com/forums/showthread.php?t=50384 Mobi: https://www.mobileread.com/forums/showthread.php?t=50385 Last edited by Nate the great; 07-06-2009 at 10:24 PM. |
07-06-2009, 09:44 PM | #9 |
Connoisseur
Posts: 57
Karma: 30
Join Date: Jul 2009
Location: Netherlands
Device: PW2
|
Awesome!
If you could tell me how you did it I can do it myself next time around |
07-06-2009, 10:02 PM | #10 |
Sir Penguin of Edinburgh
Posts: 12,375
Karma: 23555235
Join Date: Apr 2007
Location: DC Metro area
Device: Shake a stick plus 1
|
1. Downloaded the set of pages with WinHTTrack.
2. Started a new ebook project in Mobipocket Creator, and carefully added the files a few at a time to make sure they were in the correct order. 3. Failed to build the ebook several times so I could identify and delete the bad files created in the download step. (Don't worry, they were created by the download program and weren't source content.) 4. Built the Mobipocket ebook. Saved the ebook project. 5.Used html2epub.exe with the ebook project files to make the Epub version. Total time invested: about an hour Last edited by Nate the great; 07-06-2009 at 10:05 PM. |
07-07-2009, 12:54 PM | #11 | ||||
GuteBook/Mobi2IMP Creator
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
|
This is absolutely the RIGHT tool for building ebooks from webpages; much easier when the webpages stay on the same domain and go "downwards" from there. Did you realize there was a "cover.html" that would have been the best place to start the spidering instead of the "contents.html"? I spidered it last night and it took all of 6 minutes. The ensuing ebook conversion to .imp took several hours more (see below).
Quote:
Quote:
Quote:
Quote:
Uploading the .imp formats, which differ slightly from your (.prc) version. Check here. I can upload my .prc/.epub versions if you would like as well? Last edited by nrapallo; 07-07-2009 at 01:55 PM. Reason: added link to .imp versions |
||||
07-07-2009, 03:14 PM | #12 | |
Sir Penguin of Edinburgh
Posts: 12,375
Karma: 23555235
Join Date: Apr 2007
Location: DC Metro area
Device: Shake a stick plus 1
|
Quote:
I wish I'd known about the cover but it's okay. I like the one I made. |
|
07-07-2009, 08:26 PM | #13 | |
GuteBook/Mobi2IMP Creator
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
|
Quote:
The missing image was m493b.gif and is attached. There were two corrupt images that I could fix (attached as well), the others were corrupt from when the website was originally set up, as far as I can tell. BTW, here's a snapshot of the cover page I used (basically their cover.html). Last edited by nrapallo; 07-07-2009 at 08:39 PM. |
|
07-07-2009, 09:18 PM | #14 |
Sir Penguin of Edinburgh
Posts: 12,375
Karma: 23555235
Join Date: Apr 2007
Location: DC Metro area
Device: Shake a stick plus 1
|
Thank you for the images.
BTW, I sent the 2 files with all the link errors to the contact email listed. I also sent a list of the errors I found, and mentioned that I was making an ebook. This afternoon I received a response. The History Division at NASA is planning to convert all of their documents to ebooks. They wanted to know about the tools I use and my work process. I wrote a fairly lengthy email. And yes, I did direct them here. |
07-07-2009, 10:20 PM | #15 | |
GuteBook/Mobi2IMP Creator
Posts: 2,958
Karma: 2530691
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
|
Quote:
|
|
Thread Tools | Search this Thread |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
LRFTools. Convert LRF to EPUB, HTML, PDF and RTF | elinares | LRF | 279 | 07-30-2011 11:48 PM |
Problem with html->epub: reader can't page through file | horseflesh | Calibre | 5 | 10-20-2009 12:22 AM |
converting multi-page HTML to Mobipocket | shinew | Calibre | 13 | 02-21-2009 01:33 PM |
HTML to image and CHM to images and CHM to LRF | caritas | LRF | 0 | 12-14-2008 07:58 AM |
Problem converting a webpage html to LRF, what program should I use? Long page turns | seajewel | Workshop | 1 | 08-01-2008 06:32 AM |