#1 |
Member
Posts: 12
Karma: 10
Join Date: Sep 2010
Device: Kindle
|
web page image incorrectly appears at top of conversion
I'm trying to convert multiple nested pages from a web site into an ebook. It's a scientific site, and the pages are image-rich with GIFs. I open a page, e.g., http://www.chemguide.co.uk/analysis/...ation.html#top , in Firefox, save it to my desktop (the save from FF also stores the images locally in a matching folder), and drag the saved HTML into Calibre. The file appears in Calibre as a zip, and I convert it to MOBI without making any changes to the metadata. When I view the converted file, regardless of which page from this site I'm working on, one of the images from the page appears at position 1.0, above the page heading where the book should actually start. That image also still shows correctly where it's supposed to be. It's almost as if the image is being inserted at the beginning as a book cover, although to the right it shows the generic book image.
I don't know if this is related, but when I quit calibre I get this error message:
IOError: [Errno 2] No such file or directory: '/var/folders/3g/3g++kTeeHJmwGtYBJz9CQk+++TI/-Tmp-/calibre_0.7.20_tmp_gFSqaR/ipc_result_1_7_q_9c8r.pickle'
ERROR: ERROR: Unhandled exception: IOError:[Errno 2] No such file or directory: '/var/folders/3g/3g++kTeeHJmwGtYBJz9CQk+++TI/-Tmp-/calibre_0.7.20_tmp_gFSqaR/ipc_result_1_7_q_9c8r.pickle'
Traceback (most recent call last):
  File "/Applications/calibre.app/Contents/Resources/Python/lib/python2.6/site.py", line 147, in main
    return run_entry_point()
  File "/Applications/calibre.app/Contents/Resources/Python/lib/python2.6/site.py", line 116, in run_entry_point
    return getattr(pmod, func)()
  File "site-packages/calibre/utils/ipc/worker.py", line 101, in main
I'm using a PPC Apple iBook running Mac OS 10.5.8, FF 3.6.10, and Calibre 0.7.20. Any help that can be offered would be appreciated. |
#2 |
Wizard
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
|
That's most likely happening because you're not also using the original site's CSS. Without the original CSS, an HTML page will often look nothing like its original formatting on the web.
Anyway just saving the html is not generally the best way to convert a site to an ebook - you should read up on creating recipes, and then create a recipe for that site, only extracting the data that's pertinent to the ebook from the site and leaving the rest of the interactive cruft out. |
#3 |
Member
Posts: 12
Karma: 10
Join Date: Sep 2010
Device: Kindle
|
CSS
The site is actually very clean, with no ads. It doesn't even look bad as a PDF, except that PDFs on the Kindle can't take advantage of most of the device's features, hence my wanting to convert to MOBI. It's a static site and the content does not change, so creating an ebook of it would be a one-time thing. While I initially went into it thinking recipes were the way to go (I originally posted in the recipes forum), it's since become apparent to me that that's not the best approach here. In the recipe forum it was determined that the stray image problem occurs because MOBI doesn't support floating images, so calibre puts them where they appear in the source document markup. I don't know what that means or how to address it.
If you can steer me to any site where I might be able to learn the needed skills it would be appreciated. |
#4 | |
Wizard
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
Quote:
|
|
#5 |
Wizard
Posts: 3,130
Karma: 91256
Join Date: Feb 2008
Location: Germany
Device: Cybook Gen3
|
I know this is considered by most to be a no-go, but did you try just taking the PDF file you said you have and just converting that?
|
#6 |
Wizard
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
|
There don't appear to be any floating images in the linked article, so I'm wondering if there is actually a bug in Calibre somewhere.
Anyway, here is a foolproof, but manual way to do this:
You'll get output that looks like what I've attached. The recipe framework can be used to do all of this manual work automatically, as I mentioned before. It's definitely a bit more work than a standard recipe because there isn't any RSS feed, but it's doable - you could also create your own list of articles to use as the 'feed'. |
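As a rough illustration of the "own list of articles as the feed" idea: calibre recipes can override `parse_index`, which is documented to return a list of `(feed title, list of article dicts)` tuples. The sketch below only builds that structure in plain Python so it can be tried without calibre installed. The section name, article titles, and URLs are placeholders I made up, not real pages from the site, and `build_index` is a hypothetical helper name.

```python
# Hypothetical hand-made "feed": section title -> list of (article title, URL).
# These URLs are placeholders, not real pages from the site being converted.
SECTIONS = {
    "Analysis": [
        ("Page one", "http://example.com/analysis/page1.html"),
        ("Page two", "http://example.com/analysis/page2.html"),
    ],
}


def build_index(sections):
    """Return a list of (section_title, [article_dict, ...]) tuples,
    the shape a recipe's parse_index() is expected to return."""
    index = []
    for section_title, pages in sections.items():
        articles = [
            {"title": title, "url": url, "date": "", "description": ""}
            for title, url in pages
        ]
        index.append((section_title, articles))
    return index


# Inside an actual recipe this would be wired up roughly like so
# (left as a comment because it needs calibre to import):
#
#   from calibre.web.feeds.news import BasicNewsRecipe
#
#   class StaticSite(BasicNewsRecipe):
#       title = "Static site"
#
#       def parse_index(self):
#           return build_index(SECTIONS)
```

The list of pages has to be maintained by hand, which is the extra work compared with an RSS-driven recipe.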
#7 |
Wizard
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
|
I tested his page by just saving the page/HTML out of Firefox, dragging the index.html into Calibre and viewing. That worked correctly. I assume the EPUB conversion would have been fine, too. His problem was the conversion to MOBI. I tried that conversion and confirmed the problem. Kovid posted that floating images weren't supported in the MOBI conversion. I didn't go any further, but if there aren't any floating images, maybe it's a bug in the MOBI conversion code?
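One way to check the "are there any floating images" question on a saved page is to scan the HTML for `<img>` tags that carry `align="left"`/`align="right"` or an inline `float` style. This is only a sketch using Python's standard `html.parser`; the class and function names are my own, and it will not notice floats applied through an external stylesheet.

```python
from html.parser import HTMLParser


class FloatFinder(HTMLParser):
    """Collect the src of <img> tags floated via align= or an inline style."""

    def __init__(self):
        super().__init__()
        self.floated = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        attrs = dict(attrs)
        align = (attrs.get("align") or "").lower()
        # Normalise "float: left" and "float:left" to the same string.
        style = (attrs.get("style") or "").lower().replace(" ", "")
        if (align in ("left", "right")
                or "float:left" in style
                or "float:right" in style):
            self.floated.append(attrs.get("src"))


def find_floated_images(html_text):
    """Return the src values of images that appear to be floated."""
    finder = FloatFinder()
    finder.feed(html_text)
    return finder.floated
```

An empty result for a page that still misplaces an image would point away from floats and toward something else in the markup.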
|
#8 |
creator of calibre
Posts: 45,190
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
That site uses tables for layout. Use the linearize tables option. Table support in MOBI is terrible.
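For anyone converting from the command line rather than the GUI, calibre's `ebook-convert` tool has a `--linearize-tables` switch that corresponds to this option. A minimal sketch of calling it from Python follows; the file names in the usage note are placeholders, the helper names are my own, and it assumes `ebook-convert` is on your PATH (on a Mac it lives inside the calibre application bundle, so you may need to give the full path instead).

```python
import shutil
import subprocess


def build_convert_command(src, dest):
    """Build an ebook-convert command line with table linearization enabled."""
    return ["ebook-convert", src, dest, "--linearize-tables"]


def convert(src, dest):
    """Run the conversion; raises if the tool is missing or the run fails."""
    if shutil.which("ebook-convert") is None:
        raise RuntimeError("ebook-convert not found on PATH")
    subprocess.run(build_convert_command(src, dest), check=True)
```

Calling `convert("page.zip", "page.mobi")` would then run the same conversion as the GUI with the option ticked.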
|