04-29-2011, 02:29 PM | #1 |
Member
Posts: 14
Karma: 10
Join Date: Apr 2011
Device: windows pc
|
ePub to HTML links all broke
I am trying to use calibre to convert some ePubs to html. Calibre does the conversion but when I open the resulting html file all the links are kinda goofed (they are pointing to web addresses rather than the content on my hard drive). Also, no pictures appear. Calibre creates the images subdirectory and puts the pictures in there but their names do not match the codes in the html file.
Am I doing something wrong? The reason I am doing this is because, while I love Calibre's management and conversion capabilities I want to use Blio as my reader. I just love its interface. The rest of Blio is not so good (and their support sucks). It only works with .xps files, if it would read ePubs I wouldnt have this problem. So I am trying to convert ePubs to html which I can then open in MS word and save out as xps files for Blio. Thanks for any tips Last edited by perry59; 04-29-2011 at 02:30 PM. Reason: typo |
04-29-2011, 04:30 PM | #2 | |||
Sigil & calibre developer
Posts: 2,488
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
|
What format are you using for HTML output (ZIP or HTMLZ)?
Quote:
Quote:
Quote:
|
|||
Advert | |
|
05-02-2011, 03:25 PM | #3 |
Member
Posts: 14
Karma: 10
Join Date: Apr 2011
Device: windows pc
|
I didnt realize there was a zip output option. I output to htmlz then renamed it a zip file so I could access the contents. Yes the picture links were pointing to the website to (gutenberg) but the links are really weird with a lot of "@" in them. When I get home I'll compare the htmlz output to what I find by just renaming the epub and looking at its internal html. I would just use the content of the epubs but they typically break a book into many html files which would be a pain to stitch together.
Thanks |
05-02-2011, 03:40 PM | #4 |
Sigil & calibre developer
Posts: 2,488
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
|
@ is not a character conversion should be inserting in links. It sounds like there is an issue with the book you are converting.
|
05-02-2011, 05:42 PM | #5 |
Wizard
Posts: 1,613
Karma: 6718479
Join Date: Dec 2004
Location: Paradise (Key West, FL)
Device: Current:Surface Go & Kindle 3 - Retired: DellV8p, Clie UX50, ...
|
I've seen "@" symbols in image filenames in ePubs from PG many times. It seems that they convert their web server's full pathname to the image file into a filename and replace the illegal "/" symbols with ampersands. The result is a very long filename beginning with "www" and including 4-8 ampersands. One image in a book I recently reformated is named "www.gutenberg.org@dirs@3@2@1@0@32104@32104-h@images@image_f.jpg" in the ePub.
|
Advert | |
|
05-02-2011, 08:33 PM | #6 |
Sigil & calibre developer
Posts: 2,488
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
|
That's my point. An @ symbol will not be created by calibre as it rewrites links in a consistant manner. The only way for a link to contain an @ symbol is if the original link is an external link that includes it. HTMLZ's rewriting of links does not touch external links.
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Can Calibre Strip HTML links when exporting to epub? | Dasun | Calibre | 6 | 03-03-2020 02:47 AM |
Will Calibre maintain the links when it converts HTML? | ficbot | Calibre | 3 | 11-18-2010 10:27 PM |
html to zip without following links | dracore | Calibre | 1 | 09-08-2010 06:10 PM |
Quick and dirty conversion of html to epub WITH intra-file links | Birdonawire | ePub | 2 | 06-18-2010 02:18 AM |
HTML with external links | posativ | LRF | 2 | 02-07-2010 07:27 AM |