![]() |
#1 |
Junior Member
![]() Posts: 5
Karma: 10
Join Date: Jun 2016
Device: kindle
|
extracting html file
I have a bit of a problem that I'm really hoping someone can help with! I lost my original html file for my ebook. I have the .mobi and .ebub files within Calibre and would love to know whether there's any way that I can extract the full original html file from either of those, or perhaps elsewhere in Calibre.
I have had a look in the library and see the files there but when I go to click and open the .zip it just opens another zip, which opens another zip... Would REALLY appreciate any help. Thank you! |
![]() |
![]() |
![]() |
#2 |
null operator (he/him)
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 21,724
Karma: 29711016
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
This is for EPUB.
Select Book in Book List, press 'U' - that will take you to the Unpack facility, it unpacks to a temporary folder, and opens it in your file manager, from there you can copy the html, css etc files. An EPUB is a zip, so you could copy the EPUB somewhere, rename to .zip and open in with your everyday unzip utility. Some 'MOBi's' can be unpacked but I've never needed or wanted to do that so I'm not sure about doing it. There's a KindleUnpack plugin. BR |
![]() |
![]() |
Advert | |
|
![]() |
#3 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 602
Karma: 1712372
Join Date: Feb 2013
Location: germany
Device: PocketBook Touch
|
Use the built-in Ebookeditor of Calibre to open the epup. Then you see all containing files.
To look in the library ist es very dangerous way. Looking ist alowed, but changing never ever. |
![]() |
![]() |
![]() |
#4 | |
Junior Member
![]() Posts: 5
Karma: 10
Join Date: Jun 2016
Device: kindle
|
Quote:
This splits the files into multiple html files though, and I had them within one file originally. I'm not really confident enough to rebuild it from these. Is there any way of extracting the original do you know, or has that gone for good? Thanks again!! |
|
![]() |
![]() |
![]() |
#5 |
Junior Member
![]() Posts: 5
Karma: 10
Join Date: Jun 2016
Device: kindle
|
Thank you for your help! I had looked there but would like a bit more control within my ebook editor external to Calibre...
|
![]() |
![]() |
Advert | |
|
![]() |
#6 | |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 31,048
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
The split must have been done sometime earlier. Unpack OR the Editor, just work on what is there. ![]() 1) If you use the Editor, you can work on any piece and saving the 'edit' puts them back into the 'book' in the order of the filelist 2)As long as you leave the Unpack session active, you can click the 'Rebuild" button after editing/replacing pieces. keep the same names unless you also manually correct the OPF (and NCX) |
|
![]() |
![]() |
![]() |
#7 | |
Junior Member
![]() Posts: 5
Karma: 10
Join Date: Jun 2016
Device: kindle
|
Quote:
|
|
![]() |
![]() |
![]() |
#8 | |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 31,048
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
Just export the file and use WinRAR or similar to extract the archive |
|
![]() |
![]() |
![]() |
#9 |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 6,251
Karma: 16539642
Join Date: Sep 2009
Location: UK
Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3
|
@beccaa,
I don't know whether this would fit your requirements, but have you tried converting the epub (or mobi or whatever) to calibre's HTMLZ format? HTMLZ is a standard zip file containing one big HTML file (plus images etc). However, be aware that the conversion may introduce changes to original HTML/CSS class names. |
![]() |
![]() |
![]() |
#10 | |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,613
Karma: 6718541
Join Date: Dec 2004
Location: Paradise (Key West, FL)
Device: Current:Surface Go & Kindle 3 - Retired: DellV8p, Clie UX50, ...
|
Quote:
The original HTML doesn't not exist in the .MOBI. It would have been altered by the conversion process. Anything that you unpack from the .MOBI, if possible, would be this altered HTML or xHTML file. The ePUB might contain the original HTML, but it is quite unlikely that it does. If the ePUB was created by a conversion to ePUB process then the HTML or xHTML in the ePUB would be somewhat different from the original. As stated in another post, if (and that's a big IF) the original HTML was added to calibre before conversion to the other formats it would normally be wrapped up as a ZIP archive when added to the library. That ZIP would contain the original HTML, but no product of a conversion to another format (e.g. ePUB, MOBI, AZW3, ...) would contain an exact copy of the original HTML. Even when the target format for a conversion contains HTML file(s) the HTML code will have been altered by the conversion process so that it complies with the target format's limitations. |
|
![]() |
![]() |
![]() |
#11 |
Junior Member
![]() Posts: 5
Karma: 10
Join Date: Jun 2016
Device: kindle
|
Thanks everyone. I tried some of your suggestions but as you say, the files were altered. Went back to an old version and made some changes manually. I say some changes - like, all night long!
![]() Really appreciate your help and advice though - thank you ![]() |
![]() |
![]() |
![]() |
Tags |
html epub mobi |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Help with extracting pdf file | S411il | General Discussions | 13 | 01-28-2014 05:48 PM |
Extracting html from Mobi on OSX | mGorilla | Kindle Formats | 6 | 05-10-2011 05:00 AM |
Extracting a cover image from lit file | p3aul | Calibre | 6 | 07-25-2010 04:33 PM |
Extracting firmware bin file | adreamer | Ectaco jetBook | 1 | 01-02-2010 01:38 PM |
Extracting html/images from within .imp files! | nrapallo | IMP | 12 | 03-10-2009 10:22 PM |