04-17-2018, 09:56 PM | #1 |
Wizard
Posts: 3,413
Karma: 13369310
Join Date: May 2008
Location: Launceston, Tasmania
Device: Sony PRS T3, Kobo Glo, Kindle Touch, iPad, Samsung SB 2 tablet
|
How does one join epub files?
I am currently working on Cousin Pons by Honoré de Balzac for the MR library. This is one of Balzac's last books, and is not divided into chapters.
It is available from the University of Adelaide ebook library as an HTML file of 668kb, and from Wikisource as 17 smaller files. I find the 668kb file hard to work with, and think my Sony PRS T3 reader will choke on a file that size anyway. Would it be possible to use the Wikisource files by producing smaller ePub files, and making them follow each other in the ebook without needing to start them on new pages? Cousin Pons was initially published as a serial in the Constitutionel newspaper, so I presume that the Wikisource files are the translations of each episode. I could present them that way, but I'd much prefer to join them all together. |
04-17-2018, 10:47 PM | #2 |
A Hairy Wizard
Posts: 3,093
Karma: 18727053
Join Date: Dec 2012
Location: Charleston, SC today
Device: iPhone 11/X/6/iPad 1,2,Air & Air Pro/Surface Pro/Kindle PW & Fire
|
I wouldn't recommend multiple epubs Alex. 700k is fairly small for an epub - I regularly work with 1.5 mb files with no problem as long as they are not on a single html page in the epub. You can make individual HTML files in the same epub for each of those serials.
|
Advert | |
|
04-18-2018, 01:15 AM | #3 |
null operator (he/him)
Posts: 20,553
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
@AlexBell - calibre has an EpubMerge plugin.
I've used it in a similar situations to yours, sticking newspaper/magazine serialised stories together, to good effect. BR |
04-18-2018, 01:39 AM | #4 | |
Grand Sorcerer
Posts: 12,155
Karma: 73448616
Join Date: Nov 2007
Location: Toronto
Device: Nexus 7, Clara, Touch, Tolino EPOS
|
Quote:
Sent from my Nexus 7 using Tapatalk |
|
04-18-2018, 07:10 AM | #5 | |
Wizard
Posts: 3,413
Karma: 13369310
Join Date: May 2008
Location: Launceston, Tasmania
Device: Sony PRS T3, Kobo Glo, Kindle Touch, iPad, Samsung SB 2 tablet
|
Quote:
Are any of these epubs in the public domain, so I could download it and see how you did it? Last edited by AlexBell; 04-18-2018 at 07:19 AM. |
|
Advert | |
|
04-18-2018, 07:13 AM | #6 | |
Wizard
Posts: 3,413
Karma: 13369310
Join Date: May 2008
Location: Launceston, Tasmania
Device: Sony PRS T3, Kobo Glo, Kindle Touch, iPad, Samsung SB 2 tablet
|
Quote:
|
|
04-18-2018, 07:17 AM | #7 |
Wizard
Posts: 3,413
Karma: 13369310
Join Date: May 2008
Location: Launceston, Tasmania
Device: Sony PRS T3, Kobo Glo, Kindle Touch, iPad, Samsung SB 2 tablet
|
Yes, of course. I didn't explain myself clearly. But I still need to learn how to stop each XHTML file from starting at the top of a new page, and leaving whitespace after itself.
|
04-18-2018, 09:01 AM | #8 |
A Hairy Wizard
Posts: 3,093
Karma: 18727053
Join Date: Dec 2012
Location: Charleston, SC today
Device: iPhone 11/X/6/iPad 1,2,Air & Air Pro/Surface Pro/Kindle PW & Fire
|
Well, if you don't want to use Calibre, then you can just copy/paste the contents between each of the file's <body> tags onto a single html file. Just make sure that any CSS styling is consistent.
You will have a large single file size which will cause some devices to choke. You need to choose which you prefer, choking devices or some logical separation of the files where starting at the top of a page is not an issue. edit: warning: Sigil slows down quite a bit when working with a single large HTML file. edit, edit: Honestly it would be less work to just use the single University file...clean it up if necessary...then use Sigil's Split at Cursor function at logical separation points. Last edited by Turtle91; 04-18-2018 at 09:27 AM. |
04-18-2018, 09:24 AM | #9 |
Resident Curmudgeon
Posts: 73,896
Karma: 128597114
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
|
04-18-2018, 09:56 AM | #10 |
A Hairy Wizard
Posts: 3,093
Karma: 18727053
Join Date: Dec 2012
Location: Charleston, SC today
Device: iPhone 11/X/6/iPad 1,2,Air & Air Pro/Surface Pro/Kindle PW & Fire
|
So...now that I have had some sleep and re-read your original post Alex... Jon's answer is correct for your question. You can't prevent a new HTML page from starting at the top of the next screen. I've looked at the University version of the file...there is zero styling within the document itself - nice and clean! The title should be on its own page and the bit at the end about the University should be removed if you create your own version. The Wikisource version at least has the italics already added. I'm not familiar with the sharing policy of either source...just make sure you are allowed to use/change what you get from them. If you are correct about the serial nature of the publication then those would be just as logical a split point as any. You could even add the print date as an editor's note at the top of each page to give it your personal flair. Good Luck! Last edited by Turtle91; 04-18-2018 at 10:02 AM. |
04-18-2018, 02:41 PM | #11 | |
mostly an observer
Posts: 1,515
Karma: 987654
Join Date: Dec 2012
Device: Kindle
|
Quote:
As a reader, I appreciate logical places to stop, or rather to start again, instead of using the Notes feature to mark a SRL. |
|
04-18-2018, 06:22 PM | #12 | |
Well trained by Cats
Posts: 29,779
Karma: 54830978
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
Code:
<spine toc="ncx"> <itemref idref="titlepage"/> <itemref idref="calibre_jacket"/> <itemref idref="a02-title"/> <itemref idref="a03-copyright"/> <itemref idref="a04-toc"/> <itemref idref="a05-acknowledgments"/> <itemref idref="a06-foreword"/> <itemref idref="a07-writersroom"/> <itemref idref="a08-planning"/> <itemref idref="a09-finding"/> <itemref idref="a10-theme"/> |
|
04-19-2018, 06:39 AM | #13 | |
Guru
Posts: 970
Karma: 4999999
Join Date: Mar 2009
Location: Rosario, Argentina
Device: SONY PRS-505, PRS-T2
|
Quote:
|
|
04-19-2018, 06:52 PM | #14 |
Wizard
Posts: 3,413
Karma: 13369310
Join Date: May 2008
Location: Launceston, Tasmania
Device: Sony PRS T3, Kobo Glo, Kindle Touch, iPad, Samsung SB 2 tablet
|
Many thanks to all who responded, and I apologise again for the poor wording of my original post. Please find attached an early and unfinished version of the ebook I am trying to do. The HTML file Chapter02.html which makes up the story contains about a quarter of the original book. On my Sony and Kobo readers it breaks at
he is one of the kings of fashion in Paris.'</p> <p>'But a wife changes everything so that there is quite a bit of white space between the end of one paragraph and the start of the next paragraph. That is the problem I am fussing about, and want to know how to fix. This material from an earlier post may be a solution, <itemref idref="calibre_jacket"/> <itemref idref="a02-title"/> <itemref idref="a03-copyright"/> <itemref idref="a04-toc"/> <itemref idref="a05-acknowledgments"/> <itemref idref="a06-foreword"/> <itemref idref="a07-writersroom"/> <itemref idref="a08-planning"/> <itemref idref="a09-finding"/> <itemref idref="a10-theme"/> and I'll certainly try to find out what it means. I've never seen anything like that before; can anyone point me to where itemref and idref are discussed? I've never seen or used them before. |
04-19-2018, 08:16 PM | #15 | |||
Wizard
Posts: 2,297
Karma: 12126329
Join Date: Jul 2012
Device: Kobo Forma, Nook
|
Quote:
Quote:
And if not, then you'll just have to bring out the ol' chopper and chop it anyway (what was that old filesize warning limit again? 300 KB? 256KB?). If the entire chapter is going to be 600 KBs, that is much too large. And Alex, like JSWolf said, once you split those files into separate HTML files, there just isn't much you can do. Each HTML file starts on its own page, you don't have control over that. It just so happens to be one of the quirks of reading systems we have to live with. Quote:
It's how the Book Browser and EPUB-reader knows what files are a part of the EPUB + the logical reading order of the HTML files:
And then when you want to change the order the files are displayed in, you just go drag-and-dropping the files around in the Book Browser to a different order: If you open up your content.opf, you can see this: Spoiler:
The <manifest> part just lists all the files in the EPUB and what kind of files they are (this JPG is a jpeg file, this CSS is a CSS file, this HTML file is XHTML, [...]). The <spine toc="ncx"> part just explains what the reading order is. If you want to read the technical details on it, it is explained in the MobileRead Wiki: https://wiki.mobileread.com/wiki/OPF ... but like I said, that's best left to just Sigil/Calibre. You never really have to go digging in there. |
|||
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Question about metadata in epub files vs opf files | machstem | Calibre | 3 | 06-19-2017 12:43 PM |
Library Thing: EPUB Borrowing in the (Non-)English World and Can I Join? | avid01 | General Discussions | 14 | 07-01-2014 05:10 AM |
epub files on kobo have all been changed into shortcut files? | emme278 | Kobo Reader | 6 | 11-01-2013 04:32 AM |
Unwanted epub files once mobi files have been converted. | fletchdt | Conversion | 5 | 03-22-2012 10:18 PM |
Txt files - Convert to Epub - Multiple files into one book - noob help | Cernan | Calibre | 6 | 05-18-2010 10:12 AM |