![]() |
#1 |
Zealot
![]() Posts: 104
Karma: 22
Join Date: Jun 2010
Device: none
|
Converting from PDF to ePub, Calibre not working
Hello,
I recently published an eBook that is beautifully designed in InDesign, and then converted to PDF. A number of people have asked me for it in ePub and I have tried to convert the file using Calibre, but the text comes out all scrambled, formatting all over the place -- it really looks like the book went through a meat grinder. I've tried to convert the PDF to html, hoping that might work better, but alas, it just looks the same. The instructions for exporting from InDesign into ePub are so ridiculously complicated that it is far beyond my capabilities. Is there any way to convert this PDF file to ePub in a reasonably easy way? So grateful for any help! Thanks. Alda |
![]() |
![]() |
![]() |
#2 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,196
Karma: 1281258
Join Date: Sep 2009
Device: PRS-505
|
Indesign works fine for exporting epubs, though CS5 has some additional features that make it easier to produce a file that'll work on mobile readers.
All styling must be applied through paragraph or character styles, which you should be doing anyway, and the text (obviously) has to be laid-out in linked text boxes. Don't even think of trying to convert from a PDF, that's truly a waste of time. |
![]() |
![]() |
![]() |
#3 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 697
Karma: 150000
Join Date: Feb 2010
Device: none
|
What Charlski said, except that ID4 export to epub is just a starting point.
It will need some tweaking in order to become an epub worthy of the name. The biggest fault is that all the body of the book will be in a single .xhtml file within the epub, which will break some readers. I know, because our small publishing house is now involved in converting our print books (imposed in InDesign) to epub, and I get the task of tweaking the output of ID4 so it will meet our standards. Still, the following steps are orders of magnitude better than trying to convert from PDF, which is probably the worst possible choice of starting material (due to the diametrically opposed goals of each format). Starting from the ID4 exported epub, you will need to: 1) expand the epub to a working directory 2) clean up the styles, and make sure that chapter headers are recognizable (i.e. <h1> tag instead of something like <p class="Heading-1"... or such. 3) Add chapter breaks (to break up the single book body file into manageable chunks, via something like sigil) 4) Adjust the CSS file to reasonable values. 5) Re-zip the working directory to an epub file 6) import the new .epub file into sigil, or some equivalent which AFIK does not yet exist. 7) make necessary modifications to text, and metadata, and save. 8) validate 9) you're done! Sounds like a lot of work, doesn't it? and I have probably left out a few critical details at that. But really, it isn't that bad. Less than an hour to come up with a usable epub that will pass epubcheck. |
![]() |
![]() |
![]() |
#4 | ||
Zealot
![]() ![]() Posts: 115
Karma: 150
Join Date: Jul 2008
Location: Netherlands Veenendaal
Device: Palm T5, Sony PRS-505, Nook Color
|
Quote:
With it you can break the large xhtml into chapters, clean up the styles, add/remove images, edit text, etc. Then if needed: Quote:
Joop |
||
![]() |
![]() |
![]() |
#5 | |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,196
Karma: 1281258
Join Date: Sep 2009
Device: PRS-505
|
Quote:
ID5 allows you to control flow breaks through use of heading styles which have been specified in the ID-generated Table of Contents. There's a video demonstrating the process here. If you use a lot of internal links in your book you'll want to take a look at this blog post and use the script linked there. |
|
![]() |
![]() |
![]() |
#6 | |
Zealot
![]() Posts: 104
Karma: 22
Join Date: Jun 2010
Device: none
|
Thanks everyone!
Quote:
My husband is the InDesign whiz, but he has no knowledge of this new format. Looks like some in-depth study is needed -- and I fear it will take more than an hour! |
|
![]() |
![]() |
![]() |
#7 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 697
Karma: 150000
Join Date: Feb 2010
Device: none
|
@charleski Thanks for that Info! I am glad to know the differences with ID4 and ID5. Our small publishing house JUST paid for upgrading to ID4, and we are not willing to go to ID5 until we see some real benefits. As it is, the ID4 output can be easily converted to something sigil-ready so we are content to stand pat.
@alda: The epub format may seem intimidating at first, but actually it is quite understandable. Don't give up too soon. As tools, look at sigil and calibre, especially the discussions in these forums (and of course this epub forum itself), and you'll soon get a grasp on the issues. One of the strange and wonderful things that I've observed is that "small" publishers are more than willing to share what they've learned in order to help other small publishers. This is in contrast to the "large" brick-and-mortar publishers. So just ask, if you have further questions. |
![]() |
![]() |
![]() |
#8 |
Zealot
![]() Posts: 104
Karma: 22
Join Date: Jun 2010
Device: none
|
@ st_albert -- thank you.
![]() |
![]() |
![]() |
![]() |
#9 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,196
Karma: 1281258
Join Date: Sep 2009
Device: PRS-505
|
If you paid for CS4 after April 12 2010 you may qualify for a free upgrade to CS5, phone Adobe to check.
|
![]() |
![]() |
![]() |
#10 |
Resident Curmudgeon
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 78,945
Karma: 144284074
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
One of the reasons we have lots of eBooks out there with errors is because a PDF source was used as the master document to make the eBook versions. I know of no program that can take a novel length PDF file and convert it without introducing errors. That's just how things are.
For example, I have read the first two Sookie Stackhouse books and by looking at the CSS, I se that they were converted from PDF. And because of this, there are errors. In most of the italics, there are missing or extra spaces at the begging and/or ends of the italics. Also, after the simulated small caps, the space was missing. Those sorts of errors are unacceptable. If I had not gotten the ePubs from the library, I would have been asking for my money back. So please do not start with a PDF source ever. Take the PDF and delete it. Make It go away. If there is no existing PDF, there cannot be a convert ion that introduces errors that never should have been introduced in the first place. The only way to make sure a conversion from PDF is correct is to compare the PDF to the converted output and that means checking every word, every letter, every punctuation mark, and even every space. |
![]() |
![]() |
![]() |
#11 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 697
Karma: 150000
Join Date: Feb 2010
Device: none
|
JSWolf, I heartily concur! As I see it, the .pdf format (page structure is preserved no matter what printing system you use) is diametrically opposed to the .epub concept of flowable text that adopts itself (via the ebook reader software) to whatever "page" size you are using at the moment.
.pdf is the WORST possible choice -- the last resort -- to convert to epub. @Alda and @charlski, thanks for the good advice and wishes. Will check on the eligibility to upgrade to ID5. We really want the ability to have one master file of corrections, etc. That being the InDesign file itself. |
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
converting pdf to epub | Gagan | ePub | 65 | 06-28-2017 11:57 PM |
Problem converting pdf to epub (size) using calibre | abadguy | 6 | 03-23-2012 05:33 AM | |
Problem converting PDF to EPUB in calibre | adgpro | Calibre | 2 | 07-09-2010 01:10 AM |
Converting Merged HTML file to Epub/PDF Not Working | MV64 | Calibre | 1 | 06-07-2010 07:48 PM |
Calibre: wrong drawings when converting Pdf to epub | gillesB. | Calibre | 1 | 05-01-2009 12:48 PM |