11-18-2008, 07:01 AM | #1 |
Junior Member
Posts: 7
Karma: 10
Join Date: Oct 2008
Device: epub
|
converting pdf to epub
I want to convert PDF files to ePub format. I tried calibre, but the ePub it generated is not good.
If anybody is using another tool for the same conversion, please let me know. |
11-18-2008, 07:47 AM | #2 |
Resident Curmudgeon
Posts: 75,901
Karma: 134368292
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
First convert the PDF into HTML and make sure you've fixed all of the errors the PDF conversion introduced into the HTML. I've never yet seen a PDF converter that is 100% error free, so the ePub may simply be reflecting errors from the PDF conversion step.
|
11-18-2008, 08:26 AM | #3 |
Feedbooks.com Co-Founder
Posts: 2,263
Karma: 145123
Join Date: Nov 2006
Location: Paris, France
Device: Sony PRS-t-1/350/300/500/505/600/700, Nexus S, iPad
|
PDF is the worst source format that you can use. Don't expect good results from automatic PDF to ePub conversion: you'll always have to fix a lot of things manually.
|
11-18-2008, 09:57 AM | #4 |
reader
Posts: 6,975
Karma: 5183568
Join Date: Mar 2006
Location: Mississippi, USA
Device: Kindle 3, Kobo Glo HD
|
The only other ebook-centric converter I am aware of (which does not convert to images) is Mobipocket Reader or Creator for Windows, which will convert from PDF to MOBI. Calibre can then convert the MOBI to ePub. This may be no better than Calibre's native converter, but it might be worth trying. Note that the source code for the underlying pdf2xml is available; see the thread "Mobipocket convert in mass?".
|
11-19-2008, 12:25 AM | #5 |
Junior Member
Posts: 7
Karma: 10
Join Date: Oct 2008
Device: epub
|
I have tried both conversions: pdf > html > epub and pdf > mobi > epub.
Both have some problems, and the HTML files have to be modified manually to get the required ePub. Where can I find the source code for pdf2xml? Thanks for your time. |
11-19-2008, 01:23 AM | #6 |
reader
Posts: 6,975
Karma: 5183568
Join Date: Mar 2006
Location: Mississippi, USA
Device: Kindle 3, Kobo Glo HD
|
It is at the pdf2xml homepage.
|
05-03-2010, 11:30 AM | #7 |
Junior Member
Posts: 1
Karma: 10
Join Date: May 2010
Location: Karachi - Pakistan
Device: none
|
Try this website: http://epub2go.com, it's free ;-)
|
05-04-2010, 03:42 PM | #8 |
Evangelist
Posts: 412
Karma: 546196
Join Date: Mar 2009
Location: UK canal boat
Device: sony prs505, prs650, kobo Glo HD liseuses
|
FWIW I've found the only way to get a good conversion from PDF is the labour-intensive one: export the PDF as text, load the .txt into your favourite text editor, use extensive search & replace to eliminate hard line endings etc. (or perhaps less search & replace if you have any competency with regex, which I don't), convert straight quotes to curly ones (“ ” etc.), and finally turn it into a decent ePub with Sigil. Loading into Calibre is then of course a trivial exercise.
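For anyone comfortable with a little scripting, the search & replace steps described here could be sketched in Python roughly as follows. This is only an illustration, not the poster's actual workflow: the function names are made up, blank lines are assumed to mark paragraph breaks in the exported text, and the quote rules are a simple heuristic that will still need proofreading.

```python
import re


def unwrap_hard_line_breaks(text: str) -> str:
    """Join the hard-wrapped lines of each paragraph into a single line.

    Assumption: paragraphs in the exported text are separated by blank lines.
    """
    # Normalise Windows and old Mac line endings first.
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    paragraphs = re.split(r"\n\s*\n", text)
    joined = [
        " ".join(line.strip() for line in para.split("\n") if line.strip())
        for para in paragraphs
    ]
    return "\n\n".join(p for p in joined if p)


def curl_quotes(text: str) -> str:
    """Replace straight quotes with curly ones using a simple heuristic."""
    # A quote at the start of the text, or after whitespace or an opening
    # bracket, is treated as an opening quote; every other quote is closing.
    text = re.sub(r'(^|[\s(\[{])"', r"\1“", text)
    text = text.replace('"', "”")
    text = re.sub(r"(^|[\s(\[{“])'", r"\1‘", text)
    text = text.replace("'", "’")  # also covers apostrophes such as it's
    return text


if __name__ == "__main__":
    sample = 'He said "it\'s\nfine" and left.\n\nNext paragraph\nhere.'
    print(curl_quotes(unwrap_hard_line_breaks(sample)))
```

The cleaned text can then be pasted into Sigil as before. Exports that do not put a blank line between paragraphs would need a different rule for spotting paragraph ends.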
|
05-04-2010, 04:13 PM | #9 |
Feral Underclass
Posts: 3,622
Karma: 26821535
Join Date: Jan 2010
Location: Yorkshire, tha noz
Device: 2nd hand paperback
|
I've tried most of these methods, but the best so far is to open the PDF in an OCR program and generate a new text file from that. Then manually delete any headers and footers, and fix any broken paragraphs. I haven't seen any common OCR problems yet, presumably because the text in a PDF will be perfectly straight and without any scanner or paper noise.
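The header and footer cleanup mentioned above can be partly automated. The following is a rough Python sketch under some loud assumptions: the text is already split into one string per page, a running header or footer is the first or last non-blank line of a page, and it repeats (ignoring page numbers) on most pages. The function names and the `min_share` threshold are invented for the example.

```python
import math
import re
from collections import Counter


def _normalise(line: str) -> str:
    # Replace digit runs so that "Page 12" and "Page 13" compare as equal.
    return re.sub(r"\d+", "#", line.strip())


def strip_running_lines(pages: list[str], min_share: float = 0.6) -> list[str]:
    """Drop first/last non-blank lines that recur on most pages."""
    heads, tails = Counter(), Counter()
    for page in pages:
        content = [ln for ln in page.split("\n") if ln.strip()]
        if content:
            heads[_normalise(content[0])] += 1
            tails[_normalise(content[-1])] += 1

    # A line must recur on at least two pages to count as a header or footer.
    threshold = max(2, math.ceil(min_share * len(pages)))
    headers = {key for key, n in heads.items() if n >= threshold}
    footers = {key for key, n in tails.items() if n >= threshold}

    cleaned = []
    for page in pages:
        lines = page.split("\n")
        idx = [i for i, ln in enumerate(lines) if ln.strip()]
        if idx and _normalise(lines[idx[0]]) in headers:
            del lines[idx[0]]
            idx = [i for i, ln in enumerate(lines) if ln.strip()]
        if idx and _normalise(lines[idx[-1]]) in footers:
            del lines[idx[-1]]
        cleaned.append("\n".join(lines).strip("\n"))
    return cleaned


if __name__ == "__main__":
    pages = [
        "My Book\nFirst page text\n1",
        "My Book\nSecond page text\n2",
        "My Book\nThird page text\n3",
    ]
    print(strip_running_lines(pages))
```

If the exported text separates pages with form feed characters, `text.split("\f")` is one way to get the list of pages. A page that happens to end with a bare number in its body text would lose that line, so the result still needs checking by eye, and broken paragraphs still have to be fixed by hand.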
|
05-05-2010, 04:03 PM | #10 |
Wizard
Posts: 4,520
Karma: 121692313
Join Date: Oct 2009
Location: Heemskerk, NL
Device: PRS-T1, Kobo Touch, Kobo Aura
|
I agree. I usually use ABBYY FineReader to OCR the PDF. It still takes a lot of manual cleanup afterwards, but the results are quite good.
|
05-05-2010, 06:44 PM | #11 | |
Guru
Posts: 714
Karma: 2003751
Join Date: Oct 2008
Location: Ottawa, ON
Device: Kobo Glo HD
|
Quote:
That sounds like your OCR program is "cheating" and using PDF tags, when and if they are available. |
|
05-06-2010, 10:35 AM | #12 | |
Writer2ePub creator
Posts: 354
Karma: 121129
Join Date: Sep 2009
Location: Genova, Italy
Device: Cybook Bebook iLiad Kindle HanlinV2 Readius SonyPRS500 SonyPRS700 etc
|
Quote:
Luke |
|
05-30-2010, 06:46 PM | #13 |
Junior Member
Posts: 1
Karma: 10
Join Date: May 2010
Device: Ipad
|
I have found the simplest way
|
05-30-2010, 07:15 PM | #14 | |
Wizard
Posts: 1,213
Karma: 12890
Join Date: Feb 2009
Location: Amherst, Massachusetts, USA
Device: Sony PRS-505
|
Quote:
|
|
05-31-2010, 01:13 AM | #15 | |
Writer2ePub creator
Posts: 354
Karma: 121129
Join Date: Sep 2009
Location: Genova, Italy
Device: Cybook Bebook iLiad Kindle HanlinV2 Readius SonyPRS500 SonyPRS700 etc
|
Quote:
I have released a new version of Writer2ePub; it corrects many issues under Windows. Look in my signature to download it. Luke |
|
|