![]() |
#1 |
Connoisseur
![]() Posts: 72
Karma: 10
Join Date: Oct 2009
Location: Canada
Device: Sony PRS-505, Nintendo DS, HTC Desire
|
Properly formatted PDFs to Epub
I've been having difficulties when it comes to converting my many pdf files. In general, the majority of my pdf's are formatted well and are very much the most readable they can be.
But after using Calibre to convert them to epub I lose much of the formatting. Gaps appear where there should be none, huge breaks in text etc. I've read through FAQ's and such and can't seem to figure it out. I know pdf's are the least desirable format to convert from, but aside from the odd image for chapter breaks, there really shouldn't be a reason I lose the paragraph, sentence or whatever structure. Is there something I can do to remedy this? |
![]() |
![]() |
![]() |
#2 |
Junior Member
![]() Posts: 1
Karma: 10
Join Date: Jul 2009
Location: Granada, Spain
Device: I don't know
|
Maybe you can convert pdf to word (rtf) and try with this one.
|
![]() |
![]() |
Advert | |
|
![]() |
#3 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 972
Karma: 4999999
Join Date: Mar 2009
Location: Rosario, Argentina
Device: SONY PRS-T2, Kindle Paperwhite 11th gen
|
What I do with pdf's:
- Use mobipocket creator to convert to html - Edit the html with notepad++ - Create the epub with Sigil - Unzip the epub and make further corrections with notepad++ |
![]() |
![]() |
![]() |
#4 |
.
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,408
Karma: 5647231
Join Date: Oct 2008
Device: never enough
|
I have exported PDFs directly from Acrobat to HTML with some success, but I'm really just waiting for Kovid to finish his sure to be awesome improved PDF conversion thing.
![]() |
![]() |
![]() |
![]() |
#5 |
Connoisseur
![]() Posts: 72
Karma: 10
Join Date: Oct 2009
Location: Canada
Device: Sony PRS-505, Nintendo DS, HTC Desire
|
I'll try html and rtf, converting straight from pdf to epub just leaves way to much to edit afterwards. Hopefully going through another format first I can keep the form structure.
Thanks for the help |
![]() |
![]() |
Advert | |
|
![]() |
#6 | |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 6,247
Karma: 16539642
Join Date: Sep 2009
Location: UK
Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3
|
Quote:
https://www.mobileread.com/forums/showthread.php?t=57115 |
|
![]() |
![]() |
![]() |
#7 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,196
Karma: 1281258
Join Date: Sep 2009
Device: PRS-505
|
A lot depends on whether the original PDF was properly tagged. If it wasn't then there's nothing for it but to got through the text searching for bad paragraph marks. If it was, then first export the PDF to html in Acrobat and then feed that to calibre.
Pablo is right, though, even a tagged PDF will need a reasonable amount of html editing to create a professional result, unless whoever created it was particularly scrupulous with the tags and carefully marked each style. |
![]() |
![]() |
![]() |
#8 | |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,213
Karma: 12890
Join Date: Feb 2009
Location: Amherst, Massachusetts, USA
Device: Sony PRS-505
|
Quote:
I'd recommend using a method that preserves the look of the PDFs exactly if the PDFs already look good. This could involve just using the PDF itself, if your device is capable, or doing minor tweaks like removing margins and cutting the text into chunks sized for your reader. Explore options such as sopdf, PDFLRF and PDFread and see if you get results you like. |
|
![]() |
![]() |
![]() |
#9 | |
Connoisseur
![]() Posts: 72
Karma: 10
Join Date: Oct 2009
Location: Canada
Device: Sony PRS-505, Nintendo DS, HTC Desire
|
Quote:
|
|
![]() |
![]() |
![]() |
#10 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,213
Karma: 12890
Join Date: Feb 2009
Location: Amherst, Massachusetts, USA
Device: Sony PRS-505
|
In that case, my suggestion would be to try using PDFLRF to convert to LRF, and then use calibre to convert the LRF to ePub.
|
![]() |
![]() |
![]() |
#11 |
Connoisseur
![]() Posts: 72
Karma: 10
Join Date: Oct 2009
Location: Canada
Device: Sony PRS-505, Nintendo DS, HTC Desire
|
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Where to get classics which are properly formatted? | neonbible | General Discussions | 16 | 09-08-2010 09:55 PM |
EPUB Expert Needed: Cant properly export epub from InDesign | crottmann | ePub | 17 | 08-27-2010 10:23 AM |
Epub File Won't Open Properly in FBreader | Marcy | PocketBook | 8 | 05-10-2010 07:43 AM |
Site with content in Sony formatted PDFs | Fugubot | Sony Reader | 2 | 11-30-2006 11:03 PM |