12-31-2018, 05:04 AM | #1 |
Junior Member
Posts: 2
Karma: 10
Join Date: Dec 2018
Device: none
|
pdf->epub - using indents as a cue to line-unwrapping
I'm using the latest Calibre to convert a PDF to EPUB. With the default settings, every paragraph gets split into multiple paragraphs. All the paragraphs are clearly indented, though, so I'm not sure why Calibre is having trouble. I don't want to adjust the "unwrap factor", since it relies on lines ending early and is therefore inexact. How can I force Calibre to take indents into account when determining paragraph breaks? If that's not possible, are there any other tools that have that option?
|
12-31-2018, 11:21 AM | #2 | |
Addict
Posts: 384
Karma: 1638210
Join Date: May 2013
Location: Ontario, Canada
Device: Kindle KB, Oasis, Ubuntu, Jutoh,Kobo Forma
|
Quote:
Assuming, of course, that what you want exists to start with. As with anything PDF, success depends on what is inside the source file. Pdftotext will at least show you what is there, and the result may vary from excellent to impossible. Simple books like novels often work well with this, but if you have double columns or something complex like a science textbook, it's a lot more work. |
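For what it's worth, if the text dump does keep the first-line indents (for example from `pdftotext -layout`, which tries to preserve the page layout), the unwrapping itself is easy to script. Below is a minimal sketch, not anything Calibre does internally: the function name `unwrap_by_indent` and the `min_indent` threshold are made up for illustration, and it assumes indents are spaces and that every paragraph starts with one.

```python
def unwrap_by_indent(text: str, min_indent: int = 2) -> list[str]:
    """Join hard-wrapped lines into paragraphs, using indentation as the only cue.

    Hypothetical helper: a line indented at least `min_indent` spaces beyond
    the body's left margin starts a new paragraph; every other line is glued
    onto the current paragraph.
    """
    # splitlines() also splits on the form feeds pdftotext emits at page breaks.
    lines = [ln.rstrip() for ln in text.splitlines()]
    non_blank = [ln for ln in lines if ln.strip()]
    if not non_blank:
        return []

    def leading_spaces(ln: str) -> int:
        return len(ln) - len(ln.lstrip(" "))

    # Treat the smallest indent found anywhere as the body's left margin.
    margin = min(leading_spaces(ln) for ln in non_blank)

    paragraphs: list[list[str]] = []
    current: list[str] = []
    for ln in non_blank:  # blank lines are ignored, so page breaks don't split paragraphs
        if leading_spaces(ln) - margin >= min_indent and current:
            paragraphs.append(current)
            current = []
        current.append(ln.strip())
    if current:
        paragraphs.append(current)

    # Join wrapped lines with a single space; hyphenated line ends are left alone.
    return [" ".join(p) for p in paragraphs]


if __name__ == "__main__":
    sample = (
        "    First paragraph starts here and\n"
        "wraps onto a second line.\n"
        "    Second paragraph is also\n"
        "wrapped across\n"
        "three lines.\n"
    )
    for para in unwrap_by_indent(sample):
        print(para)
```

Page headers, page numbers and centered headings will get merged into the surrounding text with something this naive, and plain text carries no italics or images, so treat it as a starting point only.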
|
12-31-2018, 11:26 AM | #3 |
Junior Member
Posts: 2
Karma: 10
Join Date: Dec 2018
Device: none
|
Solved it by opening the PDF in MS Word, importing the resulting docx into Calibre and converting that to EPUB, then manually fixing the giant margins and the wrong text-align on a few paragraphs. That preserved all the italics, which are lost if I convert the PDF to plain text. Bonus points for inline images that are actually inline.
Last edited by VcSaJen; 12-31-2018 at 11:29 AM. |
12-31-2018, 01:02 PM | #4 |
null operator (he/him)
Posts: 20,550
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
@VcSaJen - There are a couple of Word add-ins with tools that help with PDF conversions ==>> eBook Tools and TransTools.
There's some discussion on the latter in this thread from post #19 onwards. BR |
|