|04-09-2010, 05:45 PM||#1|
Join Date: Aug 2008
Location: Plano, TX
Device: Sony PRS-505 + B&N Nook + Motion LE1700 + Motorola Xoom Wifi
Punctuation in PDF conversions
I've recently used calibre to convert several PDFs to epub format. I noticed that formatting tags tend to be greedy and grab nearby punctuation.
eg. (Book Title p22)
gets converted as <i>(Book Title</i> p22) giving one italic parenthesis and one normal parenthesis: (Book Title p22)
and, "this is bold."
gets converted as "this is <b>bold."</b> giving a bold period & quote mark: "this is bold."
Has anyone else noticed this? Is this a calibre issue? or an issue with the original PDF files? (the original PDFs do not appear this way in evince.)
I would think that formatting should exclude nearby punctuation by default.
|04-09-2010, 11:05 PM||#2|
creator of calibre
Join Date: Oct 2006
Location: Mumbai, India
This is a pdftohtml issue, though I suppose it should be possible for calibre to work around it. Open a ticket and attach a sample PDF file and I'll see what can be done.
|Thread Tools||Search this Thread|
|Thread||Thread Starter||Forum||Replies||Last Post|
|slow pdf conversions + processes not cancelling||cybmole||Calibre||10||10-05-2010 06:36 AM|
|Calibre PDF conversions - LRF/EPUB vs RTF||jackie_w||Calibre||14||09-22-2009 04:06 PM|
|How To Preserve Paragraph Tabs in PDF conversions?||Neil||Calibre||6||05-12-2009 01:14 AM|
|PDF Conversions Fail||jethro10||Calibre||0||03-07-2009 05:36 AM|
|How good are Amazon's pdf conversions?||Red Stapler||Which one should I buy?||7||09-11-2008 03:39 PM|