05-09-2015, 08:57 PM | #1 |
Addict
Posts: 298
Karma: 1537324
Join Date: Aug 2010
Location: Chicago
Device: Nook, K3, Fire, Nexus 7
|
PDF to MOBI - All hyphens removed
I have a very basic PDF - just text, no headers/footers/images. Calibre does a good job of converting it to MOBI, except that it removes every single hyphen. I have the conversion set to the defaults.
Is there anything I can do to stop this from happening? Thanks! |
05-09-2015, 11:48 PM | #2 |
Ex-Helpdesk Junkie
Posts: 19,422
Karma: 85397180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
|
|
05-10-2015, 12:03 AM | #3 |
Addict
|
I read through the sticky before I posted and I'm not sure how it helps me. . .
|
05-10-2015, 12:12 AM | #4 |
Ex-Helpdesk Junkie
|
Are you sure the PDF is actually text, or is it, like many "text" PDFs, composed of an image of text, with a layer of OCR for searching purposes? It can sometimes be hard to distinguish the two.
|
05-10-2015, 12:13 AM | #5 | |
Well trained by Cats
Posts: 29,792
Karma: 54830978
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
PDF uses odd characters in places. PDF is a 'paste-up' page format: paste down some pictures, now paste some body text, now paste large caps, now paste corrections. The order does not matter (to PDF), because PDF is making a PAGE. OTOH, conversion expects a top-to-bottom feed, and there is no place for it here. |
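To make the 'paste-up' point concrete, here is a toy sketch of what a converter has to do before it can produce a top-to-bottom feed. It assumes each piece of text arrives as a hypothetical `Fragment` with page coordinates (origin at the bottom-left, as in PDF), in whatever order it was pasted. It is not how calibre does it; real converters also deal with columns, fonts, rotation and much more.

```python
from typing import List, NamedTuple


class Fragment(NamedTuple):
    """Hypothetical positioned piece of text, as a PDF might store it."""
    x: float     # distance from the left edge of the page
    y: float     # distance from the bottom edge of the page
    text: str


def reading_order(fragments: List[Fragment], line_tolerance: float = 2.0) -> List[str]:
    """Rebuild top-to-bottom, left-to-right lines from unordered fragments."""
    # Higher y means higher on the page, so sort by descending y.
    ordered = sorted(fragments, key=lambda f: -f.y)
    lines, current = [], []
    for frag in ordered:
        # Start a new line when the fragment sits clearly below the current one.
        if current and abs(current[0].y - frag.y) > line_tolerance:
            lines.append(current)
            current = []
        current.append(frag)
    if current:
        lines.append(current)
    # Within a line, order fragments left to right and join them with spaces.
    return [" ".join(f.text for f in sorted(line, key=lambda f: f.x)) for line in lines]


# Fragments listed in "paste" order, not reading order.
pasted = [
    Fragment(72, 700, "body"),
    Fragment(130, 720.5, "One"),
    Fragment(110, 700, "text"),
    Fragment(72, 720, "Chapter"),
]
print(reading_order(pasted))
```

Anything the page layout does not state outright, such as which characters belong to the same word, has to be guessed in a step like this, which is one place where text can get mangled.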
|
05-10-2015, 12:14 AM | #6 |
Addict
|
I just tried converting the PDF to an RTF and then a TXT file, and both those conversions kept the hyphens.
When I convert to MOBI or EPUB, the hyphens vanish. So it doesn't appear to be a problem with the PDF: Calibre can recognize the hyphens in the PDF. Calibre, however, doesn't output the hyphens to a MOBI or EPUB.
Last edited by ManosHandsOfFate; 05-10-2015 at 01:36 AM. Reason: clarification |
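For anyone wanting to check the same thing without eyeballing it, here is a rough sketch that counts hyphen-like characters in the TXT output and in the EPUB output so the two can be compared. It relies only on the fact that an EPUB is a zip archive of (X)HTML files; the function names and the list of characters in `DASH_CHARS` are my own choices, not anything calibre provides.

```python
import html
import re
import zipfile
from collections import Counter

# Hyphen-minus, soft hyphen, hyphen, non-breaking hyphen, en dash, em dash, minus sign.
DASH_CHARS = "\u002d\u00ad\u2010\u2011\u2013\u2014\u2212"


def count_dashes(text):
    """Count each hyphen-like character that occurs in text."""
    return Counter(ch for ch in text if ch in DASH_CHARS)


def epub_text(epub_file):
    """Return the text of every (X)HTML file inside an EPUB (path or file object)."""
    parts = []
    with zipfile.ZipFile(epub_file) as book:
        for name in book.namelist():
            if name.lower().endswith((".xhtml", ".html", ".htm")):
                markup = book.read(name).decode("utf-8", errors="replace")
                # Crude tag stripping, then decode entities such as &shy;.
                parts.append(html.unescape(re.sub(r"<[^>]+>", "", markup)))
    return "\n".join(parts)


def compare(txt_text, epub_file):
    """Return (dash counts in the TXT text, dash counts in the EPUB text)."""
    return count_dashes(txt_text), count_dashes(epub_text(epub_file))
```

Usage would be something like `compare(open("book.txt", encoding="utf-8").read(), "book.epub")`, with both file names standing in for your own. If the EPUB total is roughly the same as the TXT total but spread over different characters, the hyphens were converted to something else rather than dropped.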
05-10-2015, 02:15 AM | #7 | |
null operator (he/him)
Posts: 20,565
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
Quote:
Convert to EPUB and have a look at the actual character; the quickest way would be to use the calibre book editor and look via Tools->Reports->Characters. Maybe the font or the reader you're using doesn't support what you think is a hyphen. If they're not regular hyphens you can 'fix them up' in the editor and convert the EPUB to MOBI. BR |
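If you would rather do the same check on a chunk of text outside the editor, the sketch below lists every dash-like or invisible formatting character with its code point and Unicode name, which is roughly the information the Characters report gives you. The helper names are hypothetical, and which characters count as "should be a plain hyphen" is an assumption you may want to adjust.

```python
import unicodedata
from collections import Counter


def character_report(text):
    """Return (code point, Unicode name, count) for each dash-like character."""
    rows = []
    for ch, n in sorted(Counter(text).items()):
        category = unicodedata.category(ch)
        # Pd = dash punctuation, Cf = invisible format characters such as the
        # soft hyphen; MINUS SIGN is a math symbol, so it is checked by hand.
        if category in ("Pd", "Cf") or ch == "\u2212":
            rows.append((f"U+{ord(ch):04X}", unicodedata.name(ch, "UNKNOWN"), n))
    return rows


# Assumption: U+2010 HYPHEN and U+2011 NON-BREAKING HYPHEN should become "-".
# Soft hyphens are left alone because they are meant to be invisible.
_TO_PLAIN_HYPHEN = str.maketrans({"\u2010": "-", "\u2011": "-"})


def normalize_hyphens(text):
    """Replace hyphen look-alikes with the plain ASCII hyphen-minus."""
    return text.translate(_TO_PLAIN_HYPHEN)


sample = "well\u00adknown self\u2011aware x-y"
for row in character_report(sample):
    print(row)
print(normalize_hyphens(sample))
```

A hyphen that shows up here as U+2010 or U+2011 is present in the file but may not be drawn by a font or reader that lacks that glyph, which would look exactly like "all hyphens removed".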
|
05-10-2015, 11:28 PM | #8 | |
Addict
|
Quote:
Thanks for your help. Since the conversion was so consistent about removing the hyphens for only MOBI and EPUB files, I thought maybe there was just a setting I couldn't find. I think it's best if I just read this through the PDF. |
|
05-11-2015, 12:17 AM | #9 |
null operator (he/him)
|
It's hard to resolve without having the original PDF; as you know, PDF is a 'pig' to convert. FWIW I am not aware of anything in calibre's PDF converter that could cause this problem - but I don't use it that much.
If you use Windows you could try the MobiPocket Creator tool (there's a link in the PDF Conversion sticky) to create HTML or PRC, then convert that to EPUB and look at it in the editor (and in Sigil for a second opinion if you're inclined). If the PDF is not copyright protected you could post it here and I'll see if I can work it out. Or you could attach it to a Bug Report here ==>> Bugs : calibre, mark it Private, and perhaps a developer can help. BR |
|