MobileRead Forums > E-Book Software > Calibre > Conversion

Old 08-21-2013, 03:58 AM   #1
cptnemo
Enthusiast
cptnemo began at the beginning.
 
Posts: 35
Karma: 10
Join Date: Oct 2011
Device: Kindle 3
PDF output is searchable with Adobe Reader but not with Mac Preview

Hello,

I converted an .epub into a .pdf. When I search the text with Mac Preview (or with Skim), I get no matches. Also, when I copy text from the .pdf, I get a blank string, even though I can highlight the text normally.

All the above problems disappear when I use Adobe Reader. I can search and copy.

Should I select different options for the conversion?
Old 08-21-2013, 06:31 AM   #2
kovidgoyal
creator of calibre
Posts: 25,949
Karma: 5036099
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
That's because Mac Preview has no support for Unicode CMaps, a feature that was introduced to PDF over six years ago.
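
For anyone who wants to check a given file for the feature mentioned above, here is a rough sketch in Python (it is not part of calibre). It counts how often the PDF name tokens /ToUnicode and /Identity-H appear in the raw bytes of a file. The function name scan_pdf_font_markers is made up for illustration, and the check is deliberately crude: it assumes the font dictionaries are stored uncompressed, so a PDF that keeps them inside compressed object streams can show zero hits even if it uses them.

```python
from pathlib import Path


def scan_pdf_font_markers(pdf_path):
    """Count raw occurrences of two font-related PDF name tokens.

    Assumption: the font dictionaries are not stored in compressed
    object streams; if they are, this byte scan will not see them.
    """
    data = Path(pdf_path).read_bytes()
    return {
        "Identity-H": data.count(b"/Identity-H"),
        "ToUnicode": data.count(b"/ToUnicode"),
    }
```

Calling scan_pdf_font_markers("book.pdf") (a placeholder file name) returns a small dict of counts. Non-zero counts only suggest that the file relies on these features; they say nothing about whether a particular viewer can handle them.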
Old 08-21-2013, 06:22 PM   #3
cptnemo
Enthusiast
cptnemo began at the beginning.
 
Posts: 35
Karma: 10
Join Date: Oct 2011
Device: Kindle 3
Quote:
Originally Posted by kovidgoyal View Post
That's because Mac Preview has no support for Unicode CMaps, a feature that was introduced to PDF over six years ago.
Is there an easy way to change the character encoding of the PDF output in calibre? (Or is that unrelated to the problem? I saw there are options for the input encoding, but I can't find any for the output...)
Old 08-21-2013, 07:47 PM   #4
kovidgoyal
creator of calibre
Posts: 25,949
Karma: 5036099
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
No.
Old 08-21-2013, 09:40 PM   #5
cptnemo
Enthusiast
cptnemo began at the beginning.
 
Posts: 35
Karma: 10
Join Date: Oct 2011
Device: Kindle 3
Thanks.

I found a workaround for my character encoding problem, in case anyone is interested. See below.

If you want to copy and paste from the PDFs using Preview.app or Skim.app, you can't use calibre to generate the PDF. I don't grasp all the technicalities but, in short, the problem has to do with the encoding: calibre produces PDFs whose text uses "Identity-H" encoding, while Preview.app and Skim.app need "Ansi" encoding. Instead, you can:

1) Convert with calibre to HTMLZ.
2) Replace the .htmlz extension of the file with .zip
3) Unzip the file
4) Create the PDF with Adobe Acrobat Pro using "Create from webpage"

(I also tried Word and LibreOffice, but they don't keep the internal links.)
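
Steps 2 and 3 above can also be done in one go with a few lines of Python, since the steps imply that an HTMLZ file is just a ZIP archive under a different extension. This is only a sketch: the function name extract_htmlz and the file names in the usage note are made up for illustration, and creating the PDF afterwards (step 4) still has to be done by hand.

```python
import zipfile
from pathlib import Path


def extract_htmlz(htmlz_path, out_dir):
    """Unpack an HTMLZ file and return the HTML files it contained."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    # zipfile reads the archive by content, not by extension,
    # so renaming the file to .zip first is not needed here.
    with zipfile.ZipFile(htmlz_path) as archive:
        archive.extractall(out)
    # Collect the extracted HTML pages, sorted for a stable order.
    return sorted(
        p for p in out.rglob("*") if p.suffix.lower() in {".html", ".htm"}
    )
```

For example, extract_htmlz("book.htmlz", "book_html") unpacks everything into a book_html folder and returns the paths of the HTML files, which can then be opened in whatever tool you use to create the PDF.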
Old 08-26-2014, 03:48 AM   #6
t04sty
Junior Member
t04sty began at the beginning.
 
Posts: 2
Karma: 10
Join Date: Aug 2014
Device: Android
Quote:
Originally Posted by kovidgoyal View Post
No .
So close, but this is a dealbreaker. A PDF that isn't searchable in MacOS Preview and lacks the ability to copy content is useless to me. The Calibre PDF output looked very nice once I tweaked the epub in Sigil, though.

Thanks for your efforts in developing Calibre, but I guess I will have to look for a different software package that can do what I need.

(The HTMLZ workaround breaks the ToC if I open it in Word, so it's back to square 1)
Old 08-26-2014, 05:52 AM   #7
t04sty
Junior Member
t04sty began at the beginning.
 
Posts: 2
Karma: 10
Join Date: Aug 2014
Device: Android
Quote:
Originally Posted by cptnemo View Post
Thanks.
I found a workaround for my character encoding problem, in case anyone is interested. See below.
If you are looking for a fast solution and don't mind sacrificing the internal links, simply open the Calibre-produced PDF in Mac OS Preview, go to the Print dialog, and from there "Save as PDF". This will produce a PDF that is searchable, with copyable content, but which lacks the internal links.

All times are GMT -4.