It would be nice if folks actually read the sticky posts in this forum, specifically, in this case, the "Read this before Posting PDF Questions" sticky.
Quote:
Originally Posted by MacEvansCB
Nope... wasn't ligatures... clicked on 'Keep Ligatures' under 'Look and Feel'... exact same results.
|
It is most definitely a ligature problem: calibre will handle some ligatures, but others are outside of calibre's control and thus not supported at this time.
From the Sticky post:
Quote:
Originally Posted by ldolse
Various character pairs like 'ff', 'll', etc are missing from my conversion
This is probably caused by the PDF containing what are called ligatures. These occur when the publisher changes certain pairs of characters into a single character to make the text 'look better'. Common are 'll', 'fl', 'fi', 'ff', 'ffl', and 'ffi'. Unfortunately, due to a bug in the third party library Calibre uses, in many cases ligatures simply aren't supported. Several users have reported having good luck with Mobipocket Creator or Acrobat Professional for these types of files.
|
On a more practical note:
Quote:
Originally Posted by MacEvansCB
Does anyone have any idea how much fun this is to repair, word by word???
|
We all do. On the rare occasion (once, I believe) that I decided to deal with this issue, I used Sigil: I opened up my epub and did a find and replace to change "col ected" to "collected" through the entire document in one fell swoop. Wash, rinse and repeat for each new word.
This is still a pain, and most of the time I find a different source to use for the conversion.
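If you do end up going the find-and-replace route, the same idea can be scripted so each new word is one more entry in a table rather than another pass through the document. What follows is only a rough sketch in Python, assuming you have the book's text (or the HTML from inside the epub) as a string. The `FIXES` table and the `repair_words` function are names made up for illustration; they are not part of calibre or Sigil.

```python
import re

# Hypothetical table of broken words and their repaired forms.
# Add an entry by hand each time you spot another damaged word.
FIXES = {
    "col ected": "collected",
}

def repair_words(text, fixes=FIXES):
    """Replace each broken word with its repaired form, whole words only."""
    for broken, fixed in fixes.items():
        # \b on both sides stops "col ected" from matching inside a
        # longer run of letters such as "protocol ected".
        pattern = r"\b" + re.escape(broken) + r"\b"
        text = re.sub(pattern, fixed, text)
    return text

sample = "He col ected stamps for years."
print(repair_words(sample))
```

The match is case-sensitive, so a capitalised "Col ected" would need its own entry. It is also still a word-by-word list, so it saves clicks rather than the work of finding the broken words in the first place.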