Quote:
Originally Posted by LGN
I noticed that when converting some PDF into an RTF file the conversion sometimes looses a letter in a word, like a word with a double 'l'. For example the word 'wollen' becomes 'wol en'. Its not consequent so I'm unsure if here are other factors involved. I haven't checked the same PDF to another format though.
Anyone noticed this before?
|
The sticky says:
"
Various character pairs like 'ff', 'll', etc are missing from my conversion
This is probably caused by the PDF containing what are called ligatures. These occur when the publisher changes certain pairs of characters into a single character to make the text 'look better'. Common are 'll', 'fl', 'fi', 'ff', 'ffl', and 'ffi'. Unfortunately, due to a bug in the third party library Calibre uses, in many cases ligatures simply aren't supported. Several users have reported having good luck with Mobipocket Creator or Acrobat Professional for these types of files."
Some searching here may turn up more info on how to work around this problem.