There are many kinds of PDF content and most convert poorly or not at all.
Also unfortunately the computer industry is very Latin/Roman font orientated.
I'm sure I've lots of English PDFs that neither work well with k2pdfopt, which is about the best tool, nor well with OCR (font, scan quality etc).
|