View Single Post
Old 09-03-2014, 07:15 PM   #5
eschwartz
Ex-Helpdesk Junkie
eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.
 
eschwartz's Avatar
 
Posts: 19,421
Karma: 85400180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
Quote:
Originally Posted by nee View Post
I already have! There is no concrete answer to my problem. I know pdfs are tough to be converted. I hoped someone had a similar issue and that they managed to fix it somehow.
Just making sure you knew. (Next time, you can jump the gun by saying so in the OP. )

I know of no way to do it, sorry. The only cure is manual fixing.

I would start by opening it in the calibre ebook editor, and running spellcheck. Assuming calibre has a croatian dictionary, that is. You should be able to import a libreoffice dictionary if not.

You should be able to find many words that are invalid without the diacritic, and correct them in bulk. I don't know if any words would be considered correct either way, but if there are any, a regex should be able to match them once you know which words to look for.

Hopefully this will repair the majority of missing diacritics. If not, the thorough way will work -- unfortunately, that means going though the whole book line by line.

Quote:
Originally Posted by susan_cassidy View Post
It sounds like an encoding problem.
No. Encoding problems leave short strings of gibberish instead of specific characters. This is a PDF conversion, which is different.
eschwartz is offline   Reply With Quote