01-07-2022, 09:07 AM   #5
Hitch
Bookmaker & Cat Slave
Posts: 11,503
Karma: 158448243
Join Date: Apr 2010
Location: Phoenix, AZ
Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2
Quote:
Originally Posted by bleopaskom
Thanks, then I guess I won't be bothered with trying to manually link hundreds of footnotes, that would be a waste of time.


Even those really bad manually scanned books with slightly yellow/darker pages were OCRd correctly, with obviously a few characters misread, because the pages were not aligned and it wasn't in English.


Other than that, when I go to edit in Calibre, the only errors that I get are the missing rules in the main.css. Even then, if I just send them to my Kindle Paperwhite 4, there are no issues whatsoever.
If you're doing this for personal use alone, and you don't care overmuch whether the footnotes pop up or not, then I guess this is the best approach--just leave them as they are.
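
If you'd rather check than guess whether a given ePUB's footnotes are linked at all, something like the rough sketch below might help. It assumes the notes are marked the EPUB 3 way (`epub:type="noteref"` on the link), which an Abbyy export may well not do, so it also counts plain in-book links that point at a fragment (`#...`). The function name `count_note_links` is made up for illustration, and a zero count doesn't prove anything about how a particular Kindle will behave.

```python
import re
import zipfile

# Assumption: pop-up footnote callers carry epub:type="noteref" (EPUB 3 semantics).
NOTEREF = re.compile(
    r'<a\b[^>]*\bepub:type\s*=\s*["\'][^"\']*\bnoteref\b[^"\']*["\'][^>]*>', re.I
)
# Any <a> whose href contains a fragment (#id); a rough proxy for "linked" notes.
FRAGMENT_LINK = re.compile(
    r'<a\b[^>]*\bhref\s*=\s*["\'][^"\']*#[^"\']*["\']', re.I
)


def count_note_links(epub_path):
    """Return (noteref_links, fragment_links) summed over the book's (X)HTML files."""
    noterefs = 0
    fragments = 0
    # An ePUB is a zip archive, so the standard zipfile module can read it.
    with zipfile.ZipFile(epub_path) as book:
        for name in book.namelist():
            if name.lower().endswith((".xhtml", ".html", ".htm")):
                text = book.read(name).decode("utf-8", errors="replace")
                noterefs += len(NOTEREF.findall(text))
                fragments += len(FRAGMENT_LINK.findall(text))
    return noterefs, fragments
```

If both numbers come back as zero, the footnotes are almost certainly plain text, and leaving them alone is the only option short of linking them by hand.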

I, like the others, find myself pretty speechless over the idea that Abbyy did an amazing job converting a PDF to an ePUB. I mean...I work hand-in-hand with a very great fellow, who does world-class scanning/OCR. We send all our querents there, who have printed books in-hand and all that (and image-layer-only PDFs and so on). I've seen what his ePUB exports look like, from Abbyy--and they make me wince. And yet, he's the best and most experienced scanner and Abbyy-user that I know. So...you are either incredibly lucky or you simply want different things from your conversions.

(n.b.: I cannot tell you how many people have shown up at my shop, over the years, with "eBooks" made by Abbyy, from scanners like BlueLeaf, etc., needing help to fix them. Honestly, it's cheaper and less time-consuming to take their OCR-output Word file and redo the book from the jump than it is to fix those messes. Boggling, really, that people can't/don't see how bad those auto-ePUBs really are....)

It sounds to me as though you're using Abbyy on modern PDFs that already have a text layer. If you are, I'm honestly not sure why you'd do that. You can export a modern PDF from Acrobat Pro, DC, etc. into Word and then use that Word file to make an ePUB in Calibre, too. Why go the Abbyy FineReader route, unless you have a PDF that's image-only?

Are the footnotes in the PDFs you are scanning already linked, do you know?

Hitch