MobileRead Forums - View Single Post - Epub from Google books to AZW3 as the original ?

lumpynose · 08-22-2019, 02:53 PM

In addition to the image layer from the scanned old book, PDFs can have an "invisible" layer over that that is the optical character recognition (OCR) text. You can drag your mouse over the page and select and copy the invisible text. There are typically many errors in the OCR of old books. As far as I know the only time you can have this dual layer of the original image and the OCR'd text is with a PDF. The EPUB you downloaded was the OCR'd text from the PDF. You can also find many of these dual layer PDFs on archive.org.

08-22-2019, 02:53 PM	#3
lumpynose Wizard Posts: 1,086 Karma: 6719822 Join Date: Jul 2012 Device: Palm Pilot M105	In addition to the image layer from the scanned old book, PDFs can have an "invisible" layer over that that is the optical character recognition (OCR) text. You can drag your mouse over the page and select and copy the invisible text. There are typically many errors in the OCR of old books. As far as I know the only time you can have this dual layer of the original image and the OCR'd text is with a PDF. The EPUB you downloaded was the OCR'd text from the PDF. You can also find many of these dual layer PDFs on archive.org.