06-22-2015, 10:15 AM | #1 |
Junior Member
Posts: 9
Karma: 10
Join Date: Apr 2015
Device: ipad
|
problem viewing "OCRd" pdf on voyage
We have been scanning our paper books with the fantastic iX500 scanner. The books are coming out great when viewed on a ipad, mac, pc, etc. However, on my wife's voyage if the pdf file has been run through the OCR program many characters are dropped in words.
After much testing we have determined it matters not what resolution of scan, nor grey scale vs black and white. All seem to view fine as long as the file has not been "OCRd". Is there a method of keeping multiple versions of a file in the same format? I would love to be able to keep both the OCR and the non-OCR files under the same title. Right now I make up two separate book titles, one ending in Not OCRd, but this is clumsy at best. Right now I am regretting getting the voyage. I just assumed it would support pdf files better. |
06-22-2015, 02:45 PM | #2 |
Wizard
Posts: 2,251
Karma: 3720310
Join Date: Jan 2009
Location: USA
Device: Kindle, iPad (not used much for reading)
|
No one ever said that an eink Kindle can do a good job with pdf files.
Sounds like the OCR program isn't very good. If you are running it through an OCR program, what format is the output in? Do you then run it through conversion software to make it into an ebook? You wouldn't be able to tell the pdf version from any other version, if they had the same title. How could you? All it shows is the title. |
Advert | |
|
06-22-2015, 04:58 PM | #3 |
Junior Member
Posts: 9
Karma: 10
Join Date: Apr 2015
Device: ipad
|
The output is pdf and it displays great on everything but a kindle. The kindle drops tons of characters in words.
I did convert it into AZW format that seems to work okay, not great, the pdf had the advantage of being able to "zoom in" while with the AZW if one increases the font size the pagination is horrid. |
06-22-2015, 05:41 PM | #4 |
Wizard
Posts: 2,251
Karma: 3720310
Join Date: Jan 2009
Location: USA
Device: Kindle, iPad (not used much for reading)
|
There is no such thing as pagination in an ebook. It is one continuous stream of text.
|
06-22-2015, 05:54 PM | #5 |
Omnivorous
Posts: 3,281
Karma: 27978909
Join Date: Feb 2008
Location: Rural NW Oregon
Device: Kindle Voyage, Kindle Fire HD, Kindle 3, KPW1
|
I've never understood the need to read pdf's on 6 inch screens. The font size is so small you **have** to zoom to read them. My old Kindle DX (9.7 inch) was *just* usable.
|
Advert | |
|
06-22-2015, 07:54 PM | #6 | |
Wizard
Posts: 3,168
Karma: 37800000
Join Date: Jan 2010
Location: Walton-on-Thames, Surrey, England, UK
Device: Kindle Keyboard 3G, Kindle Fire 2, NOOK ST, Kindle HDX, Fire 7"
|
Quote:
Some PDFs (being as how a PDF is up to three types rolled into one container) need a better PDF client than Amazon provide. On a Fire you can try different Apps, on an eink you cannot. |
|
06-23-2015, 09:37 AM | #7 |
eBook Enthusiast
Posts: 85,544
Karma: 93383043
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
|
I am a little confused about what it is you're actually attempting to read on the Voyage. Is it a PDF file, or it is a file that you've converted from PDF to Mobi format using an OCR program?
|
06-23-2015, 10:55 AM | #8 | |
Junior Member
Posts: 9
Karma: 10
Join Date: Apr 2015
Device: ipad
|
Quote:
I then run them through an OCR program that also produces a pdf file. The kindle is only able to display the non-OCRd pdf file properly. When the OCR file is read, tons of characters in words are dropped. If I convert the OCR version to AZW, the kindle seems to display that properly, however we all understand how poorly a pdf file converted looks. |
|
06-23-2015, 10:58 AM | #9 | |
eBook Enthusiast
Posts: 85,544
Karma: 93383043
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
|
Quote:
Abbyy FineReader is an excellent OCR program, and you can buy older versions of it at pretty moderate prices. |
|
06-23-2015, 12:11 PM | #10 |
Guru
Posts: 886
Karma: 10113994
Join Date: Feb 2010
Location: Serbia
Device: Kindle PW5 [bricked], Kindle PW1
|
It's probably Kindle software fault. Seen it on Kindle PW1 with activated hidden "text only" mode for PDFs. Can't remember if the problem with garbled text (random spaces breaking up words) was present only for "searchable PDFs" generated directly via Abbyy FineReader, or if it was also the case for FineReader>edits in Word> export as "searchable" PDF also. I do remember that the same "problematic PDF" worked as expected in KOreader while viewed in text reflow mode.
|
06-23-2015, 12:22 PM | #11 |
Junior Member
Posts: 9
Karma: 10
Join Date: Apr 2015
Device: ipad
|
What is KOreader?
|
06-23-2015, 01:45 PM | #12 |
Guru
Posts: 886
Karma: 10113994
Join Date: Feb 2010
Location: Serbia
Device: Kindle PW5 [bricked], Kindle PW1
|
KOReader is an reader app that you can run on top of Kindle's UI after you jailbreak your Kindle . It has much better PDF support, and also supports reading epub. I use it only for reading PDF books about computer programming...
Kindle Voyage (or Kindle PW2 runing fw 5.6.x) can't be jailbreaked without opening it up, so no KOreader for you. I have Kindle PW1. |
06-23-2015, 04:00 PM | #13 |
Junior Member
Posts: 9
Karma: 10
Join Date: Apr 2015
Device: ipad
|
rats, yet another indication I chose poorly with this device.
|
06-23-2015, 04:04 PM | #14 |
eBook Enthusiast
Posts: 85,544
Karma: 93383043
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
|
|
06-23-2015, 04:36 PM | #15 |
Junior Member
Posts: 9
Karma: 10
Join Date: Apr 2015
Device: ipad
|
I am using AABBY.
|
Tags |
kindle, ocr, pdf, voyage |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Converting PDF w/ "Calibre" Problem? | federalbetrayal | Calibre | 4 | 09-28-2010 06:41 PM |
Problem "saving to disk" pdf files | lucone | Calibre | 1 | 06-28-2010 05:29 AM |
New problem with "The Stand" pdf file | crawdad | Sony Reader | 2 | 02-08-2010 11:37 AM |