01-22-2009, 02:28 PM | #1 |
Junior Member
Posts: 8
Karma: 10
Join Date: Sep 2008
Device: Sony PRS-505
|
Conversion from PDF
Hi,
I have a collection of huge PDFs that I would like to be able to read on my PRS-505. The book is this one: http://www.archive.org/download/camb...05actouoft.pdf It's a huge book with broken OCR (if you just copy and paste the words, the result is full of errors). Is there a way to convert it and make it readable? Separately, I have a small collection of Osprey books that I have scanned (I'm a history professor and usually read them on my laptop). Is there a way to keep their images and make them readable on the 505? Thank you Last edited by FranzG; 01-22-2009 at 02:35 PM. |
01-22-2009, 02:54 PM | #2 |
Fool
Posts: 377
Karma: 3557934
Join Date: Feb 2003
Device: Kindle Voyage, Kindle PW1, Kobo Glo HD, Nook Glowlight Plus ...
|
I had trouble converting large PDFs with Kindle.
There are probably lots of tools that work, but the one that seems to do best for me is Mobipocket Reader for the PC. This will convert PDFs to Mobipocket files (of course). From there the Kindle conversion tools will probably do a pretty good job. You may have some trouble finding where Mobipocket Reader puts its converted files; look at the advanced settings to see which directory it is using. |
01-22-2009, 08:53 PM | #3 |
Addict
Posts: 271
Karma: 332
Join Date: Nov 2003
Location: San Francisco, USA
Device: Sage, Elipsa, Oasis, Galaxy Tab 8U, S22U
|
I also recommend Mobipocket Creator for extracting text from PDFs. Before creating a Mobipocket file, it saves the output as an HTML file, which you can then convert to any other format.
|
01-23-2009, 01:46 PM | #4 |
Junior Member
Posts: 8
Karma: 10
Join Date: Sep 2008
Device: Sony PRS-505
|
Thank you, but I need something more involved to read the Osprey books.
In short, I have to cut every page into two halves (top and bottom), then rotate them by 90° to read them on the 505; otherwise the font is too small, and I cannot zoom. Can I do that with calibre? |
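The split described in the post above (cut each page into a top and a bottom half, then turn each half by 90°) comes down to a little rectangle arithmetic. Below is a minimal Python sketch of that geometry only. The `Box` type, the function names, and the 595 x 842 point page size are illustrative assumptions, not anything from calibre or another tool mentioned in this thread; applying the boxes to a real PDF would need a PDF library, which is not shown here.

```python
from typing import NamedTuple, Tuple


class Box(NamedTuple):
    """A page rectangle in PDF points (hypothetical helper type)."""
    left: float
    bottom: float
    right: float
    top: float


def split_top_bottom(page: Box) -> Tuple[Box, Box]:
    """Return (upper half, lower half) of a page rectangle."""
    mid = (page.bottom + page.top) / 2
    upper = Box(page.left, mid, page.right, page.top)
    lower = Box(page.left, page.bottom, page.right, mid)
    return upper, lower


def rotated_size(box: Box) -> Tuple[float, float]:
    """Width and height of a box after a 90 degree rotation (they swap)."""
    width = box.right - box.left
    height = box.top - box.bottom
    return height, width


if __name__ == "__main__":
    # Assumed page size, roughly A4 in points; a real scan may differ.
    page = Box(0, 0, 595, 842)
    upper, lower = split_top_bottom(page)
    print("upper half:", upper)
    print("lower half:", lower)
    print("size of one half after rotating:", rotated_size(upper))
```

Each half would then be used as the crop area for a copy of the page, with the upper half placed before the lower half so the reading order is preserved.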
01-23-2009, 04:56 PM | #5 |
Guru
Posts: 860
Karma: 4380
Join Date: Feb 2008
Location: Almada, Portugal
Device: Cybook Gen3, Sony PRS 505, Kindle DXG and Samsung Galaxy Note
|
Hi
You are right; your example book is quite huge! I have taken the liberty of extracting the first 50 pages and uploading them here. See if it's what you are looking for. |
01-24-2009, 09:38 AM | #6 |
Junior Member
Posts: 8
Karma: 10
Join Date: Sep 2008
Device: Sony PRS-505
|
Thanks DDHarriman, that's great! How did you do it?
|
01-24-2009, 01:31 PM | #7 |
Guru
Posts: 860
Karma: 4380
Join Date: Feb 2008
Location: Almada, Portugal
Device: Cybook Gen3, Sony PRS 505, Kindle DXG and Samsung Galaxy Note
|
Ha… ha...
The book you refer to is a PDF with two layers: the page image on top and the text underneath. The text comes from an OCR pass without proofreading. This kind of result is very common in business archiving environments (workflow systems). A two-layer PDF will not be readable on the Sony (other than by trying to turn it to landscape). To get the results you see here, you have to use a good OCR program: on the market there are Omnipage Pro 16 and Finereader Pro 9. Of these two, only Finereader Pro 9 produces PDF output that reflows on the 505. That's the one I used for your example. Best regards, Last edited by DDHarriman; 01-24-2009 at 01:38 PM. |
01-24-2009, 06:59 PM | #8 |
Wizard
Posts: 2,608
Karma: 3000161
Join Date: Jan 2009
Device: Kindle PW3 (wifi)
|
Hi, DD Harriman
I have many two-column PDF books. I tried using the papercrop software. The end result is a very readable, image-quality file, BUT it's huge. If the original PDF file is about one meg, the papercrop output file will be 18 megs. But you gave me an idea: I could use OCR to turn these images back into text. Could you confirm that this is possible without a scanner, using only the OCR software, or have I misunderstood something? Thanks for your help. Last edited by roger64; 01-24-2009 at 07:00 PM. Reason: typo |
|