#1 |
Enthusiast
Posts: 25
Karma: 14
Join Date: Nov 2007
Device: Sony PRS-505
|
Which program for double column PDFs?
'Lo everyone, just poppin' in again to try to get some help with a book conversion. Or two or three, as the case may be.
So, you brilliant denizens of the Sony board, you: Is there a program that will properly convert two-column PDF files to a readable format? Assuming so, what might it be, and what are the proper settings for this sexy beast? Thank you kindly in advance. |
#2 | |
Wizard
Posts: 3,462
Karma: 10484861
Join Date: May 2006
Device: PocketBook 360, before it was Sony Reader, cassiopeia A-20
|
Quote:
When I needed to extract text from such a PDF file, I used the following procedure:
0. Install Ghostscript (it is needed by GhostView).
1. Install gsview. Installation files for Ghostscript and gsview are at http://pages.cs.wisc.edu/~ghost/gsview/
2. Open the PDF in GhostView and convert it to bitmaps.
3. Run the resulting bunch of bitmaps through an OCR program. I received a decent OCR program bundled with an HP multifunction printer/scanner/fax that my company purchased.
I know. It is complicated. I just needed the text for work-related purposes and I did not like the prospect of typing it, or copying and pasting small chunks of text from Acrobat Reader, and those were the tools I had at hand in a hurry ;-) |
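For anyone who would rather script step 2 than click through the GUI, here is a minimal sketch that calls Ghostscript directly to render each PDF page to a bitmap. It is not the exact procedure described above, only an approximation of it. The function names, the `page-%03d.png` naming, the `pnggray` device and the 300 dpi default are my own assumptions; the executable is usually `gs` on Linux/Mac and `gswin32c` or `gswin64c` on Windows, so adjust `gs_exe` accordingly.

```python
import os
import subprocess
from typing import List


def build_gs_command(pdf_path: str, out_dir: str, dpi: int = 300,
                     gs_exe: str = "gs") -> List[str]:
    """Build a Ghostscript command that renders every PDF page to a grayscale PNG.

    The output naming and the default resolution are illustrative choices,
    not something prescribed by the procedure in the post.
    """
    if dpi <= 0:
        raise ValueError("dpi must be positive")
    # Ghostscript expands %03d to the page number: page-001.png, page-002.png, ...
    out_pattern = os.path.join(out_dir, "page-%03d.png")
    return [
        gs_exe,
        "-dSAFER",            # restrict file access while interpreting the PDF
        "-dBATCH",            # exit when the job is finished
        "-dNOPAUSE",          # do not wait for a key press between pages
        "-sDEVICE=pnggray",   # grayscale PNG is usually enough for OCR
        f"-r{dpi}",           # rendering resolution in dots per inch
        f"-sOutputFile={out_pattern}",
        pdf_path,
    ]


def rasterize(pdf_path: str, out_dir: str, dpi: int = 300,
              gs_exe: str = "gs") -> None:
    """Create out_dir if needed and run Ghostscript; raises if gs fails."""
    os.makedirs(out_dir, exist_ok=True)
    subprocess.run(build_gs_command(pdf_path, out_dir, dpi, gs_exe), check=True)
```

Calling `rasterize("book.pdf", "pages")` should leave one PNG per page in the `pages` folder, which you can then feed to whatever OCR program you have (step 3). Higher `dpi` values tend to help OCR accuracy at the cost of larger files.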
|
#3 |
Enthusiast
Posts: 25
Karma: 14
Join Date: Nov 2007
Device: Sony PRS-505
|
Wow.
...huh. Complicated is right. I hope there's a better way, because if there's not, I'm thinking I may just have to re-evaluate how vital it is that I read these books.
Thanks muchly for the reply, but of course if anyone knows of an easier way, that'd be great. |
#4 |
Guru
Posts: 834
Karma: 102419
Join Date: Sep 2007
Location: Vienna, Austria
Device: iPhone
|
copy paste, column for column, page for page.
That's what I did when I got a PDF "proof" of a book a friend of mine wrote, and I wanted to convert it to mobipocket. Boring, dull work. |
#5 |
Wizard
Posts: 3,462
Karma: 10484861
Join Date: May 2006
Device: PocketBook 360, before it was Sony Reader, cassiopeia A-20
|
Only the first book is complicated.
Once you have the tools installed, and you know which bitmap format to export the PDF to, it is about 3 minutes of your work per book. The conversion, of course, might need more than 5 minutes of *computer* time, but you can do something else in the meantime. Like check mobileread.com to see what is new, or, even better, read some book ;-)
EDIT: this also works for tables and other text with a much more complicated layout than just standard multi-column prose. I once extracted a complicated table spanning many pages from a PDF this way.
Last edited by kacir; 02-21-2008 at 05:58 AM. |
#6 |
Connoisseur
Posts: 80
Karma: 204
Join Date: Jun 2007
Device: Sony Librie, Irex DR1000S
|
Fortunately it is very easy with the right tool (it treats the PDF pages as images, so there is no way to enlarge the font later).
Use the program pdflrf with the double column setting (from the GUI or from the command line). The program is available at https://www.mobileread.com/forums/showthread.php?t=13135 |
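To give a feel for what an image-based double column mode has to do, here is a small sketch of the general idea: cut each page image into a left and a right half and show them in reading order. This is only an illustration, not pdflrf's actual algorithm or options. The function names and the `overlap` parameter are hypothetical, and a plain split down the middle is an assumption that will not suit every layout.

```python
from typing import Iterable, Iterator, List, Tuple

# A crop box in pixels: (left, top, right, bottom)
Box = Tuple[int, int, int, int]


def split_two_columns(width: int, height: int, overlap: int = 0) -> List[Box]:
    """Return crop boxes for the left and right halves of a page, in reading order.

    `overlap` widens each half past the midline by that many pixels, which can
    help when the gutter between columns is not exactly centered.
    """
    if width <= 0 or height <= 0:
        raise ValueError("page dimensions must be positive")
    if overlap < 0 or overlap > width // 2:
        raise ValueError("overlap must be between 0 and half the page width")
    mid = width // 2
    left = (0, 0, mid + overlap, height)
    right = (mid - overlap, 0, width, height)
    return [left, right]


def reading_order(page_sizes: Iterable[Tuple[int, int]],
                  overlap: int = 0) -> Iterator[Tuple[int, Box]]:
    """Yield (page_index, box) pairs: left column then right column, page by page."""
    for index, (width, height) in enumerate(page_sizes):
        for box in split_two_columns(width, height, overlap):
            yield index, box
```

For a 1000 x 1400 pixel page, `split_two_columns(1000, 1400)` gives `[(0, 0, 500, 1400), (500, 0, 1000, 1400)]`. Each box can then be cropped out of the page bitmap with any imaging tool and shown as its own screen on the reader.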
#7 |
creator of calibre
Posts: 45,207
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
I'm working on greatly improving the PDF conversion in libprs500. It should handle multicolumn text, when it's done.
|
#8 |
Enthusiast
Posts: 25
Karma: 14
Join Date: Nov 2007
Device: Sony PRS-505
|
Godel,
Thank you! I'll give that a try, and see how things turn out. I'm sure it'll look great. If not, I'll wait for Kovid to update libPRS500, and just... read something else in the meantime. Thanks to you both, and I appreciate the help very much. *is looking forward to Kovid's update* |
#9 |
Connoisseur
Posts: 59
Karma: 97
Join Date: Oct 2007
Location: New Jersey
Device: Sony PRS-500
|
Hello ... I have been able to convert two-column pages (like the Bible) with PDFLRF. It has an option called '2 column' and it works perfectly.
Give it a try. |
#10 | |
Groupie
Posts: 182
Karma: 1078201
Join Date: Sep 2007
Device: iPad Air 2
|
Quote:
http://www.valueinvestorinsight.com/...-Issue_105.pdf
Probably asking a lot trying to convert that file, but there's no harm in asking. Regardless, I'm greatly looking forward to the update. |
|
#11 |
creator of calibre
Posts: 45,207
Karma: 27110894
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
Tables are a no can do. And no, that PDF is way too complex.
|
#12 |
Resident Curmudgeon
Posts: 79,220
Karma: 145488788
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Your best bet at the moment is to give pdflrf a try and see how it works.
|
#13 |
Enthusiast
Posts: 31
Karma: 10
Join Date: Apr 2007
Device: EBW-1150
|
I use Mobipocket Creator to make HTML from PDF. I have found that it makes some of the best conversions possible. Maybe you should give it a try.
|
#14 |
Wizard
Posts: 1,258
Karma: 3439432
Join Date: Feb 2008
Device: Amazon Kindle Paperwhite (300ppi), Samsung Galaxy Book 12
|
There are specialty tools which will convert a .pdf to .rtf, which can then be loaded into Word and cleaned up. SolidPDF is one w/ which I've had good success.
William |