09-02-2018, 09:30 PM | #61 | |||
Banned
Posts: 666
Karma: 1752814
Join Date: Jan 2008
Device: Sony Reader PRS-505 : Onyx Boox Max : Sony PRS-900 : Onyx Kepler Pro
|
Quote:
Well, thank you for agreeing with me. You just repeated everything I said using different words. Except for the time consuming part. Quote:
Yea, he pretty much demonstrated that he had completely overthought the problem. There's no need for me to write my own pdf extractor when I can use the tools that already exist. That's why it's called a "script". Quote:
Of course, that's probably the whole reason you saw fit to engage me, since you saw your little buddy getting wrecked. I have declined said challenge and it is for my stated reason. You are welcome to think otherwise. |
|||
09-03-2018, 02:46 AM | #62 |
Interested in the matter
Posts: 421
Karma: 426094
Join Date: Dec 2011
Location: Spain, south coast
Device: Pocketbook InkPad 3
|
Boys, you think it's worth arguing with someone who clearly needs a psychiatrist?
|
Advert | |
|
09-03-2018, 10:30 AM | #63 |
Banned
Posts: 666
Karma: 1752814
Join Date: Jan 2008
Device: Sony Reader PRS-505 : Onyx Boox Max : Sony PRS-900 : Onyx Kepler Pro
|
|
09-06-2018, 04:46 PM | #64 |
Guru
Posts: 776
Karma: 2751519
Join Date: Jul 2010
Location: UK
Device: PW2, Nexus7
|
@sealbeater:
With the amount of time you have spent posting to this thread you could've written your wonder-script by now. Seriously, if you really can do such a good job in a short space of time it could be the best financial decision that you could make. A couple of days worth of your time could silence all the doubters and you would be well recompensed. If you don't need the money then think of the satisfaction gained. If you don't need this kind of satisfaction then do it for altruistic reasons as a service to the e-reading community. Don't even give up your day job yet, just take a couple of days vacation and write your script. |
09-06-2018, 05:00 PM | #65 |
eBook Enthusiast
Posts: 85,544
Karma: 93383043
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
|
In my experience, a good OCR tool such as Abbyy FineReader does the best job of producing a decent conversion of a PDF. Generally a lot better than those which attempt to extract text from the PDF itself. Of course no OCR is perfect, and a proofing/editing run through the converted file is essential.
|
Advert | |
|
09-06-2018, 11:35 PM | #66 | ||||
Banned
Posts: 666
Karma: 1752814
Join Date: Jan 2008
Device: Sony Reader PRS-505 : Onyx Boox Max : Sony PRS-900 : Onyx Kepler Pro
|
Quote:
Quote:
Quote:
As for being well recompensed, when you are already in a position of financial security, one requires interest to motivate ones actions, not money. You are assuming i have a couple of days worth of time that can't be better spent doing something else. I have no need to convert pdfs so, what's my motivation? Proving something to you? Please, doubt me. Quote:
No. I've taken a position and I'm standing firm. I could...but I won't. Let the naysayers bark. Last edited by sealbeater; 09-06-2018 at 11:39 PM. Reason: a word |
||||
09-06-2018, 11:37 PM | #67 |
Banned
Posts: 666
Karma: 1752814
Join Date: Jan 2008
Device: Sony Reader PRS-505 : Onyx Boox Max : Sony PRS-900 : Onyx Kepler Pro
|
Have you much experience extracting txt from pdfs? I have ocr to not be as good as extracting text. As I already stated, most pdfs come in two flavors, images of txt and the actual txt itself. The actual txt itself is as good as the pdf source is. Going further, extracting to xml yields so far, the best results when it comes to preserving layout but I haven't played much with converting to Postscript..yet.
|
09-06-2018, 11:44 PM | #68 |
Banned
Posts: 666
Karma: 1752814
Join Date: Jan 2008
Device: Sony Reader PRS-505 : Onyx Boox Max : Sony PRS-900 : Onyx Kepler Pro
|
Sorry, I realized I failed to address this point. First off, the people who could use and would appreciate my script don't need me to write it. Second off, I have no interest in being "altruistic" to the "e-reading community", whatever that is. You guys are like minnows who think you are whales. Better, slaves who think you are free. Do yourself a favor. Remove yourself from the need to concern yourself with e-book formats. You'll find your e-book reading experience to be much smoother.
|
09-07-2018, 04:03 AM | #69 |
Grand Sorcerer
Posts: 7,166
Karma: 63764653
Join Date: Feb 2009
Device: Kobo Glo HD
|
|
09-07-2018, 04:05 AM | #70 |
Wizard
Posts: 3,108
Karma: 60231510
Join Date: Nov 2011
Location: Australia
Device: Kobo Aura H2O, Kindle Oasis, Huwei Ascend Mate 7
|
Everyone knows the theory. Please don't feed!
|
09-07-2018, 06:34 PM | #71 |
Banned
Posts: 666
Karma: 1752814
Join Date: Jan 2008
Device: Sony Reader PRS-505 : Onyx Boox Max : Sony PRS-900 : Onyx Kepler Pro
|
|
09-07-2018, 06:44 PM | #72 | |
Grand Sorcerer
Posts: 7,166
Karma: 63764653
Join Date: Feb 2009
Device: Kobo Glo HD
|
Quote:
Consistency. |
|
09-07-2018, 07:00 PM | #73 | |
Wizard
Posts: 4,742
Karma: 246906703
Join Date: Dec 2011
Location: USA
Device: Oasis 3, Oasis 2, PW3, PW1, KT
|
Quote:
|
|
09-07-2018, 08:14 PM | #74 | |
Banned
Posts: 666
Karma: 1752814
Join Date: Jan 2008
Device: Sony Reader PRS-505 : Onyx Boox Max : Sony PRS-900 : Onyx Kepler Pro
|
Quote:
No assumptions being made, pdfs are either one or the other and I don't disagree, you would have to do a 2 stage run on the pdf to get the best automatic result. However, I've never seen a pdf that had both. |
|
09-07-2018, 08:18 PM | #75 | ||
Banned
Posts: 666
Karma: 1752814
Join Date: Jan 2008
Device: Sony Reader PRS-505 : Onyx Boox Max : Sony PRS-900 : Onyx Kepler Pro
|
Quote:
If I say it would take me 20 minutes, I've already stated in the thread that that was hyperbole representing the effort needed. When someone says: Quote:
Reading comprehension. It would serve you well. |
||
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
PDF in epub? | Floeee | Software | 3 | 10-20-2009 05:52 PM |
PDFTOEPUB BY DNAML- WARNING | mets | News | 0 | 09-21-2009 01:16 PM |
Google releases 1 million public domain books in ePub format | joedevon | News | 25 | 09-02-2009 05:13 PM |