05-05-2019, 03:01 PM | #1 |
Guru
Posts: 924
Karma: 53902736
Join Date: Jun 2015
Device: multiple
|
Tools to test pdf compatibility and speed?
I variously use k2opt with -mode copy and -dev dx and Ghostscript with -dCompatibilityLevel=1.4 to pre-process pdfs for my Kindle Dx.
I sometimes use k2opt with -mode copy and sometimes -dev dx and then OCRmypdf with -f --output-type pdfa-1 to find readable text on pdfs without it or with corrupt text. I occasionally use PDF to EPUB+, but it can screw up formatting, and it is a hassle if I need to cite page numbers. I used to use Ghostscript with more aggressive options, but it would lose pages. These tools *should* cover compatibility with the Kindle Dx, but Ghostscript sometimes yields very fast pdfs... and sometimes very slow ones that take several minutes to turn a page... forcing me to turn to k2 after all. Is there any easy way to check for compatibility issues, missing or corrupt text, malformed pages, and/or speed? |
05-05-2019, 06:00 PM | #2 | |
Grand Sorcerer
Posts: 5,584
Karma: 22735033
Join Date: Dec 2010
Device: Kindle PW2
|
Quote:
Code:
cpdf in.pdf -o out.pdf Code:
cpdf -l in.pdf -o out.pdf |
|
05-05-2019, 09:07 PM | #3 |
Guru
Posts: 924
Karma: 53902736
Join Date: Jun 2015
Device: multiple
|
Thank you, but without a testing tool, I'd have to run every file through yet another app.
By default cpdf is a command-line tool, and depending how it refers to cpdflin, it may not work in Apple's Automator. I know ocrmypdf does not work in Apple's Automator, because of how it refers to tesseract. Even with drag-and drop, it would be an issue for my repetitive stress injuries. Without drag-and-drop, it isn't suitable for more than a few files a week. And the price is well out of my range. |
05-08-2019, 09:00 PM | #4 | |
Fuzzball, the purple cat
Posts: 1,273
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
|
|
05-09-2019, 07:19 PM | #5 |
Guru
Posts: 924
Karma: 53902736
Join Date: Jun 2015
Device: multiple
|
No, these are ones I'd bought from Bundles of Holding.
I already converted jpx to Kindle-readable formats and removed passwords, etc. It may be an issue with technically-readable but oversized images. Unfortunately trying to compress or remove images occasionally leads to mis-scaled pages, with only the lower left corner showing, missing pages, etc. I use the following /bin/bash shell script in Mac automator, passing input as arguments: for f in "$@" do suffix="-converted.pdf" base=`basename "$f" .pdf` outputfile=$base$suffix /usr/local/bin/gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -sstdout=%sstderr -dNOPAUSE -dQUIET -dBATCH -sOutputFile="$outputfile" "$f" done I know the syntax is different in the regular terminal. |
05-13-2019, 12:35 AM | #6 |
Guru
Posts: 924
Karma: 53902736
Join Date: Jun 2015
Device: multiple
|
Apparently I can use -dFastWebView to linearize w/in Ghostscript. I haven't tested the results on the Kindle yet.
|
05-25-2019, 02:01 PM | #7 |
Guru
Posts: 924
Karma: 53902736
Join Date: Jun 2015
Device: multiple
|
It didn't help. I think it's an issue with excessive images, even when they aren't jpx. But I get erratic results from using GS compression options to deal with the images.
|
06-10-2019, 03:32 PM | #8 |
Guru
Posts: 924
Karma: 53902736
Join Date: Jun 2015
Device: multiple
|
I got what looks like an interesing fix here:
https://softwarerecs.stackexchange.c...ns-of-same-pdf I can't use it because I can't see the code window in wxdemo, because of the blinking cursor. But maybe some of you can use it. |
06-11-2019, 03:55 AM | #9 | |
Grand Sorcerer
Posts: 5,584
Karma: 22735033
Join Date: Dec 2010
Device: Kindle PW2
|
Quote:
IMHO, the tool would be of limited use anyway, unless most of your pdf files have similar page counts, because all it does is log the time it takes to load a .pdf file. Obviously, it'll take longer to load a 200 page document than a 50 page document. Last edited by Doitsu; 06-12-2019 at 04:02 PM. |
|
06-13-2019, 04:37 PM | #10 |
Guru
Posts: 924
Karma: 53902736
Join Date: Jun 2015
Device: multiple
|
Yes, I have mupdf. I got stuck at the wxdemo code window. I probably should uninstall both versions of wxpython if I can't use them.
At this point I am thinking of using ghostscript with: -K268435456 so memory and speed limitations show up. I think this is the total memory of the Kindle Dx. (256 mb x 1048576 b/mb). -sPageList=1-10 so pdf length isn't too much of a problem. and something to measure and output the rendering time, or attach it to the file name. |
06-13-2019, 10:32 PM | #11 |
Guru
Posts: 924
Karma: 53902736
Join Date: Jun 2015
Device: multiple
|
One big difference is that I compress images with k2pdfopt, but due to various bugs, I no longer compress images with ghostscript. If image compression is the biggest difference, then
(1) I can compare the sizes of (compressed) k2pdfopt -ui -mode copy -dev dx -o %_k2opt\ dx with (uncompressed) k2pdfopt -ui -mode copy -o %_k2opt\ copy using the relative sizes as a rough proxy for the relative delays. But I've got some with a 4.0:1 ratio and no trouble, and others with a 2.4:1 ratio and freezes. (2) I can use (uncompressed) k2pdfopt -ui -mode copy -p 1-10 -o %_k2opt\ test using the sizes as a rough proxy for the delays. But I've got some freeze-prone files at the 2.2 MB range and tolerably fast ones at 4.4 MB. |
10-24-2019, 03:41 AM | #12 |
Guru
Posts: 924
Karma: 53902736
Join Date: Jun 2015
Device: multiple
|
I think I finally found a way to pre-check which pdfs are likely to take forever on my Kindle, or other slower devices. [P.S. gtime and gs are gnu-time and ghostscript.]
gtime -p -o input.pdf-timed.txt gs -dNOPAUSE -dQUIET -dBATCH input.pdf In Mac Automator: Application recieves files and folders as input. Run Shell Script. Set to use bash and recieve input as arguments, then: for f in "$@" do suffix="-timed.pdf" base=`basename "$f" .pdf` outputfile=$base$suffix /usr/local/bin/gtime -p -o "$f"-timed.txt /usr/local/bin/gs -dNOPAUSE -dQUIET -dBATCH "$f" done It outputs a txt file listing how long it took for ghostscript to run through the original file. I don't think the suffix, base, and outputfile lines are actually needed. Last edited by MarjaE; 10-24-2019 at 04:15 AM. |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
PW2 App/tools compatibility thread | Aeris | Kindle Developer's Corner | 104 | 12-05-2013 04:51 PM |
Touch Kobo Touch Speed and Wi-Fi Compatibility | Zimbu | Kobo Reader | 5 | 04-07-2012 10:55 PM |
speed test | toronado | Amazon Kindle | 8 | 09-11-2010 07:44 AM |
Kindle 2 3g speed test | readingforfun | Amazon Kindle | 13 | 08-11-2010 09:55 PM |
Test your reading speed ! | shawn99 | Lounge | 7 | 06-08-2008 04:22 PM |