![]() |
#1216 | |
Fuzzball, the purple cat
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,299
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
-om 0.25 P.S. You can convert a few pages at a time (e.g. A chapter or two) to keep the converted files to reasonable sizes if that helps. Last edited by willus; 12-21-2015 at 05:26 PM. |
|
![]() |
![]() |
![]() |
#1217 | |
Junior Member
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 6
Karma: 15180
Join Date: Dec 2015
Device: Kindle Voyage
|
Quote:
When you convert a PDF in smaller sets, do they add up to a total smaller size? As in, I could convert them in smaller packets and combine them at the end? |
|
![]() |
![]() |
![]() |
#1218 | |
Fuzzball, the purple cat
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,299
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
|
|
![]() |
![]() |
![]() |
#1219 | |
Junior Member
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 6
Karma: 15180
Join Date: Dec 2015
Device: Kindle Voyage
|
Quote:
Why is there such a difference in sizes between the Kindle 1-5 and the Voyage? Shouldn't the file size be smaller when I change the settings, maybe even a quarter of the size like the page on bpc and jpg quality management said? |
|
![]() |
![]() |
![]() |
#1220 | |
Fuzzball, the purple cat
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,299
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
|
|
![]() |
![]() |
![]() |
#1221 |
Junior Member
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 6
Karma: 15180
Join Date: Dec 2015
Device: Kindle Voyage
|
Makes sense. So I'm using the optimization options correctly and I've hit the limit on how much I can shrink this pdf?
|
![]() |
![]() |
![]() |
#1222 | |
Fuzzball, the purple cat
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,299
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
1. Your use of the -bpc option needs a space between "-bpc" and the value, e.g. "-bpc 1" instead of "-bpc1". 2. The -bpc option is not compatible with the -jpg option. If you use the -jpg option to save the bitmaps as JPEG, the -bpc option is ignored. So you need to pick one or the other. I need to make this more clear in the documentation. A JPEG quality factor of 10 seems pretty low, but as long as you find it tolerable, then that's fine. Last edited by willus; 12-29-2015 at 03:50 PM. |
|
![]() |
![]() |
![]() |
#1223 | |
Zealot
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 123
Karma: 18554
Join Date: Jan 2008
Location: Spain
Device: Onyx Boox M96+
|
@willus
Is it possible to user other OCR engines? Even commercial ones. I tried to find a repository where k2pdfopt is stored, but it was impossible to find. Do you use some source code versioning control system? It would be ideal to have it in GitHub. Here's an outdated mirror, but obviously all the history is lost. https://github.com/JohannesBuchner/k2pdfopt Are you planning to update your software to latest Tesseract? 3.04.00 released on Jul 11, 2015 https://github.com/tesseract-ocr/tes...901f361ecd7e90 Here's the langdata https://github.com/tesseract-ocr/lan...f3bf238ee8903d Latest MuPDF is 1.8.1-ios from 15 days ago, I'm not sure if you updated it: https://github.com/ArtifexSoftware/m...61f3815a92a375 Latest Leptonica is 1.7.2 Here's a GitHub unofficial mirror: https://github.com/egorpugin/leptonica Version notes: http://leptonica.com/source/version-notes.html Official version isn't hosted in a repository, the code is here: http://leptonica.com/source/leptonica-1.72.tar.gz I wrote this because KOreader project got unable to use newer Tesseract and Leptonica, you need to update it in your project: https://github.com/koreader/koreader-base/issues/361 They seem to use a wrapper around k2pdfopt to make it a library, or something like that: https://github.com/koreader/libk2pdfopt NOTE: I'm not part of KOreader Team and not a developer at all. Just an user of the software. Here's an historical reference why using GitHub would make other projects to deploy k2pdfopt a lot easier, plus potential contributions from other developers. Quote:
Last edited by timofonic; 01-05-2016 at 05:46 PM. |
|
![]() |
![]() |
![]() |
#1224 | |||||
Fuzzball, the purple cat
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,299
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
Quote:
Quote:
Tesseract v3.04.00 now depends on the ICU library, which has hundreds of source files. I'll have to wade through that at some point and see if it is worth the effort. I'll update to the latest MuPDF on my next k2pdfopt bug-fix release, but I'm not sure when that will be. Quote:
Quote:
![]() Last edited by willus; 01-05-2016 at 10:39 PM. |
|||||
![]() |
![]() |
![]() |
#1225 |
Ex-Helpdesk Junkie
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 19,421
Karma: 85400180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
|
It has occurred to me before, the idea of hosting k2pdfopt on GitHub, but I was loath to suggest it for precisely that reason -- that if you weren't using it already, it's probably because you aren't familiar with VCS.
Really, uploading release snapshots for each commit doesn't do a lot... the main reason people use VCS is to track the evolution of software line by line and change by change, not simply version by version. Seeing not just how stuff changed, but why it changed (commit messages). Experimenting with changes and rolling them back, mixing and matching (the branching model). Stepping through the commits to see where "x" went wrong (git bisect). Running a `diff` on consecutive releases is not significantly different from committing each release tarball. If the dev isn't familiar enough to do any of that, there is little point, I guess. (And I don't blame you for not having the time to spare.) It doesn't even lower the barrier to entry for contributing. (Easy and familiar to open a pull request on GitHub, but who knows the best way to contribute to a tarball?) That requires the repo be maintained by the dev. |
![]() |
![]() |
![]() |
#1226 |
Fuzzball, the purple cat
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,299
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
I don't know if I'd quite phrase it that way (that I'm not familiar with VCS). I know what VCS is, and I've learned enough about git to install it on my PC and try it out, but my old habits die hard. I am admittedly set in my ways and very comfortable and efficient with my own brand of primitive/manual VCS, which mostly involves zip to archive snapshots of code. I've also never had to work closely with a lot of other people on a large software project, which is where VCS is truly a necessity.
|
![]() |
![]() |
![]() |
#1227 |
Zealot
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 123
Karma: 18554
Join Date: Jan 2008
Location: Spain
Device: Onyx Boox M96+
|
@willus
No problem! Keep up your great work! Is this a cowboy project? If so, I hope someday you get some devs to help you. This project is growing a lot! What about vectorization support? Maybe this could be good enough in hi-res scanned books, maybe even detecting patterns (letters) and make a custom font library. I failed to find a vectorization library/program at least as good as the old VectorMagic: - potrace lacks color * color trace does some trick I fail to understand it. - Autotrace has color, but seems worse. - imagetracerjs is a javascript library that vectorices raster images. [b]Misc[/url] http://xmailserver.org/davide.html https://en.wikipedia.org/wiki/Image_tracing https://en.wikipedia.org/wiki/Compar...rsion_software http://www.scale-a-vector.de/svg-test4-e.htm#cr2v http://paperjs.org https://github.com/schicks/pdarff Last edited by timofonic; 01-12-2016 at 03:11 AM. |
![]() |
![]() |
![]() |
#1228 | |
Fuzzball, the purple cat
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,299
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
|
|
![]() |
![]() |
![]() |
#1229 |
The Couch Potato
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 34,509
Karma: 230999999
Join Date: Aug 2015
Device: Kobo Glo, Kobo Touch, Archos 9, Onyx Boox C67ML Carta
|
Many thanks willus, for this great tool! Finally I could read that damn three column pdf on my Kobo after conversion.
![]() Keep up your great work! |
![]() |
![]() |
![]() |
#1230 |
Enthusiast
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 46
Karma: 107690
Join Date: May 2010
Device: Kindle Paperwhite 3, Kobo Aura One
|
Thanks so much for the awesome program, willus!
I'm having one small problem. I like using this with my Nook Glowlight Plus. Same resolution as the Kindle Voyager. But on the nook, it keeps a header and footer at the top and bottom. Which ends up overlapping some of the PDF text. What I want to do is fit to width, but leave space at the top and bottom to accommodate the header and footer. I've figured out I can leave space for the footer by adjusting the height in the device settings, but I can't figure out how to leave space for the header. Is this possible? |
![]() |
![]() |
![]() |
Tags |
ebook apps, k5 tools, kindle tools, kindle touch, tools |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Viewing PDFs with another font | Font | PocketBook | 4 | 11-12-2010 08:27 AM |
Viewing Textbook PDFs... | NJReader | enTourage Archive | 4 | 08-17-2010 05:17 PM |
PRS-600 Restart bug while viewing PDFs? | conundrum | Sony Reader | 2 | 03-04-2010 08:46 PM |
More on viewing pdfs | dso371 | Bookeen | 8 | 03-11-2008 07:15 PM |
Viewing Untagged PDFs on Palm T|X | Eroica | Reading and Management | 3 | 12-10-2007 01:44 PM |