10-26-2015, 08:55 AM | #1201 |
Hmm.
Posts: 124
Karma: 2016606
Join Date: Oct 2015
Device: Android 4.2 Google Play Reader
|
Google books often has PDF files which are just a set of images of scans from an old book. Does this software convert those scanned images (inside the PDF) to text or EPUB? Calibre does this but only with 98% accuracy and Calibre doesn't support ligatures (like "if" next to each other which then becomes one electronic character). So if I have 1,000,000 words total in the book, then I have to find and correct 20,000 words that didn't get identified correctly. And that usually means going back to the PDF to read the actual text and type it in.
|
10-26-2015, 11:09 PM | #1202 | |
Fuzzball, the purple cat
Posts: 1,273
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
PS. Are you sure calibre is doing the OCR and the OCR layer isn't already in the scanned file? As far as I can tell, calibre does not have integrated OCR capability unless you are using it with a third-party tool. If the OCR is in the scanned file, it's probably done with Tesseract already, since Tesseract is supported by Google. Last edited by willus; 10-27-2015 at 08:52 AM. |
|
10-28-2015, 06:33 PM | #1203 |
I need a chapter break
Posts: 4,042
Karma: 56058267
Join Date: Mar 2015
Location: Israel
Device: Kobo Glo
|
I want to say thanks for this excellent tool, now i can read books as pdf in my Kobo in much more pleasant way.
The converting of pdf in Hebrew RtL is wonderful, before i convert the pdf to epub and the result is not satisfying, and the process is exhausting pdf>word>pdf>word>htm>epub (the twice pdf>word is because the text is showing inverted at first). My request is to add an option, to add cover as image file. part of my pdf files don't have a cover, so i convert the image file to pdf with software and merge image-pdf to the book with Simpo PDF Merge. Last edited by oren64; 10-29-2015 at 02:29 PM. Reason: with, image |
10-28-2015, 10:20 PM | #1204 |
Fuzzball, the purple cat
Posts: 1,273
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Thank you. That's a good idea. I'll add it to my feature request list.
|
11-01-2015, 05:04 AM | #1205 |
Junior Member
Posts: 3
Karma: 10
Join Date: Oct 2015
Device: kindle paperwhite3
|
crash on win10,older cpu version can not download
|
11-01-2015, 05:30 AM | #1206 |
Fuzzball, the purple cat
Posts: 1,273
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Can you possibly post or PM me the source PDF file and the options you used to convert it? if your PC was made after 2008, the old CPU version probably won't help. 32 or 64 bit version?
Edit: I just verified the older-CPU version. It downloaded on the first try and runs correctly. Last edited by willus; 11-01-2015 at 09:35 AM. |
11-01-2015, 10:56 AM | #1207 | |
Ex-Helpdesk Junkie
Posts: 19,421
Karma: 85397180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
|
Quote:
|
|
11-12-2015, 02:54 PM | #1208 |
Connoisseur
Posts: 57
Karma: 98196
Join Date: Mar 2015
Location: Israel
Device: Kobo Aura H20
|
I'm using the cbox option quite a bit on complex PDFs. I was wondering why k2pdfopt will ignore any non-cbox'd pages. Is there a reason for this design? Right now I have to use -cbox 0,0 for any other page or page range. Wouldn't it be easier if k2pdfopt will assume that all pages without an explicit cbox should be treated with an implicit cbox 0,0? (This applies to ibox as well).
Last edited by isaacbh; 11-12-2015 at 03:00 PM. |
11-12-2015, 10:28 PM | #1209 | |
Fuzzball, the purple cat
Posts: 1,273
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
|
|
11-12-2015, 11:57 PM | #1210 |
Connoisseur
Posts: 57
Karma: 98196
Join Date: Mar 2015
Location: Israel
Device: Kobo Aura H20
|
Oh right, defaulting ibox to 0,0 is kinda stupid But thanks for considering it for cbox
|
11-14-2015, 12:26 PM | #1211 |
Junior Member
Posts: 1
Karma: 10
Join Date: Nov 2015
Device: Kindle Paperwhite 3 & Kindle Touch
|
Hi. Crop function works wonder. However, I have some problem with re-flow text. I converted a PDF using re-flow (-dev kv -wrap+) and when I read it on Paperwhite 3, I got this. Wonder whether it's on my end or not.
|
11-14-2015, 08:01 PM | #1212 |
Fuzzball, the purple cat
Posts: 1,273
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
That's difficult to diagnose without more information. When you view the PDF on a PC reader, is it also cropped like that? Are you using any other options? Does the conversion look right in the preview window (if you're using Windows)? Can you post (or PM me) the source and converted PDF files?
|
12-16-2015, 08:59 AM | #1213 |
Junior Member
Posts: 6
Karma: 15180
Join Date: Dec 2015
Device: Kindle Voyage
|
can native pdf output alter text size?
I have a large document with tiny text that is tough to read and was hoping that I could make the text larger with k2pdfopt. Can I change the text size of the pdf when using native PDF output? The size of the text in the preview doesn't seem to alter whether I set the DPI to 100 or the 300 my Paperwhite 3 runs at.
|
12-16-2015, 05:03 PM | #1214 | |
Fuzzball, the purple cat
Posts: 1,273
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
Can you post the source PDF or a couple of pages from it, or PM it to me? Also can you send a screen shot of the options you are using? |
|
12-21-2015, 02:52 PM | #1215 |
Junior Member
Posts: 6
Karma: 15180
Join Date: Dec 2015
Device: Kindle Voyage
|
source PDF and screen shot of my k2pdfopt options
The options I'm using are in the attached screencap. I was hoping to make this PDF easier to read on my Kindle Paperwhite 3 without making it much bigger as it looks like it was natively created.
Also, is there an option to add margins in the GUI? |
Tags |
ebook apps, k5 tools, kindle tools, kindle touch, tools |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Viewing PDFs with another font | Font | PocketBook | 4 | 11-12-2010 08:27 AM |
Viewing Textbook PDFs... | NJReader | enTourage Archive | 4 | 08-17-2010 05:17 PM |
PRS-600 Restart bug while viewing PDFs? | conundrum | Sony Reader | 2 | 03-04-2010 08:46 PM |
More on viewing pdfs | dso371 | Bookeen | 8 | 03-11-2008 07:15 PM |
Viewing Untagged PDFs on Palm T|X | Eroica | Reading and Management | 3 | 12-10-2007 01:44 PM |