04-20-2013, 05:43 PM | #406 | |
Member
Posts: 11
Karma: 18200
Join Date: Apr 2013
Device: PRESTIGIO PER3464B
|
Quote:
|
|
04-20-2013, 07:30 PM | #407 | |
Fuzzball, the purple cat
Posts: 1,273
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
|
|
04-21-2013, 02:06 AM | #408 |
Connoisseur
Posts: 67
Karma: 2179026
Join Date: Apr 2013
Device: none
|
Looks like a great piece of software, but how is it any better than, say, a built-in reflower or the Send to Kindle PDF conversion that Amazon uses?
|
04-21-2013, 06:31 AM | #409 | |
Fuzzball, the purple cat
Posts: 1,273
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
|
|
04-21-2013, 09:56 AM | #410 | |
Member
Posts: 11
Karma: 18200
Join Date: Apr 2013
Device: PRESTIGIO PER3464B
|
Quote:
|
|
04-21-2013, 10:33 AM | #411 | |
Banned
Posts: 488
Karma: 1080260
Join Date: Sep 2012
Device: sony prs t1 kindle dx ipad
|
Quote:
If, for example, our PDF is an OCR-ed image, we can use the built-in reflower in our e-ink reader, but it works on the OCR layer only, not on the original image. So if the scanned PDF contains a lot of formulas, scientific expressions, odd scripts, etc., there will regrettably be plenty of OCR errors. That is not the case with k2pdfopt, which produces an exact copy of the sentences, reflowed. Last edited by markom; 04-21-2013 at 11:23 AM. |
|
04-21-2013, 11:31 AM | #412 | |
Connoisseur
Posts: 67
Karma: 2179026
Join Date: Apr 2013
Device: none
|
Quote:
It can reflow an image too, OK, I get this, but it doesn't use OCR, so it just chops up the image and reflows the pieces. That sounds pretty cool. I think Amazon's PDF-to-Kindle conversion does something similar, although it's not always effective, and the OCR might always kick in. My main qualm with similar converters and implementations is whether the software can figure out what it can and cannot do effectively. For example, if it can OCR a formula correctly, that is, without seriously messing it up, it should go ahead and OCR it. But if it can tell that the end result is a mess, it should just crop/resize/split the image instead and leave anything it can't deal with as an image. That's the problem I get with most of my technical books: formulas and tables get mangled. |
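The fallback described above (OCR a region only when the result looks trustworthy, otherwise keep it as an image) could be sketched roughly as follows. This is only an illustration of the idea, not how k2pdfopt or Amazon's converter actually works. The `Region` type, the `confidence` score, and the 0.9 threshold are all assumptions made up for the example.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Region:
    """Hypothetical page region; none of these fields come from a real tool."""
    label: str         # name of the region, for illustration only
    ocr_text: str      # text an OCR engine returned for the region
    confidence: float  # assumed OCR confidence score in [0, 1]


def choose_output(regions: List[Region],
                  min_confidence: float = 0.9) -> List[Tuple[str, str]]:
    """Decide per region whether to emit OCR text or keep the cropped image."""
    decisions = []
    for region in regions:
        # Use the OCR result only if there is text and the score is high enough.
        if region.ocr_text.strip() and region.confidence >= min_confidence:
            decisions.append((region.label, "text"))
        else:
            # Anything the OCR cannot handle reliably stays an image.
            decisions.append((region.label, "image"))
    return decisions


page = [
    Region("paragraph", "Consider the integral below.", 0.97),
    Region("formula", "J x2 dx = x3/3", 0.41),
    Region("table", "", 0.0),
]
decisions = choose_output(page)
for label, kind in decisions:
    print(label, "->", kind)
```

The hard part in practice is getting a confidence score that really reflects a garbled formula. The sketch simply assumes one is available.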
|
04-21-2013, 01:55 PM | #413 |
Banned
Posts: 488
Karma: 1080260
Join Date: Sep 2012
Device: sony prs t1 kindle dx ipad
|
Abbyy Finereader 11 has a training mode, where we can teach the program how to interpret special characters.
http://www.youtube.com/watch?v=LAJm3J36tLQ I don't know, though, whether it is possible to force Abbyy to automatically render formulas as images, or at least the whole line if there is a mathematical expression inside it, instead of doing it manually page by page. I still use Abbyy or Acrobat for detailed or quick OCR-ing of k2pdfopt files, even though there is OCR capability within the app now. Last edited by markom; 04-21-2013 at 02:22 PM. |
05-04-2013, 02:49 AM | #414 |
Junior Member
Posts: 5
Karma: 5714
Join Date: Oct 2012
Device: Kindle 4 black
|
Hi willus
Whenever there is some text in the middle of the page between two columns, the output is wrong. Please find the examples attached. Is there any way to fix it? [This Creative Commons licensed magazine is allowed to be attached to a message on MobileRead.] Last edited by pdurrant; 05-06-2013 at 03:42 PM. |
05-04-2013, 10:29 AM | #415 | |
Fuzzball, the purple cat
Posts: 1,273
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
I tried some different options and this is the best I could come up with. It's not perfect, but it did keep the basic flow in the right order.
k2pdfopt -cgr 0.6 -ch 1 -crgh 0.01 magpi.pdf
-cgr 0.6 gets k2pdfopt to look over a wider range than normal for column splits, because the middle text moves the column splits outside of the normal range (0.33). A value of 1.0 scans the full page width for column splits.
-ch 1 reduces the minimum region height for column breaks from 1.5 inches to 1 inch, since the middle text is less than 1.5 inches high.
-crgh 0.01 allows a slightly smaller-than-default (default = 1/72 inch) gap to separate regions of differing column widths, since the vertical gap between columns of different widths is small in this case.
You might also try -col 4, but that may result in "over-columnizing" on other pages. I prefer to avoid it unless the bulk of the document has 3 or more columns.
See also this post with a similar issue.
Last edited by willus; 05-04-2013 at 10:32 AM. |
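For anyone scripting this over several files, here is a minimal sketch that assembles the same command line in Python. It uses only the options named in this post (-cgr, -ch, -crgh) with the values given there. The function name `build_k2pdfopt_command` is made up for the example, and the sketch only builds and prints the command rather than running it, since k2pdfopt may prompt for input when started.

```python
import shlex
from typing import List


def build_k2pdfopt_command(pdf_path: str,
                           cgr: float = 0.6,
                           ch: float = 1,
                           crgh: float = 0.01) -> List[str]:
    """Build the argument list for the command line quoted in this post.

    cgr  : range searched for column splits (0.33 per the post's default)
    ch   : minimum region height for column breaks, in inches
    crgh : gap separating regions of differing column widths, in inches
    """
    return [
        "k2pdfopt",
        "-cgr", format(cgr, "g"),
        "-ch", format(ch, "g"),
        "-crgh", format(crgh, "g"),
        pdf_path,
    ]


command = build_k2pdfopt_command("magpi.pdf")
# Print a shell-ready version of the command for copy and paste.
print(shlex.join(command))
```

Keeping the arguments as a list means the same list can later be handed to `subprocess.run` without any shell quoting problems if the file name contains spaces.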
|
05-05-2013, 04:33 AM | #416 |
Junior Member
Posts: 5
Karma: 5714
Join Date: Oct 2012
Device: Kindle 4 black
|
Yes, the Raspberry Pi is a wonderful, small, low-cost Linux tinkering device.
I converted the magazine with your command-line options and now it looks much better. The middle text is no longer wreaking havoc with the conversion. Many thanks, willus. |
05-05-2013, 09:16 AM | #417 |
Fuzzball, the purple cat
Posts: 1,273
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Glad to help. Hopefully at some point I can make k2pdfopt smart enough to find things like that automatically (without having to resort to precise command-line adjustments).
|
05-06-2013, 04:26 AM | #418 |
Junior Member
Posts: 1
Karma: 10
Join Date: May 2013
Device: prst 2
|
Hi willus, thank you for your efforts.
I'm resizing PDF articles that include tables of numbers, but some tables come out oversized and some do not. I couldn't find a solution with the various options. I attached the PDFs. What do you think? Thanks. [Copyright PDFs deleted. Please do not post copyright material to MobileRead without the explicit permission of the copyright holder.] Last edited by pdurrant; 05-06-2013 at 03:35 PM. |
05-06-2013, 10:13 AM | #419 |
Member
Posts: 20
Karma: 30000
Join Date: Jan 2013
Device: kindle touch 5.3.2
|
Thanks. Very cool.
|
05-06-2013, 09:15 PM | #420 | |
Fuzzball, the purple cat
Posts: 1,273
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
|
|