#1636 | |
Fuzzball, the purple cat
Posts: 1,303
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
|
|
#1637 |
Fuzzball, the purple cat
Posts: 1,303
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
I have posted some beta Windows builds [link removed since release of v2.51]. I'd appreciate it if somebody could try them on a relatively modern PC with Tesseract OCR and let me know (1) whether the OCR works and (2) what the header says (SSE? AVX?).
Last edited by willus; 01-04-2019 at 11:40 PM. Reason: Removed link |
#1638 |
Junior Member
Posts: 6
Karma: 42208
Join Date: Feb 2018
Device: android phone
|
Hello!
I did not expect a new release; otherwise I would have told you about another issue in the source code. The problem happens sometimes when only one element is recognized on a text line (this happens with titles and right-aligned epigraphs): 'wrmap' is then not aligned properly. As a result, the text is detected correctly and the page is formed normally, but when I ask for the back coordinates (the original coordinates on the source image) I get wrong results. This happens because 'wrmap' is malformed during parsing. Check this fix: * https://gitlab.com/axet/android-k2pd...33f1ae7ec17540 |
#1639 |
Junior Member
Posts: 6
Karma: 42208
Join Date: Feb 2018
Device: android phone
|
It would be even better if you added Android logging support:
* https://gitlab.com/axet/android-k2pd...82164e7d85f09b |
#1640 | |
Fuzzball, the purple cat
Posts: 1,303
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
|
|
#1641 | |
Fuzzball, the purple cat
Posts: 1,303
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
|
|
#1642 |
Junior Member
Posts: 6
Karma: 42208
Join Date: Feb 2018
Device: android phone
|
You can choose any PDF with centered or right-aligned text. The one I'm using is a DjVu file (I hope Russian and DjVu are not an issue): https://drive.google.com/open?id=1rH...2pJAFuKN-y9men
Page 2 has the title text and the ISBN number, center and right aligned. When you parse this page and request coordinates for the title (Бэтман Аполло) and ISBN (ISBN 978-5-699-63446 -0) text lines, it returns left-aligned coordinates. If the user clicks on the screen using those coordinates, the text on the source image is not selected properly because the coordinates are incorrect. Conversion settings are the defaults, and screen size = Android screen size; the exact size is not important, any small screen such as 1080x1920 will suffice. |
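To make the symptom described above concrete, here is a minimal Python sketch of mapping a point on the reflowed page back to the source page. It is not k2pdfopt's actual 'wrmap' structure or code: the `RegionMap` record, the `to_source` helper, and all coordinates are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class RegionMap:
    """Hypothetical record: one rectangle copied from the source page to the output page."""
    src_x: int   # left edge of the rectangle on the source page
    src_y: int   # top edge of the rectangle on the source page
    dst_x: int   # left edge of the rectangle on the reflowed page
    dst_y: int   # top edge of the rectangle on the reflowed page
    width: int
    height: int


def to_source(maps: List[RegionMap], x: int, y: int) -> Optional[Tuple[int, int]]:
    """Map a point on the reflowed page back to source-page coordinates."""
    for m in maps:
        inside_x = m.dst_x <= x < m.dst_x + m.width
        inside_y = m.dst_y <= y < m.dst_y + m.height
        if inside_x and inside_y:
            return (m.src_x + (x - m.dst_x), m.src_y + (y - m.dst_y))
    return None  # the point is not covered by any mapped region


# A centered title that starts at x=400 on the source page (made-up numbers).
correct = [RegionMap(src_x=400, src_y=50, dst_x=100, dst_y=20, width=300, height=40)]
# The same title recorded as if it started at the left margin of the source page.
misaligned = [RegionMap(src_x=0, src_y=50, dst_x=100, dst_y=20, width=300, height=40)]

print(to_source(correct, 110, 30))     # lands inside the centered title
print(to_source(misaligned, 110, 30))  # lands at the left margin instead
```

With the misaligned record, a tap on the title maps to the left margin of the source page, which matches the behavior reported for the title and ISBN lines.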
#1643 |
Junior Member
Posts: 6
Karma: 42208
Join Date: Feb 2018
Device: android phone
|
I can give you a visual example of the image regions. You can see that the center- and right-aligned regions are mispositioned; with the fix, everything seems normal.
|
#1644 |
Junior Member
Posts: 6
Karma: 42208
Join Date: Feb 2018
Device: android phone
|
Images attached.
|
#1645 | |
Fuzzball, the purple cat
Posts: 1,303
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
|
|
#1646 | |
Enthusiast
Posts: 27
Karma: 122330
Join Date: Sep 2017
Device: ipad , Kindle PW3
|
Quote:
It is working now. I checked the OCR in all editions (64, 32, 32g). Regards, Mustafa |
|
#1647 |
Fuzzball, the purple cat
Posts: 1,303
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
|
#1648 |
Junior Member
Posts: 9
Karma: 84406
Join Date: Jan 2019
Device: Kindle 5 (2012)
|
What am I doing wrong?
PDF link: https://drive.google.com/file/d/1wuR...ew?usp=sharing Any help is appreciated. |
#1649 | |
Fuzzball, the purple cat
Posts: 1,303
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
Try putting -gtr 0.1 in the "Additional Options" box. This will encourage k2pdfopt to break lines apart more readily. You may have to adjust the number up or down. The default is 0.006; larger values give more encouragement to break the lines apart.
Last edited by willus; 01-03-2019 at 08:44 AM. |
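Since the right -gtr value may take some trial and error, here is a small Python sketch that builds one command line per candidate value so they can be tried in turn. Only the -gtr option and its 0.006 default come from the post above; the executable name `k2pdfopt` on the PATH, the file name `book.pdf`, the candidate values, and the helper names are assumptions for illustration. The script only prints the commands and does not run them.

```python
from typing import List


def build_command(pdf_path: str, gtr: float, exe: str = "k2pdfopt") -> List[str]:
    """Build an argument list that passes one -gtr value to k2pdfopt.

    `exe` is assumed to be the k2pdfopt executable available on the PATH.
    """
    return [exe, "-gtr", f"{gtr:g}", pdf_path]


def sweep(pdf_path: str, values: List[float]) -> List[List[str]]:
    """Return one command per candidate -gtr value, in the order given."""
    return [build_command(pdf_path, v) for v in values]


# 0.006 is the default mentioned above; the other values are arbitrary candidates.
commands = sweep("book.pdf", [0.006, 0.05, 0.1, 0.2])
for cmd in commands:
    print(" ".join(cmd))
```

In the GUI, the equivalent is to type each `-gtr <value>` pair into the "Additional Options" box and compare the converted output.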
|
#1650 |
Fuzzball, the purple cat
Posts: 1,303
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
k2pdfopt v2.51 released
K2pdfopt v2.51 is released. This fixes an issue in v2.50 where the Tesseract OCR would not run on modern PCs and enhances the accuracy of the Tesseract v4.0.0 OCR. See details at the web site.
|
Tags |
ebook apps, k5 tools, kindle tools, kindle touch, tools |