Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Formats > PDF

Notices

Reply
 
Thread Tools Search this Thread
Old 05-03-2025, 10:24 AM   #2116
nmyshkin
Connoisseur
nmyshkin actually enjoys Vogon poetry.nmyshkin actually enjoys Vogon poetry.nmyshkin actually enjoys Vogon poetry.nmyshkin actually enjoys Vogon poetry.nmyshkin actually enjoys Vogon poetry.nmyshkin actually enjoys Vogon poetry.nmyshkin actually enjoys Vogon poetry.nmyshkin actually enjoys Vogon poetry.nmyshkin actually enjoys Vogon poetry.nmyshkin actually enjoys Vogon poetry.nmyshkin actually enjoys Vogon poetry.
 
nmyshkin's Avatar
 
Posts: 59
Karma: 56675
Join Date: Nov 2021
Device: Nook Simple Touch (3), Nook Simple Touch w/Glowlight (3)
Quote:
Originally Posted by willus View Post
This CPU came out in 2013. It has a Geekbench 6 score of 170 (single threaded). Kudos to you for getting that much mileage out of your computer but wow that is a low score. Modern CPUs are 20x faster single threaded and 100x faster multithreaded. You might really enjoy a newer system.
It will have to happen someday, but the thought of decrapifying yet another bloatware multimedia extravaganza until it's actually a functional computer again is not that appealing at my age. Speed would be nice when working on a dictionary database or re-encoding a video file, I admit. But speed is not everything.
nmyshkin is offline   Reply With Quote
Old 05-03-2025, 01:05 PM   #2117
willus
Fuzzball, the purple cat
willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.
 
willus's Avatar
 
Posts: 1,299
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
Amen to the decrapifying, though I don't think it will be quite as bad as you might think. Just uninstall, uninstall, uninstall.
willus is offline   Reply With Quote
Old 05-16-2025, 07:34 PM   #2118
dhdurgee
Guru
dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.
 
Posts: 881
Karma: 2580688
Join Date: Jun 2010
Device: K3W, PW4
I have used k2pdfopt to convert a PDF file for use on my PW4. I used the -ocr option with it as the original PDF did not include any text for searching.

I am now wondering if it would be possible to extract the ocr text from the k2pdfopt output file and use it as the starting point to create a text version of the file to create an azw3 file.

Is this possible with k2pdfopt or another pdf tool?

Dave
dhdurgee is offline   Reply With Quote
Old 05-16-2025, 08:07 PM   #2119
willus
Fuzzball, the purple cat
willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.
 
willus's Avatar
 
Posts: 1,299
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
Quote:
Originally Posted by dhdurgee View Post
I am now wondering if it would be possible to extract the ocr text from the k2pdfopt output file and use it as the starting point to create a text version of the file to create an azw3 file.

Is this possible with k2pdfopt or another pdf tool?
You can extract the OCR text in UTF-8 format using the -ocrout <file> option. See the command-line usage. You might take a look at my PDF conversion tips page, though it's a bit stale.
willus is offline   Reply With Quote
Old 05-16-2025, 11:47 PM   #2120
dhdurgee
Guru
dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.
 
Posts: 881
Karma: 2580688
Join Date: Jun 2010
Device: K3W, PW4
Quote:
Originally Posted by willus View Post
You can extract the OCR text in UTF-8 format using the -ocrout <file> option. See the command-line usage. You might take a look at my PDF conversion tips page, though it's a bit stale.
I added the -ocrout option and it worked as expected.

Thank you for your assistance.

Dave

Last edited by dhdurgee; 05-16-2025 at 11:50 PM.
dhdurgee is offline   Reply With Quote
Old 05-19-2025, 02:40 PM   #2121
stillstill
Junior Member
stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.
 
Posts: 6
Karma: 105302
Join Date: Aug 2021
Device: Kindle PW2
Font size and alignment changes before horizontal line

Hello.

I have faced this weird issue first time. If there is a thin horizontal line (it is located just before the footnotes begin, I cannot eliminate it since its position varies on pages) the text that is above that line is rendered into so microscopic font, that is comparable with the line width... Is there a way to remediate from this?... Moreover text alignment switches from justified to left for the entire chapter where there at the end that horizontal line is.

Any idea highly appreciated.

Attaching screenshots and pdf to illustrate this.
Attached Thumbnails
Click image for larger version

Name:	source.png
Views:	21
Size:	859.8 KB
ID:	215761   Click image for larger version

Name:	output.png
Views:	26
Size:	711.9 KB
ID:	215762  
Attached Files
File Type: pdf pdf_source.pdf (487.1 KB, 20 views)
File Type: pdf pdf_output.pdf (855.0 KB, 17 views)
stillstill is offline   Reply With Quote
Old 05-20-2025, 02:16 PM   #2122
willus
Fuzzball, the purple cat
willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.
 
willus's Avatar
 
Posts: 1,299
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
Quote:
Originally Posted by stillstill View Post
If there is a thin horizontal line (it is located just before the footnotes begin, I cannot eliminate it since its position varies on pages) the text that is above that line is rendered into so microscopic font, that is comparable with the line width... Is there a way to remediate from this?... Moreover text alignment switches from justified to left for the entire chapter where there at the end that horizontal line is.

Any idea highly appreciated.
Thank you for including screen shots and especially for including a source PDF. That makes it much easier to suggest things. There are a couple options.

1. Erase the line using the -ehl option: k2pdfopt -ehl 1 ... source.pdf

2. Set a minimum text row height using the -rhmin option: k2pdfopt -rhmin 1 ... source.pdf

Either one effectively removes the horizontal line from the output. BTW I used -ml 0.2 -mr 0.2 to get rid of the < > marks on the source pages, as you probably did also.
willus is offline   Reply With Quote
Old 05-21-2025, 07:10 AM   #2123
stillstill
Junior Member
stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.stillstill shapes the world with his or her thoughts.
 
Posts: 6
Karma: 105302
Join Date: Aug 2021
Device: Kindle PW2
Thank you for both methods. Both of them perfectly eliminated horizontal line and likely in this way helped to workaround the small font issue (the line presence alone in the output was not an issue). The sudden alignment change still remained. Text alignment changed from "justified" to "left aligned" for the entire source page where there at the end is that horizontal line. Sometimes it affects not the entire source page but the chapter on the page before horizontal line. This remediated effectively by "-j +" which worked flawlessly.

Thank you so much for your really brilliant Application development and such a dedication to the support.



Quote:
Originally Posted by willus View Post
Thank you for including screen shots and especially for including a source PDF. That makes it much easier to suggest things. There are a couple options.

1. Erase the line using the -ehl option: k2pdfopt -ehl 1 ... source.pdf

2. Set a minimum text row height using the -rhmin option: k2pdfopt -rhmin 1 ... source.pdf

Either one effectively removes the horizontal line from the output. BTW I used -ml 0.2 -mr 0.2 to get rid of the < > marks on the source pages, as you probably did also.
stillstill is offline   Reply With Quote
Old 05-23-2025, 02:00 PM   #2124
dhdurgee
Guru
dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.
 
Posts: 881
Karma: 2580688
Join Date: Jun 2010
Device: K3W, PW4
After I got things sorted out with your tool I had yet another thought occur to me. I passed the original scanned image document along with some guidance to the Grok AI engine and asked it to do the OCR for me detecting the images and inserting placeholders for them in the output. This was after trying to get it to do a full conversion to an epub format, which it turned out was beyond its capabilities. It was, however, able to give me the files comprising the epub piecemeal so that I could assemble the epub manually.

I hit some issues with limitations of grok.com and had to do the processing in pieces, but I now have a final epub of that document with images that is about 1/3 the size of your optimized output.

I understand that the Grok API is available to developers and can tell you that its OCR capabilities were great. It might be worth your time to investigate if it could be useful to you in a later release of your tool.

Dave
dhdurgee is offline   Reply With Quote
Old 05-23-2025, 10:12 PM   #2125
willus
Fuzzball, the purple cat
willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.
 
willus's Avatar
 
Posts: 1,299
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
Quote:
Originally Posted by dhdurgee View Post
I understand that the Grok API is available to developers and can tell you that its OCR capabilities were great. It might be worth your time to investigate if it could be useful to you in a later release of your tool.
That is interesting information. Thanks. I may look into it at some point, but I'm a bit of a dinosaur when it comes to AI.
willus is offline   Reply With Quote
Old 05-23-2025, 11:21 PM   #2126
dhdurgee
Guru
dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.dhdurgee ought to be getting tired of karma fortunes by now.
 
Posts: 881
Karma: 2580688
Join Date: Jun 2010
Device: K3W, PW4
Quote:
Originally Posted by willus View Post
That is interesting information. Thanks. I may look into it at some point, but I'm a bit of a dinosaur when it comes to AI.
I'm new to it as well, but it is simple enough to go to grok.com and ask about something. I have also had it provide me with recipes meeting certain requirements or with particular ingredients to use up left overs.

Its even simpler to use than a search engine and produces more understandable results in most cases.

Give it a try and see.

Dave
dhdurgee is offline   Reply With Quote
Old 05-24-2025, 09:23 PM   #2127
DNSB
Bibliophagist
DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.
 
DNSB's Avatar
 
Posts: 45,184
Karma: 168808723
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
My experience with AI and recipes has not been that great. I was curious as to what it would give me for a couple of recipes. I couldn't bring myself to try a "mild" curry recipe that includes 4 tablespoons of 7 Pod Jonah pepper powder. One of the other recipes it supplied was basically a quote of Guy Fieri's Cheddar Trans-Porter Soup recipe.

I worked in IT in an education setting for years. One of the teachers whom I still chat with is still being surprised by the number of students whose idea of writing a program is to ask an AI to write it for them and then never bother to try using the program. He has a form now that basically says "The AI's first error occurred at line ___ in the source code and since you used AI to write the program, your mark is 0. A second occurrence will result in your final mark for the course being 0".
DNSB is offline   Reply With Quote
Reply

Tags
ebook apps, k5 tools, kindle tools, kindle touch, tools


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Viewing PDFs with another font Font PocketBook 4 11-12-2010 08:27 AM
Viewing Textbook PDFs... NJReader enTourage Archive 4 08-17-2010 05:17 PM
PRS-600 Restart bug while viewing PDFs? conundrum Sony Reader 2 03-04-2010 08:46 PM
More on viewing pdfs dso371 Bookeen 8 03-11-2008 07:15 PM
Viewing Untagged PDFs on Palm T|X Eroica Reading and Management 3 12-10-2007 01:44 PM


All times are GMT -4. The time now is 09:11 PM.


MobileRead.com is a privately owned, operated and funded community.