Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Formats > PDF

Notices

Reply
 
Thread Tools Search this Thread
Old 01-08-2020, 06:50 AM   #1726
famfam
Connoisseur
famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.
 
Posts: 77
Karma: 2178856
Join Date: Oct 2013
Device: Kobo Clara HD
Tried k2pdfopt with pdf-document in gothic script (german Frakturschrift). The results are to small for me for reading on my device Kobo Clara HD (1072 x 1448 resolution). So until now I have to use my tablet for reading it. That is why I am looking for a 'How to' for pdf in gothic script (german Frakturschrift). Did anyone test the k2pdfopt with Frakturschrift and got good results for his device?

Last edited by famfam; 01-08-2020 at 06:54 AM.
famfam is offline   Reply With Quote
Old 01-08-2020, 09:52 PM   #1727
willus
Fuzzball, the purple cat
willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.
 
willus's Avatar
 
Posts: 1,272
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
Quote:
Originally Posted by famfam View Post
Tried k2pdfopt with pdf-document in gothic script (german Frakturschrift). The results are to small for me for reading on my device Kobo Clara HD (1072 x 1448 resolution). So until now I have to use my tablet for reading it. That is why I am looking for a 'How to' for pdf in gothic script (german Frakturschrift). Did anyone test the k2pdfopt with Frakturschrift and got good results for his device?
If you can post the source file, or a couple pages from the source file, I can recommend some settings to you. Or PM it to me.
willus is offline   Reply With Quote
Advert
Old 01-10-2020, 04:07 PM   #1728
famfam
Connoisseur
famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.
 
Posts: 77
Karma: 2178856
Join Date: Oct 2013
Device: Kobo Clara HD
Quote:
Originally Posted by willus View Post
If you can post the source file, or a couple pages from the source file, I can recommend some settings to you. Or PM it to me.
You can find the source files (pdf) on archive.org.

In the meanwhile I had some succes with the following 2 books in frakturschrift(gothic script):
Eduard Bernstein:
1) Die Geschichte der Berliner Arbeiterbewegung Band 2
2) Sozialismus und Demokratie in der Englischen Revolution

I found them in the internet (legal sources, copyright out of date, author dead since 1932)

I can send you my results from k2pdfopt by pm. I would be happy, if you could find a better way with better results. (How to?)
famfam is offline   Reply With Quote
Old 01-11-2020, 06:27 AM   #1729
famfam
Connoisseur
famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.
 
Posts: 77
Karma: 2178856
Join Date: Oct 2013
Device: Kobo Clara HD
Quote:
Originally Posted by willus View Post
If you can post the source file, or a couple pages from the source file, I can recommend some settings to you. Or PM it to me.
I try to append the (cropped and cleaned) source files here.

Sorry, did not work ->

Your submission could not be processed because a security token was missing.

If this occurred unexpectedly, please inform the administrator and describe the action you performed before you received this error.
famfam is offline   Reply With Quote
Old 01-12-2020, 10:50 AM   #1730
willus
Fuzzball, the purple cat
willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.
 
willus's Avatar
 
Posts: 1,272
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
Quote:
Originally Posted by famfam View Post
I try to append the (cropped and cleaned) source files here.

Sorry, did not work ->

Your submission could not be processed because a security token was missing.

If this occurred unexpectedly, please inform the administrator and describe the action you performed before you received this error.
The files you are trying to send are probably too big. I'm not sure what the limit is. Is there a link you can send for any of the source files.
willus is offline   Reply With Quote
Advert
Old 01-14-2020, 03:13 PM   #1731
famfam
Connoisseur
famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.
 
Posts: 77
Karma: 2178856
Join Date: Oct 2013
Device: Kobo Clara HD
@willus
Here are the links:

1) Die Geschichte der Berliner Arbeiterbewegung Band 2

https://archive.org/download/diegesc...01berngoog.pdf

or (better)

https://archive.org/download/bub_gb_...knAQAAIAAJ.pdf

and

2) Sozialismus und Demokratie in der Englischen Revolution

https://archive.org/download/bub_gb_...NCAAAAYAAJ.pdf

I don't know how to send my results to you as pm. May be that they are too big. But I zipped them in part of 20 mb. But no success. Can I send them to you by email, and could you send me a pm with your email-address please.
And sure, I cleaned the originel files from empty or unneedet pages, cropped them and so on. Would be easier for you to work with.

Last edited by famfam; 01-16-2020 at 04:53 AM.
famfam is offline   Reply With Quote
Old 01-14-2020, 10:15 PM   #1732
willus
Fuzzball, the purple cat
willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.
 
willus's Avatar
 
Posts: 1,272
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
Quote:
Originally Posted by famfam View Post
@willus
Here are the links ...
I don't know how to send my results to you as pm. May be that they are too big. But I zipped them in part of 20 mb. But no success. Can I send them to you by email, and could you send me a pm with your email-address please.
And sure, I cleaned the originel files from empty or unneedet pages, cropped them and so on. Would be easier for you to work with.
I have no recommendation for the first file. It is so random--every page seems to be formatted differently. The second file is a more conventional book. I attached two conversions--one using "fit width" and one using larger font and word wrap. I just converted the first 25 pages. Here are the commands:

k2pdfopt -m 0.13in,0.2in,0.13in,0.45in -mode fw -ls- -p 1-25 -o conv_fw.pdf file2.pdf

k2pdfopt -m 0.13in,0.2in,0.13in,0.45in -mag 1.5 -p 1-25 -o conv_wrap.pdf file2.pdf


The -m option crops out the Google watermark.
Attached Files
File Type: pdf conv_fw.pdf (1.19 MB, 227 views)
File Type: pdf conv_wrap.pdf (5.05 MB, 198 views)
willus is offline   Reply With Quote
Old 01-16-2020, 05:27 AM   #1733
famfam
Connoisseur
famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.
 
Posts: 77
Karma: 2178856
Join Date: Oct 2013
Device: Kobo Clara HD
Quote:
Originally Posted by willus View Post
I have no recommendation for the first file. It is so random--every page seems to be formatted differently.
I just found a better version of 'Die Geschichte der Berliner Arbeiterbewegung. Teil 2'.
And it was this version, I hat worked with. I only couldn't find the link to it.
Now I found the link in my jpdownloader.

https://archive.org/download/bub_gb_...knAQAAIAAJ.pdf

I now will test your settings.

My procedure is, to clean the files in 'foxit phantom pdf' or in 'adobe acrobat' (cropping is very fast in acrobat). For deleting headers or footers, that are close too to the text, I use the Foxit 'Comment rectangular function'. I also delete all pages (cover, title page, table of contents, dedication, copyright, index), except for the text pages, notes, footnotes, bibliography before editing with k2topdfopt. That is necessary for getting better results. After the k2pdfopt process I add Cover, title page, dedication, copyright. TOC I do manually with Foxit. OCR: Is it better to do ocr with k2pdfop or is it better to do ocr before with Foxit. I think for Frakturschrift I should do it with k2pdfopt and Tesseract traindata. So I did it.

So many questions. Why?
Because I want to know whether cleaning up with Foxit and cutting with Acrobat will unnecessarily inflate the file and how to get smaller files.
Thanks a lot for helping me finding better solutions.
famfam is offline   Reply With Quote
Old 02-05-2020, 05:08 PM   #1734
vds
Junior Member
vds began at the beginning.
 
Posts: 2
Karma: 10
Join Date: Feb 2020
Device: none
Compiling k2pdfopt on linux

Hi, I've been trying to compile k2pdfopt on linux without success. I got the source from here and I'm following the steps described in the readme file.

First I run
Code:
gcc -Wall -Ofast -m64 -o k2pdfopt.o -c k2pdfopt.c
and it gives me the following error:
Code:
k2pdfopt.c:76:10: fatal error: k2pdfopt.h: No such file or directory
   76 | #include <k2pdfopt.h>
      |          ^~~~~~~~~~~~
compilation terminated.
Then I tried adding the other directories to the search path
Code:
gcc -I./willuslib/ -I./k2pdfoptlib/ -Wall -Ofast -m64 -o k2pdfopt.o -c k2pdfopt.c
and this first command runs successfully.
Then I run the second command:
Code:
g++ -Ofast -m64 -o k2pdfopt k2pdfopt.o -static -static-libgcc -static-libstdc++ -lk2pdfopt -lwillus -lgocr -ltesseract -lleptonica -ldjvu -lmupdf -lfreetype -ljbig2 -ljpeglib -lopenjpeg -lpng -lzlib -lpthread -lstdc++ -lc -lm
and it gives the following error:
Code:
/usr/sbin/ld: cannot find -lk2pdfopt
/usr/sbin/ld: cannot find -lwillus
/usr/sbin/ld: cannot find -lgocr
/usr/sbin/ld: cannot find -ltesseract
/usr/sbin/ld: cannot find -lleptonica
/usr/sbin/ld: cannot find -ldjvu
/usr/sbin/ld: cannot find -lmupdf
/usr/sbin/ld: cannot find -lfreetype
/usr/sbin/ld: cannot find -ljbig2
/usr/sbin/ld: cannot find -ljpeglib
/usr/sbin/ld: cannot find -lopenjpeg
/usr/sbin/ld: cannot find -lpng
/usr/sbin/ld: cannot find -lzlib
collect2: error: ld returned 1 exit status
Has anyone compiled it successfully on Linux? What am I doing wrong?
vds is offline   Reply With Quote
Old 02-10-2020, 12:34 PM   #1735
famfam
Connoisseur
famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.
 
Posts: 77
Karma: 2178856
Join Date: Oct 2013
Device: Kobo Clara HD
How to extend column limit to 5?

Just found a Magazine article with 5 columns (Die Zeit, 30.01.2020, Das Corona Virus).
So it would be great, if we could make it happen, to extend the column limit to 5.
Would that be possible without much effort?
Thank you so much.
famfam is offline   Reply With Quote
Old 02-11-2020, 12:18 AM   #1736
willus
Fuzzball, the purple cat
willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.
 
willus's Avatar
 
Posts: 1,272
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
Quote:
Originally Posted by famfam View Post
Just found a Magazine article with 5 columns (Die Zeit, 30.01.2020, Das Corona Virus).
So it would be great, if we could make it happen, to extend the column limit to 5.
Would that be possible without much effort?
Thank you so much.
You should be able to do this in two passes without any modifications. Do the first pass with text re-flow disabled and limit to two columns max (e.g. -mode 2col). Then do another pass with 4 columns max. If you want to send me a link to the document, I can experiment for you.
willus is offline   Reply With Quote
Old 02-11-2020, 12:22 AM   #1737
willus
Fuzzball, the purple cat
willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.
 
willus's Avatar
 
Posts: 1,272
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
Quote:
Originally Posted by vds View Post
Hi, I've been trying to compile k2pdfopt on linux without success. I got the source from here and I'm following the steps described in the readme file.

...

What am I doing wrong?
The archive you linked to does not contain all of the source code for all of the libraries that it needs to create the final binary (except for k2pdfopt and willus--it has the source code for those). You have to build all of those libraries separately and then link to them (e.g. gocr, tesseract, leptonica, djvu, mupdf, etc.).

What version of Linux are you running? Do the i386 linux binaries not work for you?

Last edited by willus; 02-11-2020 at 12:26 AM.
willus is offline   Reply With Quote
Old 02-12-2020, 11:48 AM   #1738
vds
Junior Member
vds began at the beginning.
 
Posts: 2
Karma: 10
Join Date: Feb 2020
Device: none
Quote:
Originally Posted by willus View Post
The archive you linked to does not contain all of the source code for all of the libraries that it needs to create the final binary (except for k2pdfopt and willus--it has the source code for those). You have to build all of those libraries separately and then link to them (e.g. gocr, tesseract, leptonica, djvu, mupdf, etc.).

What version of Linux are you running? Do the i386 linux binaries not work for you?
Thanks for the response. The binaries work fine for me, I'm trying to compile the sources because I had the idea to use emscripten to create a javascript version of k2pdfopt. I'm first trying to compile using gcc before moving to emscripten. I don't even know if it will be possible or if I will be able to do that but looks like something worth trying.

I will try again soon and post the results here.
vds is offline   Reply With Quote
Old 02-12-2020, 03:36 PM   #1739
willus
Fuzzball, the purple cat
willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.willus ought to be getting tired of karma fortunes by now.
 
willus's Avatar
 
Posts: 1,272
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
Quote:
Originally Posted by vds View Post
Thanks for the response. The binaries work fine for me, I'm trying to compile the sources because I had the idea to use emscripten to create a javascript version of k2pdfopt. I'm first trying to compile using gcc before moving to emscripten. I don't even know if it will be possible or if I will be able to do that but looks like something worth trying.

I will try again soon and post the results here.
I think given all of the k2pdfopt dependencies, it will be a small miracle if you get this to work. You might start by modifying the main include file to disable as many of the support libraries as possible (e.g. TESSERACT, DJVU, etc.) just to see if you can get a small subset to work.
willus is offline   Reply With Quote
Old 02-13-2020, 07:22 AM   #1740
famfam
Connoisseur
famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.famfam ought to be getting tired of karma fortunes by now.
 
Posts: 77
Karma: 2178856
Join Date: Oct 2013
Device: Kobo Clara HD
question to: k2pdfopf gui tesseract ocr

Is tesseract integrated in the k2pdfopt-gui and if so version 3 or version 4? Do the traindata files have to be version 3 or version 4? In which folder must the traindata files be in Windows 10: 'Programs' (for 64 bit) or 'Programs (x86)' (for 32 bit)?
In my k2pdfopt-gui I get the error message,

Initializing OCR for 2 threads x x
Could not find Tesseract data (env var TESSDATA_PREFIX = (not assigned)).
Using GOCR v0.50.

What am I doing wrong?
Is my entry in the input window 'Env. var: TESSDATA_PREFIX = c: \ program files \ tesseract-ocr \ tessdata 'not correct?
famfam is offline   Reply With Quote
Reply

Tags
ebook apps, k5 tools, kindle tools, kindle touch, tools

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Viewing PDFs with another font Font PocketBook 4 11-12-2010 08:27 AM
Viewing Textbook PDFs... NJReader enTourage Archive 4 08-17-2010 05:17 PM
PRS-600 Restart bug while viewing PDFs? conundrum Sony Reader 2 03-04-2010 08:46 PM
More on viewing pdfs dso371 Bookeen 8 03-11-2008 07:15 PM
Viewing Untagged PDFs on Palm T|X Eroica Reading and Management 3 12-10-2007 01:44 PM


All times are GMT -4. The time now is 04:42 PM.


MobileRead.com is a privately owned, operated and funded community.