01-08-2020, 06:50 AM | #1726 |
Connoisseur
Posts: 77
Karma: 2178856
Join Date: Oct 2013
Device: Kobo Clara HD
|
Tried k2pdfopt with pdf-document in gothic script (german Frakturschrift). The results are to small for me for reading on my device Kobo Clara HD (1072 x 1448 resolution). So until now I have to use my tablet for reading it. That is why I am looking for a 'How to' for pdf in gothic script (german Frakturschrift). Did anyone test the k2pdfopt with Frakturschrift and got good results for his device?
Last edited by famfam; 01-08-2020 at 06:54 AM. |
01-08-2020, 09:52 PM | #1727 | |
Fuzzball, the purple cat
Posts: 1,272
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
|
|
01-10-2020, 04:07 PM | #1728 | |
Connoisseur
Posts: 77
Karma: 2178856
Join Date: Oct 2013
Device: Kobo Clara HD
|
Quote:
In the meanwhile I had some succes with the following 2 books in frakturschrift(gothic script): Eduard Bernstein: 1) Die Geschichte der Berliner Arbeiterbewegung Band 2 2) Sozialismus und Demokratie in der Englischen Revolution I found them in the internet (legal sources, copyright out of date, author dead since 1932) I can send you my results from k2pdfopt by pm. I would be happy, if you could find a better way with better results. (How to?) |
|
01-11-2020, 06:27 AM | #1729 | |
Connoisseur
Posts: 77
Karma: 2178856
Join Date: Oct 2013
Device: Kobo Clara HD
|
Quote:
Sorry, did not work -> Your submission could not be processed because a security token was missing. If this occurred unexpectedly, please inform the administrator and describe the action you performed before you received this error. |
|
01-12-2020, 10:50 AM | #1730 | |
Fuzzball, the purple cat
Posts: 1,272
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
|
|
01-14-2020, 03:13 PM | #1731 |
Connoisseur
Posts: 77
Karma: 2178856
Join Date: Oct 2013
Device: Kobo Clara HD
|
@willus
Here are the links: 1) Die Geschichte der Berliner Arbeiterbewegung Band 2 https://archive.org/download/diegesc...01berngoog.pdf or (better) https://archive.org/download/bub_gb_...knAQAAIAAJ.pdf and 2) Sozialismus und Demokratie in der Englischen Revolution https://archive.org/download/bub_gb_...NCAAAAYAAJ.pdf I don't know how to send my results to you as pm. May be that they are too big. But I zipped them in part of 20 mb. But no success. Can I send them to you by email, and could you send me a pm with your email-address please. And sure, I cleaned the originel files from empty or unneedet pages, cropped them and so on. Would be easier for you to work with. Last edited by famfam; 01-16-2020 at 04:53 AM. |
01-14-2020, 10:15 PM | #1732 | |
Fuzzball, the purple cat
Posts: 1,272
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
k2pdfopt -m 0.13in,0.2in,0.13in,0.45in -mode fw -ls- -p 1-25 -o conv_fw.pdf file2.pdf k2pdfopt -m 0.13in,0.2in,0.13in,0.45in -mag 1.5 -p 1-25 -o conv_wrap.pdf file2.pdf The -m option crops out the Google watermark. |
|
01-16-2020, 05:27 AM | #1733 | |
Connoisseur
Posts: 77
Karma: 2178856
Join Date: Oct 2013
Device: Kobo Clara HD
|
Quote:
And it was this version, I hat worked with. I only couldn't find the link to it. Now I found the link in my jpdownloader. https://archive.org/download/bub_gb_...knAQAAIAAJ.pdf I now will test your settings. My procedure is, to clean the files in 'foxit phantom pdf' or in 'adobe acrobat' (cropping is very fast in acrobat). For deleting headers or footers, that are close too to the text, I use the Foxit 'Comment rectangular function'. I also delete all pages (cover, title page, table of contents, dedication, copyright, index), except for the text pages, notes, footnotes, bibliography before editing with k2topdfopt. That is necessary for getting better results. After the k2pdfopt process I add Cover, title page, dedication, copyright. TOC I do manually with Foxit. OCR: Is it better to do ocr with k2pdfop or is it better to do ocr before with Foxit. I think for Frakturschrift I should do it with k2pdfopt and Tesseract traindata. So I did it. So many questions. Why? Because I want to know whether cleaning up with Foxit and cutting with Acrobat will unnecessarily inflate the file and how to get smaller files. Thanks a lot for helping me finding better solutions. |
|
02-05-2020, 05:08 PM | #1734 |
Junior Member
Posts: 2
Karma: 10
Join Date: Feb 2020
Device: none
|
Compiling k2pdfopt on linux
Hi, I've been trying to compile k2pdfopt on linux without success. I got the source from here and I'm following the steps described in the readme file.
First I run Code:
gcc -Wall -Ofast -m64 -o k2pdfopt.o -c k2pdfopt.c Code:
k2pdfopt.c:76:10: fatal error: k2pdfopt.h: No such file or directory 76 | #include <k2pdfopt.h> | ^~~~~~~~~~~~ compilation terminated. Code:
gcc -I./willuslib/ -I./k2pdfoptlib/ -Wall -Ofast -m64 -o k2pdfopt.o -c k2pdfopt.c Then I run the second command: Code:
g++ -Ofast -m64 -o k2pdfopt k2pdfopt.o -static -static-libgcc -static-libstdc++ -lk2pdfopt -lwillus -lgocr -ltesseract -lleptonica -ldjvu -lmupdf -lfreetype -ljbig2 -ljpeglib -lopenjpeg -lpng -lzlib -lpthread -lstdc++ -lc -lm Code:
/usr/sbin/ld: cannot find -lk2pdfopt /usr/sbin/ld: cannot find -lwillus /usr/sbin/ld: cannot find -lgocr /usr/sbin/ld: cannot find -ltesseract /usr/sbin/ld: cannot find -lleptonica /usr/sbin/ld: cannot find -ldjvu /usr/sbin/ld: cannot find -lmupdf /usr/sbin/ld: cannot find -lfreetype /usr/sbin/ld: cannot find -ljbig2 /usr/sbin/ld: cannot find -ljpeglib /usr/sbin/ld: cannot find -lopenjpeg /usr/sbin/ld: cannot find -lpng /usr/sbin/ld: cannot find -lzlib collect2: error: ld returned 1 exit status |
02-10-2020, 12:34 PM | #1735 |
Connoisseur
Posts: 77
Karma: 2178856
Join Date: Oct 2013
Device: Kobo Clara HD
|
How to extend column limit to 5?
Just found a Magazine article with 5 columns (Die Zeit, 30.01.2020, Das Corona Virus).
So it would be great, if we could make it happen, to extend the column limit to 5. Would that be possible without much effort? Thank you so much. |
02-11-2020, 12:18 AM | #1736 |
Fuzzball, the purple cat
Posts: 1,272
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
You should be able to do this in two passes without any modifications. Do the first pass with text re-flow disabled and limit to two columns max (e.g. -mode 2col). Then do another pass with 4 columns max. If you want to send me a link to the document, I can experiment for you.
|
02-11-2020, 12:22 AM | #1737 | |
Fuzzball, the purple cat
Posts: 1,272
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
What version of Linux are you running? Do the i386 linux binaries not work for you? Last edited by willus; 02-11-2020 at 12:26 AM. |
|
02-12-2020, 11:48 AM | #1738 | |
Junior Member
Posts: 2
Karma: 10
Join Date: Feb 2020
Device: none
|
Quote:
I will try again soon and post the results here. |
|
02-12-2020, 03:36 PM | #1739 | |
Fuzzball, the purple cat
Posts: 1,272
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
|
|
02-13-2020, 07:22 AM | #1740 |
Connoisseur
Posts: 77
Karma: 2178856
Join Date: Oct 2013
Device: Kobo Clara HD
|
question to: k2pdfopf gui tesseract ocr
Is tesseract integrated in the k2pdfopt-gui and if so version 3 or version 4? Do the traindata files have to be version 3 or version 4? In which folder must the traindata files be in Windows 10: 'Programs' (for 64 bit) or 'Programs (x86)' (for 32 bit)?
In my k2pdfopt-gui I get the error message, Initializing OCR for 2 threads x x Could not find Tesseract data (env var TESSDATA_PREFIX = (not assigned)). Using GOCR v0.50. What am I doing wrong? Is my entry in the input window 'Env. var: TESSDATA_PREFIX = c: \ program files \ tesseract-ocr \ tessdata 'not correct? |
Tags |
ebook apps, k5 tools, kindle tools, kindle touch, tools |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Viewing PDFs with another font | Font | PocketBook | 4 | 11-12-2010 08:27 AM |
Viewing Textbook PDFs... | NJReader | enTourage Archive | 4 | 08-17-2010 05:17 PM |
PRS-600 Restart bug while viewing PDFs? | conundrum | Sony Reader | 2 | 03-04-2010 08:46 PM |
More on viewing pdfs | dso371 | Bookeen | 8 | 03-11-2008 07:15 PM |
Viewing Untagged PDFs on Palm T|X | Eroica | Reading and Management | 3 | 12-10-2007 01:44 PM |