Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Formats > Workshop

Notices

Reply
 
Thread Tools Search this Thread
Old 02-18-2024, 09:40 AM   #1
HalBenHB
Junior Member
HalBenHB began at the beginning.
 
Posts: 1
Karma: 10
Join Date: Feb 2024
Device: Smart phone
Large quantity of high resolution e-book page images

I have 1400+ png files of pages. Images are not scanned, they are probably computer generated because there is no color corruption, distortion etc. They are probably from an official e-book version of a book. They are all in 1519 x 2459 resolution (I believe the book itself has 105 x 170 mm papers).
  • I tried to merge them by Adobe Acrobat Pro, but after merging I couldn't save the final pdf. It somehow broke save function. I tried a couple of time. Try to do it by halves. First 700 pages can saved but second half broke it again.
  • I tried to merge them by Abbyy FineReader, however, no matter which setting I tried, it produced a pdf with inconsistent page sizes (even though all images are in same resolution) and sometimes zoomed in for some pages.
  • I finally merged them by PDF Shaper Professional. It gave me a 2GB+ pdf file with all pages having same size which is 401.9 x 650.6 mm (for comparison, A4 is 210 x 297 mm). Then, I processed this pdf file with Abbyy and convert it to searchable PDF. Then, I compressed it by Acrobat Pro and finally have 250MB+ searchable pdf.
This pdf is still heavy. Abbyy crashes when I try to convert it to epub. MS Word can't process both size and (after splitting the pdf) page size. Kindle and Google Play Books not accepting because of size.

I need to lower its page size to make it lighter. However, every solution I tried produced image only pdfs and deleted OCR data. I can again scan this by Abbyy to make it searchable but I suppose it will have less correct OCR since I lowered the quality of images (pages). I want to have a light and searchable pdf of this book and then further convert it to epub so I can read it easily from my phone. There is no graphics in the pages as far as I saw, except for the first cover page, it is a anthology book for short stories. I'm using Windows 10, the OCR language is Turkish but the book is old and have old words so dictionary based OCRs work more inaccurately. What do you suggest I do?
HalBenHB is offline   Reply With Quote
Old 02-18-2024, 07:25 PM   #2
Turtle91
A Hairy Wizard
Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.Turtle91 ought to be getting tired of karma fortunes by now.
 
Turtle91's Avatar
 
Posts: 3,092
Karma: 18727053
Join Date: Dec 2012
Location: Charleston, SC today
Device: iPhone 11/X/6/iPad 1,2,Air & Air Pro/Surface Pro/Kindle PW & Fire
Assuming there are no copyright issues, just break it up into smaller sections… maybe 200 pages at a time. Merge, OCR, save as epub… then repeat for each of the other sections. Then combine the individual ePubs into a single ePub.
Turtle91 is offline   Reply With Quote
Old 02-22-2024, 09:06 AM   #3
Sirtel
Grand Sorcerer
Sirtel ought to be getting tired of karma fortunes by now.Sirtel ought to be getting tired of karma fortunes by now.Sirtel ought to be getting tired of karma fortunes by now.Sirtel ought to be getting tired of karma fortunes by now.Sirtel ought to be getting tired of karma fortunes by now.Sirtel ought to be getting tired of karma fortunes by now.Sirtel ought to be getting tired of karma fortunes by now.Sirtel ought to be getting tired of karma fortunes by now.Sirtel ought to be getting tired of karma fortunes by now.Sirtel ought to be getting tired of karma fortunes by now.Sirtel ought to be getting tired of karma fortunes by now.
 
Sirtel's Avatar
 
Posts: 10,008
Karma: 224450762
Join Date: Jan 2014
Location: Estonia
Device: Kobo Sage & Libra 2
As you said those images are from an official ebook version, why not just buy that and save yourself all this headache?
Sirtel is offline   Reply With Quote
Old 02-22-2024, 10:18 AM   #4
JSWolf
Resident Curmudgeon
JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.
 
JSWolf's Avatar
 
Posts: 73,817
Karma: 128597114
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
What book is this that you have images of?
JSWolf is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
How to compress pdf file which contains high resolution images. rupeshforu3 PDF 7 12-05-2021 07:59 AM
Choosing waterproof, high resolution ereader grenacia Which one should I buy? 9 05-21-2017 05:50 AM
Is there any way to get Calibre to send the high resolution cover images to device? Arainais Devices 5 08-27-2011 07:38 AM


All times are GMT -4. The time now is 02:49 AM.


MobileRead.com is a privately owned, operated and funded community.