Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre

Notices

Reply
 
Thread Tools Search this Thread
Old 02-25-2023, 09:35 PM   #1
tatagi
Connoisseur
tatagi began at the beginning.
 
Posts: 52
Karma: 10
Join Date: Oct 2022
Device: none
Any ideas on how to remove images from PDF file?

I want to keep the pdf file in same layouts and alignments including texts, but without images. the pdf file is not scanned document, it's basically text with images attached.

I know I can manually edit/remove images in online/desktop editors but it takes too long time for hundreds of my files in the queue.

It's discouraging to know PDF isn't as easy to control elements as EPUB is.

Are there easier ways to removes only images using calibre or any other software out there?


converting to TXT isn't an option because it destorys the whole structure of the document.

Last edited by tatagi; 02-25-2023 at 09:38 PM.
tatagi is offline   Reply With Quote
Old 02-25-2023, 10:51 PM   #2
Foozle
Connoisseur
Foozle ought to be getting tired of karma fortunes by now.Foozle ought to be getting tired of karma fortunes by now.Foozle ought to be getting tired of karma fortunes by now.Foozle ought to be getting tired of karma fortunes by now.Foozle ought to be getting tired of karma fortunes by now.Foozle ought to be getting tired of karma fortunes by now.Foozle ought to be getting tired of karma fortunes by now.Foozle ought to be getting tired of karma fortunes by now.Foozle ought to be getting tired of karma fortunes by now.Foozle ought to be getting tired of karma fortunes by now.Foozle ought to be getting tired of karma fortunes by now.
 
Posts: 62
Karma: 221034
Join Date: May 2021
Device: None
Doing a search, I found one program that may work. You can check it out here: VeryPDF PDF to EPUB Converter. It uses a command line to convert and ignore images. There's also Adobe Acrobat itself (or other PDF editors) that can be used to remove images, but that has to be done one file at a time. Otherwise you may be stuck doing it manually as there's no other automagic way (that I've ever personally seen) of stripping images besides just converting / saving to pure text.
Foozle is offline   Reply With Quote
Advert
Old 02-26-2023, 05:08 AM   #3
Quoth
Still reading
Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.
 
Quoth's Avatar
 
Posts: 14,016
Karma: 105092227
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
PDF is an end-use format rather than editable so in extreme cases scissors are needed.

Really this sounds like too much work and will only work for some PDFs. I'm curious as to why you want to remove the images?
Quoth is offline   Reply With Quote
Old 02-27-2023, 12:13 PM   #4
DaltonST
Deviser
DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.DaltonST ought to be getting tired of karma fortunes by now.
 
DaltonST's Avatar
 
Posts: 2,265
Karma: 2090983
Join Date: Aug 2013
Location: Texas
Device: none
Since Adobe designed PDFs to faithfully and accurately print a document, it has always been a terrible ebook format.

Having said that, it is of course ubiquitous in academic publication databases due to that characteristic, although modern users nowadays don't routinely physically print them. They simply view them, which should be identical to "print preview" by design.

If you cannot actually delete the images (because: see above), perhaps an acceptable alternative would be to use available utilities to reduce the quality of the images, and change them to grayscale.

That would drastically reduce the file size of your PDFs, which is often the motivation to remove the images. It is possible that changing the image attributes might disrupt the "flow" of the PDF document that you prize. Only trial-and-error testing of different combinations of criteria will tell you if that is so.


DaltonST
DaltonST is offline   Reply With Quote
Old 02-27-2023, 12:36 PM   #5
Quoth
Still reading
Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.
 
Quoth's Avatar
 
Posts: 14,016
Karma: 105092227
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
Even 4bit or 1 bit greyscale is possible. The 4 bit isn't too bad (14 greys, black, white), but 1 bit only suits line art and equations. Imagemagick or k2pdfopt or other tools.
Quoth is offline   Reply With Quote
Advert
Old 02-27-2023, 10:09 PM   #6
tatagi
Connoisseur
tatagi began at the beginning.
 
Posts: 52
Karma: 10
Join Date: Oct 2022
Device: none
THANK YOU ALL FOR THE HEAD-UPS!

Quote:
Originally Posted by Foozle View Post
Doing a search, I found one program that may work. You can check it out here: VeryPDF PDF to EPUB Converter. It uses a command line to convert and ignore images. There's also Adobe Acrobat itself (or other PDF editors) that can be used to remove images, but that has to be done one file at a time. Otherwise you may be stuck doing it manually as there's no other automagic way (that I've ever personally seen) of stripping images besides just converting / saving to pure text.
Thx. I tried verypdf but unfortunately it just closes its command prompt window without doing anything. PDF is , as others said, the worst ebook format to convert from. I can confirm this after the attempt to convert pdf to epub using Calibre (source document was column separated magazine document and it was all mixed up as one)


Quote:
Originally Posted by DaltonST View Post
Since Adobe designed PDFs to faithfully and accurately print a document, it has always been a terrible ebook format.

Having said that, it is of course ubiquitous in academic publication databases due to that characteristic, although modern users nowadays don't routinely physically print them. They simply view them, which should be identical to "print preview" by design.

If you cannot actually delete the images (because: see above), perhaps an acceptable alternative would be to use available utilities to reduce the quality of the images, and change them to grayscale.

That would drastically reduce the file size of your PDFs, which is often the motivation to remove the images. It is possible that changing the image attributes might disrupt the "flow" of the PDF document that you prize. Only trial-and-error testing of different combinations of criteria will tell you if that is so.

DaltonST
Thx. you clearly got my point of asking yes it was the file size that I am concerned with, especially for the pdf that contains one hel of a lot of images that I don't need to view. I am usually text-oriented person

It is a nice idea to utterly reduce the quality to the last drop, instead of simply removing them. But Isn't pdf basically unreflowable anyway? If the image in the page is gone, the image is just replaced by empty space and does not affect the page layouts, imho


Quote:
Originally Posted by Quoth View Post
Even 4bit or 1 bit greyscale is possible. The 4 bit isn't too bad (14 greys, black, white), but 1 bit only suits line art and equations. Imagemagick or k2pdfopt or other tools.
Thx. I will try them out. k2pdfopt seems to serve different purpose (reflow pdf text to fit in smaller screen size by cropping the original pages in half or more) but there're options that fiddle with output image quality.

Last edited by tatagi; 02-27-2023 at 10:13 PM.
tatagi is offline   Reply With Quote
Old 02-28-2023, 05:10 AM   #7
Quoth
Still reading
Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.
 
Quoth's Avatar
 
Posts: 14,016
Karma: 105092227
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
Quote:
Originally Posted by tatagi View Post

Thx. I will try them out. k2pdfopt seems to serve different purpose (reflow pdf text to fit in smaller screen size by cropping the original pages in half or more) but there're options that fiddle with output image quality.
Imagemagic and K2pdfopt can crop. K2pdfopt can do stuff with a text layer. But both can leave the page size as is and change from colour to monochrome and reduce to 4 bits or 1bit (4bits is the minimum I would use).

Most PDFs can't be reflowed anyway.
Quoth is offline   Reply With Quote
Old 02-28-2023, 02:52 PM   #8
ownedbycats
Custom User Title
ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.
 
ownedbycats's Avatar
 
Posts: 10,971
Karma: 75337983
Join Date: Oct 2018
Location: Canada
Device: Kobo Libra H2O, formerly Aura HD
Quote:
Originally Posted by DaltonST View Post
Since Adobe designed PDFs to faithfully and accurately print a document, it has always been a terrible ebook format.
There's exceptions to this of course-- some books that rely heavily on layout tend to look better as a fixed-layout pdf than a flow-layout epub. (There's epub3 but Calibre doesn't handle those.)
ownedbycats is offline   Reply With Quote
Old 02-28-2023, 04:50 PM   #9
Quoth
Still reading
Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.Quoth ought to be getting tired of karma fortunes by now.
 
Quoth's Avatar
 
Posts: 14,016
Karma: 105092227
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
Quote:
Originally Posted by ownedbycats View Post
There's exceptions to this of course-- some books that rely heavily on layout tend to look better as a fixed-layout pdf than a flow-layout epub. (There's epub3 but Calibre doesn't handle those.)
Then they are kinds of books that need large screens and are electronic documents, not ebooks.

I agree there are kinds of books that either:
Useless on a small mono screen (Big books with photos).
Need Print replica on large screen (Many science & maths textbooks).
Or Magazines and newspapers are often unsuitable for small screens.

But PDFs for novels, even with illustrations, are crazy. Trying to perfectly mimic a print edition fiction or poetry or play is also crazy, better to simplify.
Quoth is offline   Reply With Quote
Old 02-28-2023, 06:23 PM   #10
ownedbycats
Custom User Title
ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.
 
ownedbycats's Avatar
 
Posts: 10,971
Karma: 75337983
Join Date: Oct 2018
Location: Canada
Device: Kobo Libra H2O, formerly Aura HD
I have several books (mixed paper and Kindle-unpacked to PDF) from DK/Dorling Kindersley and they tend to be layout-heavy:

Click image for larger version

Name:	Rocks and Minerals.png
Views:	83
Size:	481.8 KB
ID:	200017

They can be read on eInk, but it's kind of awkward and no colour. I prefer my big monitor.
ownedbycats is offline   Reply With Quote
Old 02-28-2023, 07:48 PM   #11
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 21,722
Karma: 29711016
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Quote:
Originally Posted by ownedbycats View Post
There's exceptions to this of course-- some books that rely heavily on layout tend to look better as a fixed-layout pdf than a flow-layout epub. (There's epub3 but Calibre doesn't handle those.)
I have hundreds of EPUB3 books in my calibre libraries, amongst which are many fixed layout books. So I'm unsure what you mean by Calibre doesn't handle them.

I can view them with calibre, albeit with loss of colour in some cases, if that's a problem I can use another viewer via the Open (View) With feature. I can edit them with calibre and/or Sigil, and convert to PDF providing I'm prepared to do any necessary tweaks with a PDF editor.

BR
Attached Thumbnails
Click image for larger version

Name:	Screenshot 2023-03-01 114957.jpg
Views:	78
Size:	423.0 KB
ID:	200018  

Last edited by BetterRed; 02-28-2023 at 07:52 PM. Reason: add screen shot of editor and viewer
BetterRed is offline   Reply With Quote
Old 02-28-2023, 07:48 PM   #12
HAL.Unke
Member
HAL.Unke began at the beginning.
 
Posts: 12
Karma: 10
Join Date: Jun 2017
Location: In the middle of nowhere in Germany
Device: Android Tablet
With PDF-Xchange Editor (free version) you can copy or remove images from pdfs.
Tools>content editing tools>Edit content>images
Then mark and copy or delete an image.

I use this feature to copy not only the recipes from food magazines. Often are the images without text because they are an extra layer.
HAL.Unke is offline   Reply With Quote
Old 02-28-2023, 08:09 PM   #13
ownedbycats
Custom User Title
ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.ownedbycats ought to be getting tired of karma fortunes by now.
 
ownedbycats's Avatar
 
Posts: 10,971
Karma: 75337983
Join Date: Oct 2018
Location: Canada
Device: Kobo Libra H2O, formerly Aura HD
Quote:
Originally Posted by BetterRed View Post
I have hundreds of EPUB3 books in my calibre libraries, amongst which are many fixed layout books. So I'm unsure what you mean by Calibre doesn't handle them.

I can view them with calibre, albeit with loss of colour in some cases, if that's a problem I can use another viewer via the Open (View) With feature. I can edit them with calibre and/or Sigil, and convert to PDF providing I'm prepared to do any necessary tweaks with a PDF editor.

BR
Need a third-party viewer to properly view them, mainly.

How did you convert to PDF? Mine just tend to stall out or produce files that barely work.
ownedbycats is offline   Reply With Quote
Old 02-28-2023, 08:25 PM   #14
DNSB
Bibliophagist
DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.
 
DNSB's Avatar
 
Posts: 46,181
Karma: 168983734
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
Quote:
Originally Posted by BetterRed View Post
I can view them with calibre, albeit with loss of colour in some cases, if that's a problem I can use another viewer via the Open (View) With feature. I can edit them with calibre and/or Sigil, and convert to PDF providing I'm prepared to do any necessary tweaks with a PDF editor.
Loss of colour, I could live with. Totally screwed up layouts? That a different kettle of fish. See the attached images for a sample from a FLO epub3 in calibre viewer vs. Sigil's preview vs. Thorium.
Attached Thumbnails
Click image for larger version

Name:	FLO_in_calibre_viewer.jpg
Views:	74
Size:	159.8 KB
ID:	200019   Click image for larger version

Name:	FLO_in_sigil_preview.jpg
Views:	77
Size:	138.3 KB
ID:	200020   Click image for larger version

Name:	FLO_in_thorium.png
Views:	75
Size:	743.0 KB
ID:	200021  
DNSB is offline   Reply With Quote
Old 02-28-2023, 08:30 PM   #15
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 21,722
Karma: 29711016
Join Date: Mar 2012
Location: Sydney Australia
Device: none
The few I've converted with calibre, mainly for 'fun', are short papers and so-called roadmaps from a local ¬think-tank. They offer DOCX and EPUB3, but to get a PDF you must jump hurdles. I think they must enjoy whinging subscribers.

I have an admin-tag that acts as a reminder (via an icon) that a book is best read in another viewer, shortcut for AZARDI is Shift+Z.

My point was, that for me at least calibre handles EPUB3 fine, your comment implied 'not at all'.

BR
BetterRed is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
How to remove images of same size but named differently from a file unkn0wn Editor 3 09-28-2021 11:53 PM
How to remove headers with images in pdf? Princess kindle Calibre 0 08-28-2019 03:34 PM
Remove images from clearscanned PDF hernep PDF 2 06-02-2012 11:34 AM
Remove file path from PDF file DuckDodgers PDF 1 08-13-2006 09:23 AM


All times are GMT -4. The time now is 09:30 AM.


MobileRead.com is a privately owned, operated and funded community.