#1
Connoisseur
Posts: 52
Karma: 10
Join Date: Oct 2022
Device: none
Any ideas on how to remove images from a PDF file?
I want to keep the PDF in the same layout and alignment, including the text, but without images. The PDF is not a scanned document; it's basically text with images attached.
I know I can manually edit/remove images in online/desktop editors, but that takes too long for the hundreds of files in my queue. It's discouraging to learn that elements in a PDF aren't as easy to control as in an EPUB. Are there easier ways to remove only the images using calibre or any other software out there? Converting to TXT isn't an option because it destroys the whole structure of the document.
Last edited by tatagi; 02-25-2023 at 09:38 PM.

#2
Connoisseur
Posts: 62
Karma: 221034
Join Date: May 2021
Device: None
Doing a search, I found one program that may work. You can check it out here: VeryPDF PDF to EPUB Converter. It runs from the command line and can ignore images while converting. There's also Adobe Acrobat itself (or other PDF editors), which can be used to remove images, but that has to be done one file at a time. Otherwise you may be stuck doing it manually, as there's no other automagic way (that I've ever personally seen) of stripping images besides converting / saving to pure text.
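One command-line route that may be worth testing on a batch like this is Ghostscript, whose `pdfwrite` device accepts a `-dFILTERIMAGE` switch that skips bitmap images while rewriting a PDF. The Python sketch below only wraps that command. The folder names `pdfs` and `pdfs_no_images` and the helper names are made up for illustration, and the executable is assumed to be called `gs` (on Windows it is usually `gswin64c`). Vector drawings are not affected by this switch, and passing a file through `pdfwrite` can change other details, so try it on copies first.

```python
import subprocess
from pathlib import Path


def strip_images_cmd(src: Path, dst: Path, gs: str = "gs") -> list[str]:
    """Build a Ghostscript command that rewrites src without bitmap images."""
    return [
        gs,
        "-o", str(dst),        # output file; -o also implies -dBATCH -dNOPAUSE
        "-sDEVICE=pdfwrite",   # re-emit the document as a PDF
        "-dFILTERIMAGE",       # leave out bitmap images while rewriting
        str(src),
    ]


def strip_folder(in_dir: Path, out_dir: Path, gs: str = "gs") -> None:
    """Run the command for every *.pdf in in_dir, writing results to out_dir."""
    out_dir.mkdir(parents=True, exist_ok=True)
    for src in sorted(in_dir.glob("*.pdf")):
        subprocess.run(strip_images_cmd(src, out_dir / src.name, gs), check=True)


# Hypothetical usage, assuming Ghostscript is installed and on PATH:
# strip_folder(Path("pdfs"), Path("pdfs_no_images"))
```

The text stays where it was and the image areas are simply left blank, which is the behaviour asked for in the first post.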

#3
Still reading
Posts: 14,016
Karma: 105092227
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
PDF is an end-use format rather than an editable one, so in extreme cases scissors are needed.
Really, this sounds like too much work, and it will only work for some PDFs. I'm curious as to why you want to remove the images?

#4
Deviser
Posts: 2,265
Karma: 2090983
Join Date: Aug 2013
Location: Texas
Device: none
Since Adobe designed PDFs to faithfully and accurately print a document, it has always been a terrible ebook format.
Having said that, it is of course ubiquitous in academic publication databases because of that characteristic, although users nowadays don't routinely print them. They simply view them, which by design should be identical to "print preview".
If you cannot actually delete the images (because: see above), perhaps an acceptable alternative would be to use available utilities to reduce the quality of the images and change them to grayscale. That would drastically reduce the file size of your PDFs, which is often the motivation for removing images.
It is possible that changing the image attributes might disrupt the "flow" of the PDF document that you prize. Only trial-and-error testing of different combinations of settings will tell you whether that is so.
DaltonST
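As a rough sketch of the "reduce quality and go grayscale" idea, Ghostscript's `pdfwrite` device can downsample embedded images and convert colours to gray in one pass. The Python helper below only assembles such a command. The function name and the 72 dpi default are illustrative assumptions, not recommendations, and the executable is assumed to be `gs`. Note that `-sColorConversionStrategy=Gray` converts the whole document, coloured text included, and that images already close to the target resolution may be left as they are.

```python
import subprocess
from pathlib import Path


def shrink_images_cmd(src: Path, dst: Path, dpi: int = 72, gs: str = "gs") -> list[str]:
    """Build a Ghostscript command that grays and downsamples images in src."""
    return [
        gs,
        "-o", str(dst),                      # output file
        "-sDEVICE=pdfwrite",                 # re-emit the document as a PDF
        "-sColorConversionStrategy=Gray",    # convert colours to grayscale
        "-dProcessColorModel=/DeviceGray",
        "-dDownsampleColorImages=true",      # allow resampling of colour images
        f"-dColorImageResolution={dpi}",
        "-dDownsampleGrayImages=true",       # and of images that are already gray
        f"-dGrayImageResolution={dpi}",
        str(src),
    ]


# Hypothetical usage, assuming Ghostscript is installed and on PATH:
# subprocess.run(shrink_images_cmd(Path("in.pdf"), Path("out.pdf")), check=True)
```

Trying two or three `dpi` values on a sample file is the quickest way to see where the images become unusable.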

#5
Still reading
Posts: 14,016
Karma: 105092227
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
Even 4-bit or 1-bit greyscale is possible. 4-bit isn't too bad (14 greys plus black and white), but 1-bit only suits line art and equations. ImageMagick, k2pdfopt or other tools can do it.
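The count of 14 greys plus black and white is simply 2^4 = 16 levels, while 1-bit leaves only black and white. A small Python sketch of the arithmetic follows. The function name is made up, and the levels are assumed to be evenly spaced, which real tools may not do exactly (dithering, for instance, changes the picture considerably).

```python
def quantize_grey(value: int, bits: int) -> int:
    """Snap an 8-bit grey value (0-255) to the nearest of 2**bits even levels."""
    if not 0 <= value <= 255:
        raise ValueError("value must be in 0..255")
    steps = 2 ** bits - 1                # 15 steps for 4-bit, 1 step for 1-bit
    level = round(value * steps / 255)   # index of the nearest level
    return level * 255 // steps          # map the index back to the 0-255 scale


four_bit = sorted({quantize_grey(v, 4) for v in range(256)})
one_bit = sorted({quantize_grey(v, 1) for v in range(256)})

print(len(four_bit), four_bit[0], four_bit[-1])  # prints: 16 0 255
print(one_bit)                                   # prints: [0, 255]
```

With only two levels every mid-tone collapses to black or white, which is why 1-bit works for line art and equations but not for photos.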

#6
Connoisseur
Posts: 52
Karma: 10
Join Date: Oct 2022
Device: none
THANK YOU ALL FOR THE HEADS-UP!
Quote:
Quote:
It is a nice idea to reduce the quality to the last drop instead of simply removing the images. But isn't PDF basically unreflowable anyway? If an image on the page is gone, it is just replaced by empty space and doesn't affect the page layout, IMHO.
Thx. I will try them out. k2pdfopt seems to serve a different purpose (reflowing PDF text to fit a smaller screen by cropping the original pages in half or more), but there are options that fiddle with the output image quality.
Last edited by tatagi; 02-27-2023 at 10:13 PM.

#7
Still reading
Posts: 14,016
Karma: 105092227
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
Quote:
Most PDFs can't be reflowed anyway.

#8
Custom User Title
Posts: 10,971
Karma: 75337983
Join Date: Oct 2018
Location: Canada
Device: Kobo Libra H2O, formerly Aura HD
There are exceptions to this, of course: some books that rely heavily on layout tend to look better as a fixed-layout PDF than as a reflowable EPUB. (There's EPUB3, but calibre doesn't handle those.)

#9
Still reading
Posts: 14,016
Karma: 105092227
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
Quote:
I agree there are kinds of books that either:
Are useless on a small mono screen (big books with photos).
Need a print replica on a large screen (many science & maths textbooks).
Magazines and newspapers are also often unsuitable for small screens.
But PDFs for novels, even with illustrations, are crazy. Trying to perfectly mimic a print edition of fiction, poetry or a play is also crazy; better to simplify.

#10
Custom User Title
Posts: 10,971
Karma: 75337983
Join Date: Oct 2018
Location: Canada
Device: Kobo Libra H2O, formerly Aura HD
I have several books (mixed paper and Kindle-unpacked to PDF) from DK/Dorling Kindersley, and they tend to be layout-heavy.
They can be read on eInk, but it's kind of awkward and there's no colour. I prefer my big monitor.

#11
null operator (he/him)
Posts: 21,722
Karma: 29711016
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Quote:
I can view them with calibre, albeit with loss of colour in some cases; if that's a problem I can use another viewer via the Open (View) With feature. I can edit them with calibre and/or Sigil, and convert to PDF provided I'm prepared to do any necessary tweaks with a PDF editor.
BR
Last edited by BetterRed; 02-28-2023 at 07:52 PM. Reason: add screen shot of editor and viewer

#12
Member
Posts: 12
Karma: 10
Join Date: Jun 2017
Location: In the middle of nowhere in Germany
Device: Android Tablet
With PDF-XChange Editor (free version) you can copy or remove images from PDFs.
Tools > content editing tools > Edit content > images
Then mark and copy or delete an image. I use this feature to copy more than just the recipes from food magazines. Often the images come without text because they are on an extra layer.

#13
Custom User Title
Posts: 10,971
Karma: 75337983
Join Date: Oct 2018
Location: Canada
Device: Kobo Libra H2O, formerly Aura HD
Quote:
How did you convert to PDF? Mine just tend to stall out or produce files that barely work.

#14
Bibliophagist
Posts: 46,181
Karma: 168983734
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
Quote:

#15
null operator (he/him)
Posts: 21,722
Karma: 29711016
Join Date: Mar 2012
Location: Sydney Australia
Device: none
The few I've converted with calibre, mainly for 'fun', are short papers and so-called roadmaps from a local think-tank. They offer DOCX and EPUB3, but to get a PDF you must jump hurdles. I think they must enjoy whinging subscribers.
I have an admin tag that acts as a reminder (via an icon) that a book is best read in another viewer; the shortcut for AZARDI is Shift+Z.
My point was that, for me at least, calibre handles EPUB3 fine; your comment implied 'not at all'.
BR