|04-29-2011, 08:45 AM||#1|
Join Date: Apr 2011
Location: Campinas-SP Brazil
Removing background images from pdf to epub
I need to remove a background image when converting from pdf to epub.
This background image looks like old paper with stains and specks, meant to simulate an old document.
When converting a similar document without this "noisy" background, a 1MB pdf produces an epub of about 500KB. With this background, the epub file size grows to 30MB.
How can I filter out this background? Is there a special option in Calibre for this task, or should I edit the pdf file before conversion? In the latter case, which software can I use to edit the pdf on Ubuntu?
|04-29-2011, 08:55 AM||#2|
Sigil & calibre developer
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
You can use Search and Replace. Use the builder button to the right of the search entry. Find the code where the background is defined and match on it. Leave the replace blank so it is removed. You may need to play with it a bit to get it right.
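To give an idea of what such a search expression might look like: calibre's Search and Replace takes Python-style regular expressions, so a pattern can be tried out in plain Python first. This is only a rough sketch. The sample markup and the file name paper.jpg are made up for illustration, and the real converted output may define the background differently, so expect to adjust the pattern after looking at the actual code.

```python
import re

# Hypothetical sample of converted markup. The real output of a PDF
# conversion may define the background in a different way.
SAMPLE = '''<body style="background-image: url(images/paper.jpg); margin: 0">
<p>Chapter 1</p>
<img src="images/paper.jpg" class="bg"/>
<p>Text</p>
</body>'''

# Matches a CSS background-image declaration, including its trailing semicolon.
BG_DECLARATION = re.compile(
    r'background-image\s*:\s*url\([^)]*\)\s*;?\s*', re.IGNORECASE
)

# Matches an <img> tag that points at the (assumed) background file name.
BG_IMG_TAG = re.compile(
    r'<img\b[^>]*\bsrc="[^"]*paper\.jpg"[^>]*/?>\s*', re.IGNORECASE
)


def strip_background(markup: str) -> str:
    """Remove both kinds of background reference, replacing them with nothing."""
    markup = BG_DECLARATION.sub('', markup)
    return BG_IMG_TAG.sub('', markup)


cleaned = strip_background(SAMPLE)
print(cleaned)
```

Only the pattern strings would go into the search entry, with the replace field left blank as described above.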
|04-29-2011, 09:15 AM||#3|
US Navy, Retired
Join Date: Feb 2009
Location: North Carolina
Device: Nexus 7
|04-29-2011, 09:38 AM||#4|
Join Date: Apr 2011
Device: kindle 3 & sony daily prs950sc
You can select, copy, and paste the entire text into Notepad, WordPad, MS Word, or a similar editor, save it as .txt, .htm, .html, .rtf, or similar, and then convert that to whatever format you want. Alternatively, you could get Adobe Acrobat Professional, remove the image there, and re-save the .pdf, because Acrobat Pro allows edits that the Reader doesn't.
If the text is also part of an image within the .pdf, with no selectable text, then you need Acrobat Pro to run text recognition first so that the text becomes selectable. Then you can do any of the above steps.
|04-29-2011, 11:09 AM||#5|
Well trained by Cats
Join Date: Aug 2009
Location: (The original) Silicon Valley, USA
Device: Astak Pocket Pro, K4NT,Galaxy Tab 2
I will hint at how to add just ONE image and use it on every page (if you must):
background-image: and background-repeat:
More info at http://www.w3.org/TR/CSS2/colors.html
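As a rough illustration of the two properties named above, here is a small Python helper that writes out such a CSS rule. The helper name, the `body` selector, and the image path are all assumptions made for the example, and whether a given reader device honors a body background is not guaranteed.

```python
# Values CSS2 allows for background-repeat (besides "inherit").
ALLOWED_REPEAT = {"repeat", "repeat-x", "repeat-y", "no-repeat"}


def background_rule(image_path: str, repeat: str = "repeat") -> str:
    """Build a CSS rule that reuses ONE image file as the page background.

    Hypothetical helper for illustration only.
    """
    if repeat not in ALLOWED_REPEAT:
        raise ValueError(f"unsupported background-repeat value: {repeat}")
    return (
        "body {\n"
        f'    background-image: url("{image_path}");\n'
        f"    background-repeat: {repeat};\n"
        "}\n"
    )


# "images/paper.jpg" is an assumed path, not one taken from the book.
css = background_rule("images/paper.jpg")
print(css)
```

Because the image is referenced from the stylesheet, the file is stored once in the epub instead of once per page.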