|
|
#1 |
|
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,720
Karma: 1759970
Join Date: Sep 2010
Device: none
|
any way to remove this PDF transform junk
I have this junk at the start of, and throughout, a few short stories. I guess it's because the book has been converted in & out of PDF - any easy way to get rid of it ? it fills a whole page in its raw form, as there are lots of lines with only 1 or 2 letters per line!
[code]</head> <body class="calibre"> <div class="calibre1"> <p class="calibre2"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">PDF Transform</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">PDF Transform</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">Y</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">Y</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">Y</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">er</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">Y</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">er</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">B</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">2</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">B</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">2</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">B</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">.0</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">B</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">.0</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">A</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">A</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">Click here to buy</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">Click here to buy</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">w</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">w</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">w</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">w</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">w . </span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">w</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">A B B YY.com</span> </a> </p> <p class="calibre4"> <a href="http://www.abbyy.com/buy" class="calibre3"> <span class="bold">.A B BYY.com</span> </a> </p> <p class="calibre4"><a href="http://www.abbyy.com/buy" [code] |
|
|
|
|
|
#2 |
|
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,130
Karma: 91256
Join Date: Feb 2008
Location: Germany
Device: Cybook Gen3
|
Calibre has the option to remove arbitrary strings defined by regular expressions. You can find this in the "structure detection"- part of the conversion options, it's called header and footer removal.
|
|
|
|
| Advert | |
|
|
|
|
#3 |
|
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 31,286
Karma: 62000000
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
The string has been varied significantly to discourage easy removal
from conversions done with the Demo version of the program.Sigil and many S&R passes will get most (and still leave broken-apart paragraphs) |
|
|
|
![]() |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Remove PDF DRM | PieOPah | 18 | 05-15-2013 12:15 PM | |
| Still not possible to remove drm from pdf? | Leoric1991 | 1 | 02-06-2010 12:25 PM | |
| Remove Header from PDF | rrosenwald | Calibre | 10 | 08-22-2009 09:36 PM |
| remove pdf margins | Hanselda | Bookeen | 12 | 05-13-2009 09:30 AM |
| Unutterably Silly We need to transform MR into a commercial bank | radioflyertoo | Lounge | 5 | 11-13-2008 02:59 PM |