04-04-2009, 05:17 PM | #1 |
Enthusiast
Posts: 26
Karma: 10
Join Date: Mar 2009
Device: PRS-700
|
Created A Document with Adobe Acrobat, DRMed?
Basically I just scanned a few documents and I combined it with Adobe Acrobat. When I go to Secure >> Manage Securities, it shows as No Security. However when I drag the PDF into calibre and I try to convert to LRF, it tells me its DRM protected. =/
A second question, is there anyway to remove the grey color background from a scanned document? It's a waste of ink and it shows up in my reader. Do I have to do that in photoshop or something? Last edited by ooo; 04-04-2009 at 05:20 PM. |
04-05-2009, 10:55 PM | #2 | ||
Wizard
Posts: 1,213
Karma: 12890
Join Date: Feb 2009
Location: Amherst, Massachusetts, USA
Device: Sony PRS-505
|
Quote:
PDFLRF Rasterfarian PDFread SoPDF PaperCrop Quote:
|
||
Advert | |
|
04-05-2009, 11:05 PM | #3 |
Enthusiast
Posts: 26
Karma: 10
Join Date: Mar 2009
Device: PRS-700
|
I think I was just hoping the reflow would be better after I converted the PDF to LRF. It seems that all my PDFs created with Adobe Acrobat are some how DRM protected although it doesn't say it has any securities on it. Strange.
|
04-05-2009, 11:14 PM | #4 |
Wizard
Posts: 1,213
Karma: 12890
Join Date: Feb 2009
Location: Amherst, Massachusetts, USA
Device: Sony PRS-505
|
Neither calibre, nor your Sony's internal PDF viewer, nor anything else can "reflow" an image-based PDF, which is typically what you get after scanning. Converting to LRF won't change anything either.
Acrobat and other software could "OCR" the scans to convert them from images to text, and then you could convert it it to RTF or HTML, etc., and put that into Calibre. Whether it'll work out well depends on a lot of factors, including the quality of the source, whether it's plain text, or has a lot of (mathematical or other) symbols, etc., and the quality of the OCR routine. |
04-06-2009, 02:26 PM | #5 |
Enthusiast
Posts: 26
Karma: 10
Join Date: Mar 2009
Device: PRS-700
|
I did the OCR conversion and then I tried copying and pasting the text. It seemed to go a good job. Any clues as to why Adobe DRM protects the documents it generates, even though it clearly states "no security"? I'll attach a sample later tonight when I get home.
|
Advert | |
|
04-12-2009, 11:23 AM | #6 |
Resident Curmudgeon
Posts: 73,983
Karma: 128903378
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Converting your scans to PDF was a really bad idea. The best thing you can do is delete the PDFs and start over and use a good OCR to get the scans to text and then create a proper reflowing document. Starting from PDF is going to give you no end of errors.
|
04-12-2009, 01:55 PM | #7 |
Wizard
Posts: 1,213
Karma: 12890
Join Date: Feb 2009
Location: Amherst, Massachusetts, USA
Device: Sony PRS-505
|
A lot of scanners/scanning software return PDF as their native format. If you're going to OCR it, what's wrong with starting in PDF? How would anything else be any better?
|
04-12-2009, 02:10 PM | #8 |
frumious Bandersnatch
Posts: 7,516
Karma: 18512745
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
|
PDF probably has the images encoded in some low-quality format. The quality is good enough for a human reading it, but not for an OCR program, which may need other settings (maybe a two-level TIFF instead of a true-colour JPG, for instance). Other that that, yes, a scanned PDF is nothing more than a set of images, and you should be able to OCR them just as well.
|
04-12-2009, 02:25 PM | #9 |
Wizard
Posts: 1,213
Karma: 12890
Join Date: Feb 2009
Location: Amherst, Massachusetts, USA
Device: Sony PRS-505
|
I don't understand the force of that "probably". It'll have whatever resolution you tell it to have when you scan it. Dealing with a PDF is easier than dealing with separate image files for every page. If it's a decent resolution, it depends more on the sophistication of the OCR software than the input format, I find.
|
04-12-2009, 03:08 PM | #10 |
Enthusiast
Posts: 26
Karma: 10
Join Date: Mar 2009
Device: PRS-700
|
Sorry for the absence, been awfully busy with work lately. Anyways, I've uploaded a sample page of the OCR / scan I did. Now, it wasn't really a scan. I had a DSLR mounted onto a tripod and I basically flipped the pages and took a picture of each page. I found it a lot faster than using my scanner. I got 300 pages done in 20 minutes. If I had more megapixels I could of captured both pages at once. I also included a basically text file I created and sent to Adobe PDF. Both files are DRM protected, any clues as to why? I tried to use calibre to convert to lrt or epub and it would not let me. Thanks!
|
04-12-2009, 03:26 PM | #11 |
frumious Bandersnatch
Posts: 7,516
Karma: 18512745
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
|
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
View Kindle notes in Adobe Acrobat ? | djk80 | Amazon Kindle | 2 | 08-26-2010 12:53 PM |
For those of you still using Acrobat or Reader for your Adobe DRM'd PDFs... | Jim Lester | 0 | 12-01-2009 07:00 PM | |
Adobe acrobat PDF file names | psionx | Sony Reader | 6 | 10-10-2009 08:02 PM |
pdf and adobe acrobat reader? | superstitious | 2 | 09-11-2009 07:56 PM | |
Adobe reader 9 VS. Acrobat 7 | Icarusbop | Reading and Management | 2 | 03-08-2009 08:16 PM |