View Single Post
Old 09-25-2011, 11:47 AM   #19
Elfwreck
Grand Sorcerer
Elfwreck ought to be getting tired of karma fortunes by now.Elfwreck ought to be getting tired of karma fortunes by now.Elfwreck ought to be getting tired of karma fortunes by now.Elfwreck ought to be getting tired of karma fortunes by now.Elfwreck ought to be getting tired of karma fortunes by now.Elfwreck ought to be getting tired of karma fortunes by now.Elfwreck ought to be getting tired of karma fortunes by now.Elfwreck ought to be getting tired of karma fortunes by now.Elfwreck ought to be getting tired of karma fortunes by now.Elfwreck ought to be getting tired of karma fortunes by now.Elfwreck ought to be getting tired of karma fortunes by now.
 
Elfwreck's Avatar
 
Posts: 5,187
Karma: 25133758
Join Date: Nov 2008
Location: SF Bay Area, California, USA
Device: Pocketbook Touch HD3 (Past: Kobo Mini, PEZ, PRS-505, Clié)
Quote:
Originally Posted by Zetmolm View Post
If a PDF is not tagged, you cannot tag it after the fact unless you have the source file(s).

I don't know about academic sources, but PDFs I've seen from other legit sources are often untagged.
Acrobat Pro will add tags to PDFs. (Not that everyone has access to Acrobat Pro.) And its tagging is automatic and doesn't always pick the right order for complicated things like tables or multiple columns on a page. (Two columns tends to be fine; it's "three columns with inset callout text" like magazines that have problems.)

The tagging can be editing, but that's a nightmare; there are few more annoyingly complicated doc formatting tasks. (I say this as someone who likes text zoning in Finereader and enjoys OCR correction work.)

Many academic texts aren't tagged (they're often made in InDesign with intent to print; tagging is irrelevant for that and the editors often don't even know it exists); a lot of indie publisher texts aren't tagged. Adding tags to these might not work well; it depends on the program used to create the PDF. Books released with double-spaced text tend to get tagged as paragraph-per-line by Acrobat.
Elfwreck is offline   Reply With Quote