Quote:
Originally Posted by BlackVoid
"You know that line breaks and paragraphs are unknown (unused) concepts in PDF?"
Your eyeballs can identify those line breaks and paragraphs. Are they there or not? If you see them, a program can also "see" them.
How interesting. You do realize that there are a great many things that humans can do that computers can't, at least not yet. Recognizing known shapes and their meaning is one of them.
Maybe you should accept it as a fact that while there may be PDF files that are easy to convert to text, this is by no means guaranteed. It is just as possible that the PDF file will be a mess of lines and curves that just happen to spell letters, words, sentences and paragraphs. And, unfortunately, in my experience PDFs tend more towards the latter than the former.
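
To make the point concrete, here is a rough sketch of what "seeing" line breaks involves even in the easy case, where the page stores real text. It assumes some extraction tool has already handed you text fragments with page coordinates; the (x, y, text) tuple layout, the group_into_lines name and the tolerance value are all made up for illustration and do not come from any particular library. The line structure is guessed from positions, not read from the file.

```python
from typing import List, Tuple

# Hypothetical input format: (x, y, text) fragments with y growing upward,
# as in PDF user space. This is an assumption, not any library's real output.
Fragment = Tuple[float, float, str]


def group_into_lines(fragments: List[Fragment], y_tolerance: float = 2.0) -> List[str]:
    """Guess text lines by clustering fragments whose baselines nearly match."""
    lines: List[List[Fragment]] = []
    # Visit fragments from the top of the page down (descending y).
    for frag in sorted(fragments, key=lambda f: -f[1]):
        # Same line if the baseline is close to the first fragment of the current line.
        if lines and abs(lines[-1][0][1] - frag[1]) <= y_tolerance:
            lines[-1].append(frag)
        else:
            lines.append([frag])
    # Within each guessed line, order fragments left to right and join them.
    return [
        " ".join(text for _, _, text in sorted(line, key=lambda f: f[0]))
        for line in lines
    ]


# Fragments deliberately out of reading order, as they may be in a PDF.
fragments = [
    (120.0, 700.0, "world"),
    (72.0, 700.5, "Hello"),
    (72.0, 686.0, "second"),
    (110.0, 686.0, "line"),
]
print(group_into_lines(fragments))  # ['Hello world', 'second line']
```

Note that this only works when the file contains text fragments at all. If the letters are drawn as plain lines and curves, there is nothing to group, and you are left with OCR-style shape recognition.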