OK, I pasted the first few paragraphs from the PDF linked above and guess what? It did make mistakes. The first LONG paragraph has 3 extra returns, and it sometimes puts returns in places that aren't the proper end of a sentence. So it's not too bad overall, but you'd still need to go through it against the PDF to fix the mistakes. There is no way any program can figure out exactly where every paragraph is supposed to start. It's not possible unless there are spaces or a tab at the beginning of each paragraph, or the program uses line ends/lengths to try to guess. If you have lines of, say, 60-80 characters and then a shorter line that is a proper sentence end, then yeah, it could do it. But when there is nothing to tell what is a paragraph and what isn't, it cannot figure it all out 100%.
Using 4 paragraphs from that PDF, I have 10 returns when I should only have 4.
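For anyone who wants to try the line-length guess described above, here is a rough Python sketch. The function name `rejoin_paragraphs` and the 60-character cutoff are my own assumptions for illustration, not anything the program being tested actually does. It joins lines together and only starts a new paragraph after a line that is both shorter than the cutoff and ends in sentence punctuation.

```python
def rejoin_paragraphs(text, min_full_len=60):
    """Guess paragraph breaks from line lengths (a heuristic, not exact).

    min_full_len is an assumed cutoff: lines shorter than this that end
    a sentence are treated as the last line of a paragraph.
    """
    paragraphs = []
    current = []
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped:
            # A blank line is taken as an explicit paragraph break.
            if current:
                paragraphs.append(" ".join(current))
                current = []
            continue
        current.append(stripped)
        # A short line ending in sentence punctuation probably closes a paragraph.
        if len(stripped) < min_full_len and stripped.endswith((".", "!", "?")):
            paragraphs.append(" ".join(current))
            current = []
    if current:
        # Whatever is left over becomes the final paragraph.
        paragraphs.append(" ".join(current))
    return paragraphs
```

This will still get it wrong whenever the last line of a paragraph happens to fill the full width, or when a short sentence lands on its own line in the middle of a paragraph, so you'd still have to check the result against the PDF.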