05-20-2020, 12:20 PM   #5
Doitsu
Quote:
Originally Posted by DNSB View Post
His argument is that there must be an automated solution for this task, pointing to AntConc 3.5.8 as an example.
I haven't used AntConc in a while, but I'm pretty sure that it doesn't return page numbers.

Quote:
Originally Posted by DNSB View Post
[...] the other example he pointed out was written in Fortran and has not been maintained since the 80's.
I seriously doubt that an 80's program will return page numbers. What's the name of the Fortran program?

Quote:
Originally Posted by DNSB View Post
That still requires a fair amount of manual labour including generating the list of stop words.
There are ready-made stop word lists. For example:

MySQL

NLTK

It's also relatively easy to create a list of all unique words with NLTK and other tools. But insisting on page numbers doesn't make sense.
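
To give an idea of how little is involved in the unique-word part, here is a minimal sketch. It uses only the Python standard library rather than NLTK, so it runs without downloading any corpora. The function name unique_words, the sample sentence, and the tiny STOP_WORDS set are all made up for illustration; in practice one of the ready-made lists would take the place of that set (for NLTK, that would be nltk.corpus.stopwords.words("english"), assuming the stopwords corpus is installed).

```python
import re

# Hypothetical, deliberately tiny stop-word list for illustration only.
# A ready-made list (e.g. the NLTK English stop words) could be swapped in.
STOP_WORDS = {"a", "an", "and", "the", "of", "to", "in", "is", "it"}


def unique_words(text, stop_words=STOP_WORDS):
    """Return the sorted unique words in text, minus stop words."""
    # Lowercase the text, then extract runs of letters; an apostrophe is
    # allowed inside a word so that contractions stay in one piece.
    words = re.findall(r"[a-z]+(?:'[a-z]+)*", text.lower())
    return sorted(set(words) - set(stop_words))


sample = "The index of the book is a list, and it isn't short."
print(unique_words(sample))
```

Note that this only produces a word list. It says nothing about where the words occur, which is exactly the part that a page-number requirement would add.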

Quote:
Originally Posted by DNSB View Post
From his email this AM, he has decided to find a "professional" who will give him what he wants and at the rather low price he wants to pay.
Good riddance.