Quote:
Originally Posted by mikapanja
That works in case you know what you are looking for. But what if you don't, which was my original thought? E.g. if you want to know if any 5-word group is repeated in the text.
|
For those kind of searches you'll need to use a concordance tool. For example, Laurence Anthony's
AntConc (freeware).
- Unzip the epub file.
- If the folder contains .xhtml files change their file extensions to .html.
- Open AntConc, select Open file(s) as 'Quick Corpus', select .html as the file type and then select the extracted .html files.
- Click the N-Gram tab, select the desired number of words and click Start.
I've attached a sample screenshot of the output.