Quote:
Originally Posted by Doitsu
For those kind of searches you'll need to use a concordance tool. For example, Laurence Anthony's AntConc (freeware).
- Unzip the epub file.
- If the folder contains .xhtml files change their file extensions to .html.
- Open AntConc, select Open file(s) as 'Quick Corpus', select .html as the file type and then select the extracted .html files.
- Click the N-Gram tab, select the desired number of words and click Start.
I've attached a sample screenshot of the output.
|
Obviously, copy and paste all the chapters/files to one mega file* as the repeat in a novel (likely unwanted if you are the author or official editor) is probably in a separate file.
Indeed any search tool is useless unless you know the exact repeating text.
[* Unless it has a project / session mode that remembers previous files?]