MobileRead Forums - View Single Post

Quoth · 05-18-2024, 09:48 AM

Quote:

Originally Posted by Doitsu

For those kind of searches you'll need to use a concordance tool. For example, Laurence Anthony's AntConc (freeware).

Unzip the epub file.
If the folder contains .xhtml files change their file extensions to .html.
Open AntConc, select Open file(s) as 'Quick Corpus', select .html as the file type and then select the extracted .html files.
Click the N-Gram tab, select the desired number of words and click Start.

I've attached a sample screenshot of the output.

Obviously, copy and paste all the chapters/files to one mega file* as the repeat in a novel (likely unwanted if you are the author or official editor) is probably in a separate file.

Indeed any search tool is useless unless you know the exact repeating text.

[* Unless it has a project / session mode that remembers previous files?]