I'm using spacy's `nlp.pipe()` to find named entities in the book text, this function has a parameter 'n_process' to set how many processes to run this function. The default value is 1 and no process is created. I want to see whether enable multiprocessing will be faster.
Oh, I don't know fork a multithreaded process is problematic as the doc says.
And could you please explain the difference between ParallelJob and ThreadedJob?
|