Each word takes 2 captures:

([A-Z]) is the first letter
(.+) followed by a space is the rest of the word, with the space serving as the word delimiter
So my system works for 4 words maximum, and a separate pattern must be crafted
for each of the 4-, 3-, 2- and 1-word cases, each surrounded by a <text> tag.
A regex guru might come up with a better pattern / replacement.
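For anyone who wants to try the scheme, here is a rough Python sketch of it. Treat it as an illustration under assumptions rather than the actual pattern: it assumes the tag is a bare <text>...</text> with no attributes, and the helper names (build_pattern, capture_words, rewrite_text_tags) are made up for this example. The last helper shows one possible way around the 4-word limit by handling the tag content word by word. Since the replacement itself is not spelled out above, str.capitalize is used purely as a stand-in.

```python
import re

# One word = two captures: the first letter, then the rest of the word.
WORD = r"([A-Z])(.+)"


def build_pattern(n_words):
    """Pattern for n_words space-separated words inside <text>...</text>."""
    return re.compile("<text>" + " ".join([WORD] * n_words) + "</text>")


def capture_words(line, max_words=4):
    """Try the 4-, 3-, 2- and 1-word patterns in that order.

    Longest first matters: (.+) is greedy and can swallow spaces, so the
    1-word pattern would also match a line that contains several words.
    """
    for n in range(max_words, 0, -1):
        m = build_pattern(n).search(line)
        if m:
            return m.groups()
    return None


def rewrite_text_tags(line, fix_word):
    """Possible alternative: apply fix_word to every word, any word count."""
    def fix_content(m):
        words = m.group(2).split(" ")
        return m.group(1) + " ".join(fix_word(w) for w in words) + m.group(3)
    return re.sub(r"(<text>)(.*?)(</text>)", fix_content, line)


print(capture_words("<text>HELLO WORLD</text>"))
# ('H', 'ELLO', 'W', 'ORLD')

# str.capitalize is only an assumed stand-in for the real replacement.
print(rewrite_text_tags("<text>ONE TWO THREE FOUR FIVE</text>", str.capitalize))
# <text>One Two Three Four Five</text>
```

The callback version needs only one pattern, at the cost of doing the per-word work in code instead of in the replacement string.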