You might also want to check the EPUB 3.2 spec at
4.3.3 Text-to-Speech. As far as I can tell, there is no provision for multiple voices and the key word in that chunk of the spec is
SHOULD which is generally translated as it's optional and no one is likely to implement it.