Number three is supposed to be the correct way. I find that if there is a long speech, there should probably be some sort of expression or action happening too, like 'they sighed', 'they whirled and glared before continuing', or even 'they gestured dramatically'.
Let's face it, a long speech usually happens because someone is impassioned, so a facial expression or action wouldn't go amiss. If they aren't impassioned, then the speech probably needs to be broken up into smaller chunks, just to give the reader's brain a rest.