I think the original version sounds more spontaneous, more like a stream-of-consciousness style. It's as if someone were talking to themselves. The other versions sound more like formal English.
I don't think any of them automatically sound more right than the others.
N.