It's as if the perceptual system unconsciously combines both audio and visual inputs to represent motion in three dimensions.