The Importance of Spatiotemporal Information in Biological Motion Perception: White Noise Presented with a Step-like Motion Activates the Biological Motion Area Akiko Callan, Daniel Callan, and Hiroshi Ando Abstract Humans can easily recognize the motion of living creatures using only a handful of point-lights that describe the motion of the main joints (biological motion perception). This special ability to perceive the motion of animate objects signifies the importance of the spatiotemporal information in perceiving biological motion. The posterior STS (pSTS) and posterior middle temporal gyrus (pMTG) region have been established by many functional neuroimaging studies as a locus for biolog- ical motion perception. Because listening to a walking human also activates the pSTS/pMTG region, the region has been pro- posed to be supramodal in nature. In this study, we investigated whether the spatiotemporal information from simple auditory stimuli is sufficient to activate this biological motion area. We compared spatially moving white noise, having a running-like tempo that was consistent with biological motion, with station- ary white noise. The moving-minus-stationary contrast showed significant differences in activation of the pSTS/pMTG region. Our results suggest that the spatiotemporal information of the auditory stimuli is sufficient to activate the biological motion area. INTRODUCTION For animals, including humans, detecting the sound of footsteps and determining the direction and location of their movement have ecological significance for survival. Although the sound of a single footstep merely provides auditory information about the footwear contacting the walking surface, listening to consecutive footsteps in real life allows us to infer how the person is moving in a cer- tain environment. People can determine not only the direction and speed of locomotion but also the gender (Li, Logan, & Pastore, 1991), posture (Pastore, Flint, Gaston, & Solomon, 2008) and stride length (Young, Rodger, & Craig, 2013) of the walker. This phenomenon is compara- ble to visual biological motion perception. Johansson (1973) used the term biological motionto refer to the characteristic motion patterns of living beings during locomotion and showed that the movement of 1012 small light bulbs attached to the main joints (known as point-light [PL] animation) could evoke biological mo- tion perception. A single frame of the PL animation may appear as a meaningless assembly of dots. However, successive frames of the PL animation depict a compelling impression of human motion, such as walking, running, or dancing. This special ability to perceive the motion of animate objects demonstrates the importance of the spatiotemporal information in perceiving biological mo- tion. Many human neuroimaging studies that have inves- tigated the brain areas responsive to biological motion perception using PL animations have reported that the posterior STS (pSTS) region is activated more by actual PL animations than by scrambled PL animations (for a review, see Blake & Shiffrar, 2007). In this article, we use the term pSTS/posterior middle temporal gyrus (pMTG) because biological motion perception activates pSTS as well as the adjacent pMTG (Allison, Puce, & McCarthy, 2000). Bidet-Caulet, Voisin, Bertrand, and Fonlupt (2005) investigated whether the pSTS/pMTG region was activated by the auditory stimuli and revealed that the pSTS/pMTG region was activated by detecting the direction of motion of the crossing walker while listening to the footsteps. This result indicates that auditory stimuli related to human motion also activate the pSTS/pMTG region. In our previous study, we compared realistic (external- ized) and artificial (internalized) spatial sounds and found that the activity in the posterior superior temporal gyrus (pSTG) and pMTG were greater in response to the exter- nalized stimuli than in response to internalized stimuli (Callan, Callan, & Ando, 2013). The externalized stimuli were individualized binaural recordings, and the internal- ized stimuli were stereo recordings. By recording sounds with microphones placed within the left and right pinnae, binaural recordings include the acoustic filter characteris- tics of the head and ears (head-related transfer function [HRTF]) and room reverberation. During headphone lis- tening, the HRTF is known to be responsible for percep- tual sound externalization when head motion cues are CiNet, National Institute of Information and Communication Technology, and Osaka University © 2016 Massachusetts Institute of Technology Journal of Cognitive Neuroscience 29:2, pp. 277285 doi:10.1162/jocn_a_01046