1940 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 21, NO. 9, SEPTEMBER 2013 Room Impulse Response Synthesis and Validation Using a Hybrid Acoustic Model Alex Southern, Samuel Siltanen, Damian T. Murphy, and Lauri Savioja, Senior Member, IEEE Abstract—Synthesizing the room impulse response (RIR) of an arbitrary enclosure may be performed using a number of alterna- tive acoustic modeling methods, each with their own particular ad- vantages and limitations. This article is concerned with obtaining a hybrid RIR derived from both wave and geometric-acoustics based methods, optimized for use across different regions of time or fre- quency. Consideration is given to how such RIRs can be matched across modeling domains in terms of both amplitude and boundary behavior and the approach is veriﬁed using a number of standard- ised case studies. Index Terms—FDTD, image-source, auralization, hybrid acoustic modeling. I. INTRODUCTION P REDICTING wave propagation behavior within an arbi- trary enclosure is beneﬁcial in many areas of science. The simulation of sound wave propagation is particularly important as part of the architectural design process without which it is not possible to audition the acoustic qualities of a building before it has been constructed. Acoustic modeling techniques therefore provide an early indication of the objective acoustic properties of non-existent spaces through the analysis of the synthesized room impulse response (RIR) and subsequent derivation of stan- dard acoustical parameters [1], [2]. Such methods can then also be extended with some conﬁdence to other virtual reality type applications, as found in ﬁrst person gaming environments [3], [45], or in the reconstruction of historic buildings that no longer exist, or exist only in part [4]. Fig. 1 is an overview of the typical acoustic design and eval- uation workﬂow for a given environment that supports the anal- ysis of established objective metrics with subjective analysis based on critical listening. In this situation the designer is pre- sented with a rendered soundﬁeld derived directly from the syn- thesized RIRs. Hence, the RIR synthesis and spatial encoding stage combined with the subsequent representation in a format suitable for audible rendering, result in an auralization of the Manuscript received August 21, 2012; revised December 19, 2012, April 16, 2013; accepted April 18, 2013. Date of publication May 14, 2013; date of current version July 12, 2013. This work was supported in part by the Academy of Finland under Project No. 138780 and in part by EPSRC Grant EP/J000108/1. The associate editor coordinating the review of this manuscript and approving it for publication was Prof. Woon-Seng Gan. A. Southern, S. Siltanen, and L. Savioja are with the Department of Media Technology, Aalto University School of Science, 02150 Espoo, Finland (e-mail: mrapsouthern@gmail.com; saasilta@tml.hut.ﬁ; lauri.savioja@aalto.ﬁ). D. T. Murphy is with the AudioLab, Department of Electronics, University of York, North Yorkshire YO10 5DD, U.K. (e-mail: dtm3@ohm.york.ac.uk). Color versions of one or more of the ﬁgures in this paper are available online at http://ieeexplore.ieee.org. Digital Object Identiﬁer 10.1109/TASL.2013.2263139 Fig. 1. An overview of a typical acoustic design and evaluation workﬂow for a given environment based on the analysis of both objective measures obtained from RIR synthesis, and subjective listening via auralization. modeled enclosed space. In this context auralization allows the listener to experience auditory cues that would occur if the lis- tener was present in the equivalent real-world listening environ- ment. However in practice the auditory cues are only represen- tative of the listening situation rather than being absolutely iden- tical. Realizing this subjectively informed method of design, particularly with respect to concert hall acoustics, is the over- arching concept that motivates the contributions of this article. Currently established and accepted methods of predicting room acoustic characteristics are based on a geometric-based model of ray-like sound propagation. This approach is valid when representing wavelengths that are small in comparison to the dimensions of the bounding enclosure and internal objects. The main difference between these methods is in the discovery or approximation of reﬂection paths [5]. Each geometric method has its associated strengths and limitations although generally all rays that intersect a chosen listening region are recorded in a list for their energy, path length/time-of-arrival and angle-of-arrival, providing sufﬁcient information to build a reﬂection echogram. At larger wavelengths the ray-like as- sumption no longer holds and this leads to a misrepresentation of the low frequency response of the acoustic model [6]. An alternative modeling approach is offered through wave- based methods that have been shown to be more appropriate in the low frequency range, e.g., [7]–[9]. These methods are based on a discrete numerical solution to the wave equation and are therefore a proper treatment of physical wave motion. How- ever, they are computationally intensive, especially when mod- eling large volumes and/or a wide frequency bandwidth such as the range of hearing. This is due to the large number of el- ements/calculations required to approximate continuous wave propagation with sufﬁcient accuracy using a discrete system. Most previous research in attempting to develop a complete and accurate solution to synthesizing a RIR have been based 1558-7916/$31.00 © 2013 IEEE