3D ROOM GEOMETRY ESTIMATION FROM MEASURED IMPULSE RESPONSES
Sakari Tervo and Timo Tossavainen
Aalto University School of Science
Department of Media Technology
P.O. Box 15400, FI00076 Aalto
ABSTRACT
Estimation of the room geometry from spatial room impulse re-
sponses is studied. An algorithm for estimating the geometry is
presented. The algorithm does not require any a priori information
on the room shape, number of walls, or order of the reflections,
but deduces the set of planes that explain the measured source and
image-source locations and covariances iteratively. The algorithm is
demonstrated with real data experiments.
Index Terms— Room geometry estimation, room impulse re-
sponse, reflection
1. INTRODUCTION
The geometry of the room is one of the most essential parts of room
acoustic modeling. Besides the prediction of the acoustics of rooms,
the room acoustic models can be used for example to enhance source
localization performance [1].
Estimation of the room geometry can be divided into three
subtopics, localization of reflections, i.e. the image-sources, estima-
tion of the surface parameters, i.e plane points and normals, and the
estimation of room geometry. In principle, any general localization
method can be used to localize the reflections. As an example, in [2]
reflections are localized using sound intensity vectors and time of
arrival (TOA).
The locations of the reflections can be used together with the es-
timated or a priori known source location to deduce the surface pa-
rameters. This requires the knowledge of the order of the reflections.
Plane parameters are estimated in [3] by rotating a B-format micro-
phone around a loudspeaker, directed towards the microphone. The
estimation is based on the TOA and the direction of arrival (DOA)
of the first arriving reflection in each direction. The TOA and DOA
measurements are grouped using hierarchical clustering to avoid es-
timating the same plane multiple times. Moreover, in [4] the plane
parameters are estimated with a common tangent algorithm. The
same approach is applied in [5] and several other publications for
the estimation of plane parameters.
The actual room geometry estimation algorithms combine the
locations of reflections and source as well as the orders of the reflec-
tions. One such algorithm, which uses only one room impulse re-
sponse, has been proposed in [6]. The algorithm requires the knowl-
edge of the order of the first and second order reflections and of their
arrival times. Moreover, in [7] a constrained room model and l1-
regularized least-squares method is applied to fit 3-D shoebox model
to a set of measured impulse responses. The number of walls is as-
sumed to be known a priori. In addition, in [3] the clustering of
This work was supported by ERC grant agreement no. [203636],
HECSE, and Nokia Foundation
the TOA and DOA measurements constitutes as the room geometry
estimation algorithm. The geometry estimation presented in [3], as
well as in [4] and [5], use the assumption that all the detected reflec-
tions are of first order. To the understanding of the present authors
all the previous approaches use a priori information either on the
number of the walls, shape of the enclosure, or on the order of the
reflections. Especially the a priori assumption on the order of the re-
flections is not feasible, since in most of the practical situations the
earliest arriving second order reflection arrives before the latest first
order reflection.
Here a room geometry estimation algorithm is proposed that is
able to deduct the room geometry without any of the above listed
a priori information. The algorithm deduces iteratively the set of
planes that has produced a set of estimated reflection locations and
covariances. Rest of the article is organized as follows. Section 2,
presents the estimation of the reflection locations and of their covari-
ance matrices from the spatial room impulse responses. In Section 3,
the geometry that explains the estimated locations and covariances
of the reflections is estimated with an iterative maximum likelihood
algorithm. Experiments are conducted with real data in Section 4.
Section 5 discusses the results and concludes the article.
2. ESTIMATION OF SOURCE AND REFLECTION
LOCATIONS
2.1. Reflection signal model
In this paper, a room impulse response measured with a microphone
at location rn and a loudspeaker at location x is considered as a sum
of the direct sound and individual reflections:
hn(t)
△
= h(rn, x; t)=
K
k=0
h
k,n
(t)
+ wn(t)
=
K
k=0
∞
-∞
H
k,n
(ω)e
jωt
dω
+ wn(t), (1)
where t is time, ω is angular frequency, n is the index for micro-
phone, k =0 is the direct sound, k =1,...,K are the reflec-
tions, wn(t) is measurement noise independent for each microphone
and of the signal and distributed according to normal distribution for
each microphone. Moreover, h
k,n
(t) and H
k,n
(ω) are the time and
the frequency domain presentation of the direct sound and of the
reflections.
The applied microphone array is assumed to have a small aper-
ture size compared to the dimensions of the room. Then the impulse
responses can be divided into short time windows which each in-
clude only one reflection. In realistic situations this is true for the
513 978-1-4673-0046-9/12/$26.00 ©2012 IEEE ICASSP 2012