STATE-SPACE ARCHITECTURE OF THE PARTITIONED-BLOCK-BASED ACOUSTIC ECHO CONTROLLER Fabian Kuech, Edwin Mabande Fraunhofer IIS Am Wolfsmantel 33 D-91058 Erlangen, Germany fabian.kuech@iis.fraunhofer.de Gerald Enzner Institute of Communication Acoustics Ruhr-Universit¨ at Bochum D-44780 Bochum, Germany gerald.enzner@rub.de ABSTRACT Acoustic echo cancellation has traditionally employed basi- cally all variants known from deterministic adaptive ﬁlter design, such as least mean-square (LMS), recursive least-squares (RLS), and frequency-domain adaptive ﬁlters (FDAF). More recently, a stochas- tic adaptive ﬁlter design based on the concept of acoustic state-space modeling of the echo path has been introduced to accommodate for an ever sought uniﬁcation of adaptive ﬁltering and adaptation con- trol. The corresponding Kalman ﬁlter theory has been formulated for single-channel, multi-channel, and nonlinear echo cancellation problems. This paper closes an important gap by formulating the state-space model and the corresponding adaptive algorithm for the partitioned-block ﬁltering structure which is especially relevant in practice. This structure allows for the use of signiﬁcantly longer ﬁlter lengths in comparison to previous work, and for the ﬂexible design and implementation of acoustic echo cancellers for widely differing acoustic conditions. Index Terms— acoustic echo control, adaptive ﬁltering 1. INTRODUCTION AND RELATION TO PRIOR WORK Acoustic echo represents a well-known distraction in hands-free voice communication systems. Due to acoustic coupling between the loudspeaker and the microphone of a telecommunication termi- nal, the far-end talker receives a delayed version of his own voice that will eventually inhibit ﬂuent conversation. The general setup of the acoustic echo problem is illustrated in Fig. 1. The far-end signal x(n) is played back by the near-end loudspeaker. The microphone signal y(n) then picks up the echo d(n) together with the near-end signal s(n), including background noise and local speech. The adaptive acoustic echo canceler (AEC) regenerates and subtracts an estimate of the echo from the microphone signal. A variety of adaptive ﬁlter structures and methods to control the adaptation in adverse environments have been proposed for this purpose [1, 2, 3]. Typically, due to time-varying acoustics and echo-path undermodel- ing, the AEC is not always able to sufﬁciently remove the echo and, thus, a residual echo suppressor (RES) is introduced after the AEC AEC RES d(n) s(n) x(n) y(n) e(n) ˆ d(n) z(n) Fig. 1. Setup of the acoustic echo cancellation problem. to attenuate remaining echo components [4, 5, 6, 7]. In modern communication systems such as Voice-over-IP ser- vices or high-quality video conferencing systems, sampling rates of 16 kHz and higher are used. This implies a signiﬁcant increase in computational complexity for the AEC. Moreover, the convergence speed of time-domain adaptive ﬁlters is usually not sufﬁcient in case of high sampling rates and long echo paths. Besides subband adap- tive ﬁlters [8, 9], frequency-domain adaptive ﬁlters using block pro- cessing are well-known solutions to address both of these issues [10, 11, 12, 13]. However, the required length of the frequency transform becomes relatively large for long echo paths, leading to potential algorithmic noise, e.g., when implementing the AEC with ﬁxed point arithmetic in embedded devices. In this case, approaches based on partitioned-block ﬁltering [14, 15, 16, 17, 18] are more suit- able, as they allow for ﬂexible designs, e.g., a separate choice of the transformation length and the time span covered by the AEC, and the ﬁlter length is not bounded to powers of two as typically used in fast fourier transform implementations. Moreover, a reduction of the transformation length also reduces the algorithmic delay caused by post-processing stages such as the RES. By using a partitioned block structure, approximations of the RES ﬁlter to reduce the delay as, e.g., proposed in [19], can be avoided. The step-size parameter to control the adaptation of the ﬁlter co- efﬁcients is generally a critical component of the AEC. An overview of popular methods is found, e.g., in [1, 2, 3]. The alternative ap- proach in [6] relies on an acoustic state-space model of the echo path to deduce a robust and efﬁcient frequency-domain adaptive ﬁl- ter with inherent step-size control, however, not considering block partitioning. This approach has analogously been applied to multi- channel as well as nonlinear adaptive ﬁltering problems [20, 21]. In this paper, we extend the approach in [6] to the partitioned-block ﬁl- tering structure, which is of practical importance for the application of AECs for widely varying acoustic conditions. In our paper, Sec. 2 revises partitioned-block adaptive ﬁltering based on state-space modeling in analogy with [6]. Sec. 3 shows that the corresponding exact Kalman ﬁlter in the block-frequency- domain can be simpliﬁed into a diagonalized version. This light approximation leads to a variant of the known multi-delay adaptive ﬁlter [14], but providing inherent adaptive step-size control. While the resulting step-size turns out to be similar to the one in [15], we propose a different estimator for the required system distance, again in analogy with the non-partitioned algorithm in [6]. In Sec. 4, a so far unique relation between the step-size parameters of all parti- tions and the optimum RES ﬁlter is derived in order to efﬁciently obtain the RES coefﬁcients. Simulation results in Sec. 5 conﬁrm the suitability of the proposed architecture for acoustic echo control. 2014 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) 978-1-4799-2893-4/14/$31.00 ©2014 IEEE 1309