STATE-SPACE ARCHITECTURE OF THE PARTITIONED-BLOCK-BASED ACOUSTIC ECHO CONTROLLER Fabian Kuech, Edwin Mabande Fraunhofer IIS Am Wolfsmantel 33 D-91058 Erlangen, Germany fabian.kuech@iis.fraunhofer.de Gerald Enzner Institute of Communication Acoustics Ruhr-Universit¨ at Bochum D-44780 Bochum, Germany gerald.enzner@rub.de ABSTRACT Acoustic echo cancellation has traditionally employed basi- cally all variants known from deterministic adaptive filter design, such as least mean-square (LMS), recursive least-squares (RLS), and frequency-domain adaptive filters (FDAF). More recently, a stochas- tic adaptive filter design based on the concept of acoustic state-space modeling of the echo path has been introduced to accommodate for an ever sought unification of adaptive filtering and adaptation con- trol. The corresponding Kalman filter theory has been formulated for single-channel, multi-channel, and nonlinear echo cancellation problems. This paper closes an important gap by formulating the state-space model and the corresponding adaptive algorithm for the partitioned-block filtering structure which is especially relevant in practice. This structure allows for the use of significantly longer filter lengths in comparison to previous work, and for the flexible design and implementation of acoustic echo cancellers for widely differing acoustic conditions. Index Termsacoustic echo control, adaptive filtering 1. INTRODUCTION AND RELATION TO PRIOR WORK Acoustic echo represents a well-known distraction in hands-free voice communication systems. Due to acoustic coupling between the loudspeaker and the microphone of a telecommunication termi- nal, the far-end talker receives a delayed version of his own voice that will eventually inhibit fluent conversation. The general setup of the acoustic echo problem is illustrated in Fig. 1. The far-end signal x(n) is played back by the near-end loudspeaker. The microphone signal y(n) then picks up the echo d(n) together with the near-end signal s(n), including background noise and local speech. The adaptive acoustic echo canceler (AEC) regenerates and subtracts an estimate of the echo from the microphone signal. A variety of adaptive filter structures and methods to control the adaptation in adverse environments have been proposed for this purpose [1, 2, 3]. Typically, due to time-varying acoustics and echo-path undermodel- ing, the AEC is not always able to sufficiently remove the echo and, thus, a residual echo suppressor (RES) is introduced after the AEC AEC RES d(n) s(n) x(n) y(n) e(n) ˆ d(n) z(n) Fig. 1. Setup of the acoustic echo cancellation problem. to attenuate remaining echo components [4, 5, 6, 7]. In modern communication systems such as Voice-over-IP ser- vices or high-quality video conferencing systems, sampling rates of 16 kHz and higher are used. This implies a significant increase in computational complexity for the AEC. Moreover, the convergence speed of time-domain adaptive filters is usually not sufficient in case of high sampling rates and long echo paths. Besides subband adap- tive filters [8, 9], frequency-domain adaptive filters using block pro- cessing are well-known solutions to address both of these issues [10, 11, 12, 13]. However, the required length of the frequency transform becomes relatively large for long echo paths, leading to potential algorithmic noise, e.g., when implementing the AEC with fixed point arithmetic in embedded devices. In this case, approaches based on partitioned-block filtering [14, 15, 16, 17, 18] are more suit- able, as they allow for flexible designs, e.g., a separate choice of the transformation length and the time span covered by the AEC, and the filter length is not bounded to powers of two as typically used in fast fourier transform implementations. Moreover, a reduction of the transformation length also reduces the algorithmic delay caused by post-processing stages such as the RES. By using a partitioned block structure, approximations of the RES filter to reduce the delay as, e.g., proposed in [19], can be avoided. The step-size parameter to control the adaptation of the filter co- efficients is generally a critical component of the AEC. An overview of popular methods is found, e.g., in [1, 2, 3]. The alternative ap- proach in [6] relies on an acoustic state-space model of the echo path to deduce a robust and efficient frequency-domain adaptive fil- ter with inherent step-size control, however, not considering block partitioning. This approach has analogously been applied to multi- channel as well as nonlinear adaptive filtering problems [20, 21]. In this paper, we extend the approach in [6] to the partitioned-block fil- tering structure, which is of practical importance for the application of AECs for widely varying acoustic conditions. In our paper, Sec. 2 revises partitioned-block adaptive filtering based on state-space modeling in analogy with [6]. Sec. 3 shows that the corresponding exact Kalman filter in the block-frequency- domain can be simplified into a diagonalized version. This light approximation leads to a variant of the known multi-delay adaptive filter [14], but providing inherent adaptive step-size control. While the resulting step-size turns out to be similar to the one in [15], we propose a different estimator for the required system distance, again in analogy with the non-partitioned algorithm in [6]. In Sec. 4, a so far unique relation between the step-size parameters of all parti- tions and the optimum RES filter is derived in order to efficiently obtain the RES coefficients. Simulation results in Sec. 5 confirm the suitability of the proposed architecture for acoustic echo control. 2014 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) 978-1-4799-2893-4/14/$31.00 ©2014 IEEE 1309