Af ne Combinations of Adaptive Filters Renato Candido, Magno T. M. Silva, and V´ ıtor H. Nascimento Electronic Systems Engineering Department, Escola Polit´ ecnica, University of S˜ ao Paulo Av. Prof. Luciano Gualberto, trav. 3, n o 158, CEP 05508-900 – S˜ ao Paulo, SP, Brazil E-mails: {renatocan, magno, vitor}@lps.usp.br Abstract—We extend the analysis presented in [1] for the afne combination of two least mean-square (LMS) lters to allow for colored inputs and nonstationary environments. Our theoretical model deals, in a unied way, with any combinations based on the following algorithms: LMS, normalized LMS (NLMS), and recursive-least squares (RLS). Through the analysis, we observe that the afne combination of two algorithms of the same family with close adaptation parameters (step- sizes or forgetting factors) provides a 3 dB gain in relation to its best component lter. We study this behavior in stationary and nonstationary environments. Good agreement between analytical and simulation results is always observed. Furthermore, a simple geometrical interpretation of the afne combination is investigated. A model for the transient and steady-state behavior of two possible algorithms for estimation of the mixing parameter is proposed. The model explains situations in which adaptive combination algorithms may achieve good performance. Index Terms—Adaptive lters, afne combination, steady-state analy- sis, transient analysis, LMS algorithm. I. I NTRODUCTION Recently, an afne combination of two least mean-square (LMS) adaptive lters was proposed and its transient performance analyzed [1]. This method combines linearly the outputs of two LMS lters operating in parallel with different step-sizes. The purpose of the combination is to obtain an adaptive lter with fast convergence and reduced steady-state excess mean-square error (EMSE). Since the mixing parameter is not restricted to the interval [0, 1], this method can be interpreted as a generalization of the convex combination of two LMS lters of [2], [3]. In this paper, we extend the results of [1] by providing a unied analysis, which is valid for colored inputs, nonstationary environ- ments, and combinations based on LMS, NLMS, and RLS algorithms. To explain the behavior of the afne combination of two algorithms, we present a simple geometrical interpretation. Furthermore, we also explain why fast-adaptation of the mixing parameter in general leads to a quite large variance around the optimum value. Then, we nd a model for the transient and steady-state behavior of two possible algorithms for estimation of the mixing parameter. In order to simplify the arguments, we assume that all quantities are real. II. PROBLEM FORMULATION A combination of two adaptive lters is depicted in Figure 1. In this scheme, the output of the overall lter is given by y(n)= η(n)y1(n) + [1 - η(n)]y2(n), (1) where η(n) is the mixing parameter, yi (n), i =1, 2 are the outputs of the transversal lters, i.e., yi (n)= u T (n)wi (n - 1), u(n) R M is the common regressor vector, and wi (n - 1) R M are the weight vectors of each length-M component lter. We focus on the afne combination of two adaptive algorithms of the following general class wi (n)= wi (n - 1) + ρi (n) Mi (n)u(n)ei (n), i =1, 2, (2) This work is partly supported by CNPq under grants No. 136050/2008-5 and No. 303361/2004-2 and by FAPESP under grants No. 2008/00773-1 and No. 2008/04828-5. where ρi (n) is a step-size, Mi (n) is a symmetric non-singular matrix, ei (n)= d(n) - yi (n) is the estimation error, and d(n) is the desired response. The LMS, NLMS, and RLS algorithms employ the step-sizes ρi (n) and the matrices Mi (n) as in Table I. In this table, μi , ˜ μi and ǫ are positive constants, ‖·‖ is the Euclidian norm, I is the M × M identity matrix, and 0 λi < 1 is a forgetting factor. For RLS, Mi (n)= R -1 i (n) is obtained via the matrix inversion lemma [4, Eq.(2.6.4)] applied to Ri (n), which is an estimate (with forgetting factor λi ) of the autocorrelation matrix of the input signal, i.e., R E{u(n)u T (n)}, where E{·} is the expectation operator. We assume that d(n) and u(n) are related via a linear regression model, that is, d(n)= u T (n)wo(n - 1) + v(n), where wo(n - 1) is the time-variant optimal solution and v(n) is an i.i.d. (independent and identically distributed) and zero mean random process with variance σ 2 v =E{v 2 (n)}, which plays the role of a disturbance independent of u(n) [4, Sec. 6.2.1]. Furthermore, the sequences {u(n)} and {v(n)} are assumed stationary. In the afne combination, the mixing parameter η(n) is not restricted to the interval [0, 1] and can be adapted via η(n + 1) = η(n)+ μη e(n)[y1(n) - y2(n)], (3) where μη is a step-size, and e(n)= d(n) - y(n) is the estimation error of the overall lter. The recursion (3) was obtained in [1], using a stochastic gradient search to minimize the instantaneous mean- square error (MSE) cost function. In [1], η(n) was constrained to be less than or equal to 1 for all n, to ensure stability of (3). In this paper, we applied this constraint when using (3). The constraint was not necessary when the normalized version of (3) was used (see Sec. V). 1 ( 1 ) n w 2 ( 1) n w () yn 1 () y n 2 () y n () n 1- () n () T n u 1 () en 2 () e n () en () dn O ( 1) n w () vn ( 1) n w Fig. 1. Afne combination of two transversal adaptive lters. III. STEADY-STATE PERFORMANCE OF ADAPTIVE FILTERS We assume that in a nonstationary environment, the variation in the optimal solution wo follows a random-walk model [4, p. 359], that is, wo(n)= wo(n - 1) + q(n). In this model, q(n) is an i.i.d. vector 236 978-1-4244-2941-7/08/$25.00 ©2008 IEEE Asilomar 2008