Af ﬁne Combinations of Adaptive Filters Renato Candido, Magno T. M. Silva, and V´ ıtor H. Nascimento Electronic Systems Engineering Department, Escola Polit´ ecnica, University of S˜ ao Paulo Av. Prof. Luciano Gualberto, trav. 3, n o 158, CEP 05508-900 – S˜ ao Paulo, SP, Brazil E-mails: {renatocan, magno, vitor}@lps.usp.br Abstract—We extend the analysis presented in [1] for the afﬁne combination of two least mean-square (LMS) ﬁlters to allow for colored inputs and nonstationary environments. Our theoretical model deals, in a uniﬁed way, with any combinations based on the following algorithms: LMS, normalized LMS (NLMS), and recursive-least squares (RLS). Through the analysis, we observe that the afﬁne combination of two algorithms of the same family with close adaptation parameters (step- sizes or forgetting factors) provides a 3 dB gain in relation to its best component ﬁlter. We study this behavior in stationary and nonstationary environments. Good agreement between analytical and simulation results is always observed. Furthermore, a simple geometrical interpretation of the afﬁne combination is investigated. A model for the transient and steady-state behavior of two possible algorithms for estimation of the mixing parameter is proposed. The model explains situations in which adaptive combination algorithms may achieve good performance. Index Terms—Adaptive ﬁlters, afﬁne combination, steady-state analy- sis, transient analysis, LMS algorithm. I. I NTRODUCTION Recently, an afﬁne combination of two least mean-square (LMS) adaptive ﬁlters was proposed and its transient performance analyzed [1]. This method combines linearly the outputs of two LMS ﬁlters operating in parallel with different step-sizes. The purpose of the combination is to obtain an adaptive ﬁlter with fast convergence and reduced steady-state excess mean-square error (EMSE). Since the mixing parameter is not restricted to the interval [0, 1], this method can be interpreted as a generalization of the convex combination of two LMS ﬁlters of [2], [3]. In this paper, we extend the results of [1] by providing a uniﬁed analysis, which is valid for colored inputs, nonstationary environ- ments, and combinations based on LMS, NLMS, and RLS algorithms. To explain the behavior of the afﬁne combination of two algorithms, we present a simple geometrical interpretation. Furthermore, we also explain why fast-adaptation of the mixing parameter in general leads to a quite large variance around the optimum value. Then, we ﬁnd a model for the transient and steady-state behavior of two possible algorithms for estimation of the mixing parameter. In order to simplify the arguments, we assume that all quantities are real. II. PROBLEM FORMULATION A combination of two adaptive ﬁlters is depicted in Figure 1. In this scheme, the output of the overall ﬁlter is given by y(n)= η(n)y1(n) + [1 - η(n)]y2(n), (1) where η(n) is the mixing parameter, yi (n), i =1, 2 are the outputs of the transversal ﬁlters, i.e., yi (n)= u T (n)wi (n - 1), u(n) ∈ R M is the common regressor vector, and wi (n - 1) ∈ R M are the weight vectors of each length-M component ﬁlter. We focus on the afﬁne combination of two adaptive algorithms of the following general class wi (n)= wi (n - 1) + ρi (n) Mi (n)u(n)ei (n), i =1, 2, (2) This work is partly supported by CNPq under grants No. 136050/2008-5 and No. 303361/2004-2 and by FAPESP under grants No. 2008/00773-1 and No. 2008/04828-5. where ρi (n) is a step-size, Mi (n) is a symmetric non-singular matrix, ei (n)= d(n) - yi (n) is the estimation error, and d(n) is the desired response. The LMS, NLMS, and RLS algorithms employ the step-sizes ρi (n) and the matrices Mi (n) as in Table I. In this table, μi , ˜ μi and ǫ are positive constants, ‖·‖ is the Euclidian norm, I is the M × M identity matrix, and 0 ≪ λi < 1 is a forgetting factor. For RLS, Mi (n)=  R -1 i (n) is obtained via the matrix inversion lemma [4, Eq.(2.6.4)] applied to  Ri (n), which is an estimate (with forgetting factor λi ) of the autocorrelation matrix of the input signal, i.e., R  E{u(n)u T (n)}, where E{·} is the expectation operator. We assume that d(n) and u(n) are related via a linear regression model, that is, d(n)= u T (n)wo(n - 1) + v(n), where wo(n - 1) is the time-variant optimal solution and v(n) is an i.i.d. (independent and identically distributed) and zero mean random process with variance σ 2 v =E{v 2 (n)}, which plays the role of a disturbance independent of u(n) [4, Sec. 6.2.1]. Furthermore, the sequences {u(n)} and {v(n)} are assumed stationary. In the afﬁne combination, the mixing parameter η(n) is not restricted to the interval [0, 1] and can be adapted via η(n + 1) = η(n)+ μη e(n)[y1(n) - y2(n)], (3) where μη is a step-size, and e(n)= d(n) - y(n) is the estimation error of the overall ﬁlter. The recursion (3) was obtained in [1], using a stochastic gradient search to minimize the instantaneous mean- square error (MSE) cost function. In [1], η(n) was constrained to be less than or equal to 1 for all n, to ensure stability of (3). In this paper, we applied this constraint when using (3). The constraint was not necessary when the normalized version of (3) was used (see Sec. V). 1 ( 1 ) n − w 2 ( 1) n − w () yn 1 () y n 2 () y n () n 1- () n () T n u 1 () en 2 () e n () en () dn O ( 1) n − w () vn ( 1) n − w Fig. 1. Afﬁne combination of two transversal adaptive ﬁlters. III. STEADY-STATE PERFORMANCE OF ADAPTIVE FILTERS We assume that in a nonstationary environment, the variation in the optimal solution wo follows a random-walk model [4, p. 359], that is, wo(n)= wo(n - 1) + q(n). In this model, q(n) is an i.i.d. vector 236 978-1-4244-2941-7/08/$25.00 ©2008 IEEE Asilomar 2008