Optimal Data Compression and Filtering: the Case of Inﬁnite Signal Sets Anatoli Torokhti and Phil Howlett Abstract—We present a theory for optimal ﬁltering of inﬁnite sets of random signals. There are several new distinctive features of the proposed approach. First, we provide a single optimal ﬁlter for processing any signal from a given inﬁnite signal set. Second, the ﬁlter is presented in the special form of a sum with p terms where each term is represented as a combination of three operations. Each operation is a special stage of the ﬁltering aimed at facilitating the associated numerical work. Third, an iterative scheme is implemented into the ﬁlter structure to provide an improvement in the ﬁlter performance at each step of the scheme. The ﬁnal step of the concerns signal compression and decompression. This step is based on the solution of a new rank-constrained matrix approximation problem. The solution to the matrix problem is described in this paper. A rigorous error analysis is given for the new ﬁlter. Keywords—stochastic signals, optimization problems in signal processing. I. I NTRODUCTION A. Motivation I N this paper, we consider extensions of known approaches to optimal ﬁltering based on the Wiener idea 1 . We present a theory for a new nonlinear ﬁlter which processes inﬁnite sets of random signals. The ﬁlter is constructed via an iterative scheme that provides a signal processing improvement with each step. The ﬁlter provides simultaneous signal ﬁltering and compression and the subsequent decompression (reconstruc- tion). There has been signiﬁcant attention in the literature to ﬁlters that process ﬁnite sets of random signals but it seems that a ﬁlter which is able to process inﬁnite sets of random signals has not been developed. The ﬁlter presented in this paper is designed speciﬁcally to process inﬁnite sets of random signals. For the case of ﬁnite sets of random signals, we show that our ﬁlter leads to a lower computational load and better accuracy than the known ﬁlters; the improved accuracy is due to the special iteration procedure incorporated into the ﬁlter structure (see Section III-E2). There are three motivations for the proposed method which we now describe. 1) First motivation: inﬁnite sets of signals: Most of the literature on Wiener-like ﬁltering provides an optimal ﬁlter for an individual input signal given by a ﬁnite random vector 2 . This means that if we wish to transform an inﬁnite set Y = {y (1) , y (2) ,..., y (N) ,...} of input vector signal into an Anatoli Torokhti and Phil Howlett are with the School of Mathematics and Statistics, University of South Australia, Mawson Lakes, SA 5095, Australia, email: anatoli.torokhti@unisa.edu.au and phil.howlett@unisa.edu.au, 1 Some references on Wiener-like ﬁltering can be found in [8], [12], [16], [18], [19]. 2 We say a random vector x is ﬁnite if each realization x = x(ω) has a ﬁnite number of scalar components. inﬁnite set X = {x (1) , x (2) ,..., x (N) ,...} of output vector signals using a Wiener-like approach then we have to ﬁnd a set of corresponding Wiener ﬁlters {F (1) , F (2) ,..., F (N) ,...} so that each representative F i of the ﬁlter set relates to a representative y (i) of the signal-vector set Y . Therefore such a ﬁlter cannot be applied if X and Y are inﬁnite sets of signals. Moreover, in some situations, a recognizer must be used that will determine to which of the ﬁlters {F (1) , F (2) ,..., F (N) ,...} each component from Y should be directed. Note that even in the case when Y and X are ﬁnite sets, Y = {y (1) , y (2) ,..., y (N) } and X = {x (1) , x (2) ,..., x (N) } where Y and X can be represented as ﬁnite vectors, the Wiener approach leads to computation of large covariance matrices. Indeed, if each y i has n components and each x i has m components then the Wiener approach leads to computation of a product of an mN × nN matrix and an nN × nN matrix and computation of an nN × nN pseudo-inverse matrix [18]. This requires O(2mn 2 N 3 ) and O(22n 3 N 3 ) ﬂops, respectively [7]. If m, n and N are sufﬁciently large then the computational work associated with this approach becomes unreasonably hard. To avoid such drawbacks here, we here study an approach that allows us to use only one ﬁlter to process any signal from the inﬁnite set Y . The ﬁrst question we address in the paper is as follows. Let X and Y be inﬁnite sets of signals. How should we construct a single optimal ﬁlter F : Y → X which can be applied to each pair of signals (x, y) ∈ X × Y and which, moreover, transforms each y to a corresponding x with associated minimal error? Surprisingly, perhaps, the answer is based ﬁrstly, on an equivalent alternative signal representation in a different space and secondly, on the use of a special norm (3) in the statement of the problem. The dual representation means that x is considered as a single signal in one representation, and on the other hand, as an inﬁnite set of signals in the other, original, representation. A detailed explanation is given in Section VI. Examples of different special cases of the norm (3) used in our statement of the problem are presented in Section VI. The answer for the ﬁrst question is provided in Sections II-A, II-C, in Theorems 3 and 4, and in Section III-E. The special norm is given by (3) below. 2) Second motivation: improvement in the ﬁlter perfor- mance: The performance of ﬁlters used for data ﬁltering, compression and subsequent reconstruction, is characterized by the accuracy, the compression ratio and the related com- putational load. The Karhunen-Lo` eve ﬁlter (KLF) [13], [14], World Academy of Science, Engineering and Technology International Journal of Electronics and Communication Engineering Vol:2, No:11, 2008 2604 International Scholarly and Scientific Research & Innovation 2(11) 2008 scholar.waset.org/1307-6892/11167 International Science Index, Electronics and Communication Engineering Vol:2, No:11, 2008 waset.org/Publication/11167