An Efﬁcient Codebook for the SCELP Low Delay Audio Codec Hauke Kr¨ uger and Peter Vary Institute of Communication Systems and Data Processing RWTH Aachen University, D-52056 Aachen, Germany email: {krueger,vary}@ind.rwth-aachen.de Abstract— The SCELP (Spherical Code Excited Linear Predic- tion) audio codec has recently been proposed as a new candidate for low delay audio coding based on Linear Prediction (LP). The new codec applies closed-loop vector quantization employing a spherical code in a gain shape manner. The spherical code is based on the apple-peeling code construction rule and in general does not require a codebook table for the encoding and decoding process. In this contribution, however, we propose to employ auxiliary information gathered in advance to reduce the computational encoding and decoding complexity at runtime signiﬁcantly. This auxiliary information can be considered as the SCELP codebook. Due to the consideration of the characteristics of the apple- peeling code construction principle, this codebook can be stored very efﬁciently in read-only-memory. With the proposed princi- ple, low computational as well as low memory complexity can be achieved simultaneously in the SCELP codec. I. I NTRODUCTION Lossy compression of audio signals can be roughly subdivided into two principles: Perceptual audio coding is based on transform coding. The signal to be compressed is ﬁrstly transformed by an analysis ﬁlter bank, and the sub band rep- resentation is quantized in the transform domain. A perceptual model controls the adaptive bit allocation for the quantization. The goal is to keep the noise introduced by quantization below the masking threshold described by the perceptual model. In general, the algorithmic delay is rather high due to large transform lengths, e.g. [3]. Parametric audio coding is based on a source model. One parametric approach is based on linear prediction (LP), the basis for todays highly efﬁcient speech coding algorithms for mobile communications, for example [4]. An all-pole ﬁlter models the spectral envelope of the input signal. Based on the inverse of this ﬁlter, the input is ﬁltered to produce the LP residual signal which is quantized. Often vec- tor quantization with a sparse codebook is applied according to the CELP (Code Excited Linear Prediction [2]) analysis-by- synthesis approach to achieve very high compression. Due to the sparse codebook and additional modeling of the speaker’s instantaneous pitch period, speech coders perform well for speech but cannot compete with perceptual audio coding for non-speech input. The typical algorithmic delay is around 20 ms. In a previous contribution [1] we presented a new coding scheme for low delay parametric audio coding in which the principle of linear prediction is preserved while a spherical code is used in a gain-shape manner for the quantization of the residual signal at a moderate bit rate. This spherical code is based on the apple-peeling principle introduced in [5] for the purpose of channel coding and referenced in [6] in the context of a spherical code analysis. The apple-peeling code has been revisited in [7] for DPCM. In contrast to that approach, the SCELP codec employs the spherical code in a CELP scheme. The sphere construction principle according to the apple-peeling method in general enables the encoding and decoding of a signal vector without any codebook. In this contribution we propose to use auxiliary information which can be determined in advance during code construction. This auxiliary information is stored in read-only-memory (ROM) and can be considered as a compact vector codebook. At codec runtime it aids the process of transforming the spherical code vector index, used for signal transmission, into the reconstructed code vectors on encoder and decoder side. The compact codebook is based on a representation of the spherical code as a coding tree combined with a lookup table to store all required trigonometric function values for spherical coordinate transformation. Because both parts of this compact codebook are determined in advance the computational complexity for signal compression can be drastically reduced. The properties of the compact codebook can be exploited to store it with only a small demand for ROM compared to an approach that stores a lookup table as often applied for trained codebooks [11]. In this paper, the principle of the SCELP audio coding scheme will be shortly reviewed in Section II. The construction of the spherical code according to the apple-peeling method is described in Section III. A representation of this code construction principle as spherical coding tree for code vector decoding is explained in Section IV. In Section V, the principle to efﬁciently store the coding tree and the lookup table for trigonometric function values for code vector reconstruction is presented. Results considering the reduction of the compu- tational and memory complexity are given in Section VI. II. THE SCELP AUDIO CODEC The SCELP codec as proposed in [1] is based on block adaptive linear prediction. Linear predictive coding in general exploits correlation immanent to an input signal x(k) by decorrelating it before quantization. For short term block adaptive linear prediction of order N , a windowed segment of the input signal, x w (k), of length L LP C is analyzed 119 0-7803-9752-5/06/$20.00 ©2006 IEEE