ESCAPED-HUFFMAN AND ADAPTIVE RECURSIVE RICE CODING FOR LOSSLESS
COMPRESSION OF THE MAPPED DOMAIN LINEAR PREDICTION RESIDUAL
Noboru Harada
1
, Yutaka Kamamoto
1
, Takehiro Moriya
1
1
NTT Communication Science Laboratories, Nippon Telegraph and Telephone Corporation, Japan
ABSTRACT
ITU-T Recommendation G.711.0 has just been established. It
defines a lossless and stateless compression for G.711 packet
payloads (for both A-law and ȝ-law). This paper introduces some
coding technologies proposed and applied to the G.711.0 codec,
such as Plus-Minus zero mapping for the mapped domain linear
predictive coding and escaped-Huffman coding combined with
adaptive recursive Rice coding for lossless compression of the
prediction residual. Performance test results for those coding tools
are shown in comparison with the results for the conventional
technology. The performance is measured based on the figure of
merit (FoM), which is a function of the trade-off between
compression performance and computational complexity. The
proposed tools improve the compression performance by 0.16% in
total while keeping the computational complexity of
encoder/decoder pair low (about 1.0 WMOPS in average and 1.667
WMOPS in the worst-case).
Index Terms— Speech coding, Standardization, ITU-T
Recommendation G.711.0, Lossless compression of G.711 payload,
Plus-Minus zero mapping, Escaped-Huffman coding, Adaptive
recursive Rice coding, Mapped domain linear predictive coding.
1. INTRODUCTION
The ITU Rec. G.711 [1] is widely used for narrowband telephony
applications, including PSTN/GSTN and packet-based network
applications such as VoIP, and has been used for many decades
because of its proven voice quality, ubiquity, and utility. ITU has
just established a lossless coding technology for G.711 encoded
payloads. This new standard is ITU-T Rec. G.711.0 [2].
The G.711.0 codec may be used as a traditional codec and its
use negotiated (end-to-end) by the end terminals (IP phones,
conference bridge endpoints, etc.). Additionally, owing to its
lossless and stateless design, G.711.0 may also be used as a
lossless compression mechanism on any intermediate link (e.g.,
service provider VoIP backbone links at voice gateways) where
G.711 is used by the end systems. G.711.0 employed in these
transcoding applications provides bandwidth savings without any
degradation of audio quality relative to G.711 since it is a lossless
algorithm. For these gateway applications, low computational
complexity is desired. The figure of merit (FoM), defined in the
G.711.0 Terms of Reference (ToR) [3], was used to assess the
tradeoff between complexity and signal compression during the
design phase and in the G.711.0 selection process [4].
G.711.0 is a lossless compression algorithm that operates on 40,
80, 160, 240, and 320 samples per 8-kHz sampled G.711 input
frame. The bit rate is variable and the size of the (compressed)
output frame depends on the input signal characteristics. The
minimum size of an encoded frame is one byte. The maximum size
of an encoded frame is the input frame size plus one byte.
Following coding tools are included in G.711.0: An
uncompressed coding tool, constant coding tools (Constant Plus
zero coding, Constant Minus zero coding, Constant non-zero
coding), a Plus-Minus (PM) zero Rice coding tool, a binary coding
tool, a pulse mode coding tool, a value-location coding tool, a
fractional-bit coding tool, a min-max level coding tool, a direct
linear predictive coding tool, and a mapped domain linear
predictive (MDLP) coding tool.
The MDLP coding is a kind of LP coding but especially
designed for G.711 A-law and ȝ-law input (A similar scheme had
been also proposed by F. Ghido, et. al. [5]).
This paper introduces some new coding schemes proposed and
applied to the G.711.0 codec, especially related to the MDLP
coding tool. PM zero mapping is used to calculate the prediction
residual and Escaped-Huffman (E-Huffman) coding combined
with adaptive recursive Rice coding is used as an entropy coding
scheme for the prediction residual. Test results are also examined
in terms of the compression performance/computational
complexity trade-off based on the FoM.
Section 2 overviews the mapped domain linear prediction.
Sections 3 and 4 introduce PM zero mapping and E-Huffman
coding combined with adaptive recursive Rice coding, respectively.
Section 4 shows the test results. Finally, Section 6 concludes the
paper.
2. MAPPED DOMAIN LINEAR PREDICTION
Figure 1 shows a block diagram of the MDLP encoding tool used
in the G.711.0 encoder. The MDLP coding tool takes a sequence
of N G.711 A-law, ()
A
I n , or G.711 μ-law, () I n
μ
, symbols. First,
these N G.711 symbols are converted into
PCM
() x n , 0 n N ≤ < in
the uniform (linear) PCM domain and a short-term prediction is
carried out in that domain using LP analysis. The prediction
residual signal, however, is obtained in the range of [ 255, 255] −
since the predicted value is subtracted from the target value
int 8
() x n in the 8-bit logarithmic domain (denoted as the int8
domain in this paper). PARCOR coefficients are used to represent
and signal the LPC parameter.
Linear prediction is applied as follows:
( )
int8 PCM int8 int8 PCM int8
1
ˆ () ( )
P
i
i
x n f a f x n i
→ →
=
ァ キ
ィ ク
= ⋅ −
ィ ク
ゥ ケ
ヲ
…(1)
where
i
a is the i -th LPC coefficient of P -th order prediction,
int8
( ) x n i − and
int8
ˆ () x n are the previous sample value and the
predicted sample value in the int8 domain, and
PCM int8
f
→
and
4646 978-1-4244-4296-6/10/$25.00 ©2010 IEEE ICASSP 2010