FRAME BIT ALLOCATION FOR H.264 USING CAUCHY-DISTRIBUTION BASED SOURCE MODELLING Nejat Kamaci and Yucel Altunbasak Center for Signal and Image Processing Georgia Institute of Technology Atlanta, GA 30332 email:{kamaci,yucel}@ece.gatech.edu ABSTRACT Based on the observation that a Cauchy density is more accurate in estimating the distribution of the AC coefﬁcients than the tra- ditional Laplacian density for H.264 video coders, a rate model with improved accuracy is derived. Using the new rate model, a frame bit-allocation algorithm for the rate control of an H.264 video coder is developed. Extensive analysis with carefully se- lected anchor video sequences demonstrates a 0.43 dB average PSNR improvement over the JM 8.4 rate control algorithm, and a 0.27 dB average PSNR improvement over the TM5-based bit- allocation algorithm that has recently been proposed for H.264 by Ma et al. The analysis also demonstrates 20% and 60% reductions in PSNR variation among the encoded pictures when compared to the JM 8.4 rate control algorithm and the TM5-based bit-allocation algorithm, respectively. 1. INTRODUCTION Over the past few decades, transform-based compression for im- age and video sources has gained widespread popularity for visual information management, processing, and communications. As a result, several industry standards have been developed, such as MPEG-2 [1] and H.264 [2] for video coding. With all of these image and video processing methods, the image frame(s) is di- vided into nonoverlapping blocks, and a transformation is applied to the block before quantization and entropy coding. The two- dimensional discrete cosine transform (DCT) is the most common transform used in these methods 1 . The H.264/AVC (equivalently MPEG-4 Part 10) standard is one of the most advanced video coding standards that has been de- veloped. H.264/AVC features a number of new technologies such as intra prediction, integer transform and multi-frame prediction, in addition to improvements on the existing technologies. In com- mon with the earlier standards, the H.264/AVC does not explicitly deﬁne an encoder-decoder pair. Many functional parts of the en- coder and the decoder are left open for optimization. One of these functional parts is the rate-control module that is responsible for controlling the output bit rate of the encoder. 1.1. Rate control for H.264/AVC The output bit rate and video quality of a video encoder depend on several coding parameters such as the quantization scale (Q) and 1 The H.264 coder uses an integer transform that is a close approxima- tion to the DCT. coding mode. In particular, choosing a large quantization scale re- duces the resulting bit rate, while at the same time reducing the visual quality of the encoded video. In most applications, a pre- determined constant output bit rate is desired. These applications are referred to as constant bit-rate (CBR) applications. The output bit rate of an encoder can be controlled by carefully selecting the quantization parameters for each coding block. This task is per- formed by the rate-control module. The goal of rate-control unit is to keep the output bit rate within constrained limits while achiev- ing maximally uniform video quality. For practical reasons, the rate-control problem is usually stud- ied in three subproblems: (i) GOP bit allocation, (ii) picture bit allocation, and (iii) macroblock Q selection. GOP bit allocation involves selecting the number of bits to allocate to a GOP, which in the case of CBR rate-control, simply amounts to assigning a ﬁxed number of bits per GOP. Picture bit allocation involves distributing the GOP budget among the picture frames, so as to achieve a max- imal, uniform video quality. Macroblock Q selection involves tun- ing the Q parameter for each macroblock of a frame so that the the rate regulations are met and a uniform quality is achieved within the picture. As in the H.264 reference software, Q selection may also affect the motion estimation and compensation operations. Many conventional rate-control algorithms use rate and distor- tion models for their operation. The performance of a rate-control algorithm greatly depends on its ability to estimate the rate and the distortion. In our earlier study we have demonstrated that the accu- racy of modelling rate-distortion relation can be improved by us- ing a Cauchy-distribution ﬁt to the DCT coefﬁcients, especially for the H.264 video coders [3]. The Cauchy distribution’s heavy tails help estimating the actual statistical distribution of the transform coefﬁcients more accurately for larger values. Consecutively, it helps improving the accuracy of the rate and distortion models. In this study, we demonstrate the beneﬁt of using Cauchy-distribution based rate model in frame bit allocation for rate control purposes. 2. RATE MODELLING USING CAUCHY-DISTRIBUTION The entropy of a uniformly quantized Cauchy source with param- eter µ and quantization level Q is given in [3] by H (µ, Q) = − 2 π ξ  1 2 , 2µ, Q  log 2 ξ  1 2 , 2µ, Q  − 2 π ∞  i=1 ξ (i, µ, Q) log 2 ξ (i, µ, Q) , (1) II - 57 0-7803-8874-7/05/$20.00 ©2005 IEEE ICASSP 2005