BIT RATE ESTIMATION FOR COST FUNCTION OF 4x4 INTRA MODE
DECISION OF H.264/AVC
Mohammed Golam Sarwer, and Lai Man Po
Department of Electronic Engineering, City University of Hong Kong
Kowloon , Hong Kong SAR, China
sarwer@ee.cityu.edu.hk , eelmpo@cityu.edu.hk
ABSTRACT
H.264/AVC is a newest international video coding standard
that can achieve considerably higher coding efficiency than
previous standards. This comes at the cost of the complex
mode decision procedure using the rate-distortion
optimization, which makes real-time encoding difficult. To
reduce the complexity of rate-distortion cost, we propose a
bit rate estimation technique to avoid the entropy coding
method during mode decision of intra prediction. The
estimation method is based on the properties of context-
based variable length coding (CAVLC). Simulation results
demonstrate that the proposed estimation method achieves
up to 53 % reduced encoding time of intra coding with
ignorable degradation of coding performance.
1. INTRODUCTION
The appearance and development of various new
multimedia services have need for higher coding efficiency.
The ITU-T/ISO/IEC Joint Video Team established the
newest video coding standard known as H.264/AVC [1].
H.264/AVC offers a significant performance improvement
over previous video coding standards such as H.263++ and
MPEG-4 [2,3]. New and advanced techniques are
introduced in this new standard, such as intra prediction for
I-frame encoding, multi-frames inter prediction, small
block-size transform coding, context-adaptive arithmetic
entropy coding, de-blocking filtering, etc. These advanced
techniques make this new standard provides approximately
50% bit rate saving for equivalent perceptual quality relative
to the performance of prior standards.
To achieve the highest coding efficiency, H.264/AVC uses
rate-distortion optimization (RDO) technique to get the best
coding result in terms of maximizing coding quality and
minimizing bit rates. This means that the encoder has to
code the video by exhaustively trying all the mode
combinations including the different intra and inter
prediction modes. Therefore, the complexity and
computation load of video coding in H.264/AVC increase
drastically compared to any previous standards.
To reduce computational complexity of H.264/AVC, a
number of efforts have been made to explore the fast
algorithm in motion estimation, intra mode prediction and
inter mode prediction for H.264/AVC video coding [4-6]. In
4x4 intra mode decision, 9 prediction modes are used. To
reduce the complexity, H.264/AVC reference software
suggested [7] sum of absolute difference (SAD) and sum of
absolute transform difference (SATD) based cost functions.
These two cost functions reduce computation significantly
but performance of rate-distortion characteristics is not good
enough. To improve the rate-distortion performance, a new
cost function for intra 4x4 mode decision is proposed in [8].
In this cost function, sum of absolute integer transform
difference (SAITD) is used in distortion part and a rate
prediction algorithm is used in rate part. The major
drawback of this cost is that the bit estimation method can
not give the very good estimation.
In this paper, we propose a shortcut way to get the number
of entropy coded bits as soon as the transform coefficients
are quantized. A method for estimation of rate for cost
function of 4x4 intra mode decision is proposed. This
method is based on the properties of context-based variable
length coding (CAVLC) and observation of VLC tables.
2. COST FUNCTION FOR INTRA 4X4 MODE
DECISION OF H.264/AVC
To get a richer set of intra-prediction patterns, H.264/AVC
offers 9 prediction modes for 4x4 luma blocks and 4
prediction modes for 16x16 luma blocks. The best mode is
the one having minimum rate-distortion cost and this cost is
expressed as
R SSD J
RD
⋅ + = λ (1)
where, the SSD is the sum of squared difference between the
original block and the reconstructed block, and R is the true
bits needed to encode the block and Ȝ is an exponential
function of the quantization parameter (QP).
In order to compute RD cost for each mode, same operation
of forward and inverse transform/quantization and variable
length coding is repetitively performed. All of these
1579 1-4244-1017-7/07/$25.00 ©2007 IEEE ICME 2007