Emotion Triggered H.264 Compression
Now, to understand the importance of a compression technique, let us consider the videos
shown in the figure. We notice that there is a large amount of redundancy in the consecutive
frames of these raw video files. The main aim of any compression technique is to remove the
redundancy present in a raw video signal, thereby reducing the number of binary bits required to
represent the raw video.
In order to compress a raw video, many standards are available nowadays, such as MPEG-1,
MPEG-2, JPEG, JPEG2000, and many more. A standard itself does not define the encoding
process. Rather, it defines the syntax in which the original video is represented in its compressed
form, along with the method to decode the compressed data and obtain the decoded output. It
also ensures that the encoder and decoder are compliant with each other, which means that a
raw video encoded by the encoder can be successfully decoded by a compliant decoder.
Video compression standards like MPEG-1 and MPEG-2 were developed by the Moving
Pictures Expert Group (MPEG) and are now widely used for the communication and storage of
digital video. The widely popular JPEG and JPEG2000 standards were developed by the Joint
Photographic Experts Group (JPEG) for coding still images. The ITU-T Video Coding Experts
Group developed the H.263+ standard. The latest development by the ITU-T and the Joint Video
Team (JVT) led to the H.264 standard for video compression, which is widely used nowadays.
The main advantage of the H.264 standard over the previous standards is its improved
compression efficiency for low bit-rate encoding of video sequences.
Measuring the visual quality of the video at the output of the decoder is not an easy task, as
many factors are involved in the quality-measurement process. The viewer's state of mind or
personal opinion about the quality is, for example, one important factor. Another factor is the
kind of video being encoded. For example, a person watching a football match may not look at
the details of the audience watching the match, but the same person may look closely at the
facial expressions of a newsreader reading the news on TV. The objective for which the encoding
is done also affects the quality measure; for example, people may expect a high-quality video
output for a videoconferencing or surveillance scene.
Keeping all these factors in mind, a very commonly used procedure for subjective quality
assessment of video is outlined in ITU-R Recommendation BT.500, known as the Double
Stimulus Continuous Quality Scale (DSCQS) method. The experimental setup for the
procedure is shown in Figure 2.
Figure 2. Double Stimulus Continuous Quality Scale (DSCQS) method (original video sequence → encoder → decoder → display)
In this procedure for assessing the subjective quality of the video, the viewer is shown two
versions of the same video sequence. One version (version A) is the original or reference video,
and the other (version B) is the encoded and decoded one. These two versions are shown to the
viewer in random order, and he/she rates the quality of the two versions on a continuous scale
from 'Excellent' to 'Bad'. Many such videos, each comprising two versions, are shown to the
viewer to arrive at the final assessment of the quality of the encoder and decoder.
PSNR = 10 log10 [ (2^n − 1)^2 / MSE ]
Here, MSE (Mean Square Error) is calculated between the original video and the reconstructed
video at the output of the decoder, and n is the number of bits per image sample. Although PSNR
is a convenient measure of the quality of the reconstructed video sequence, its calculation
requires the original video sequence, which may not always be available.
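For concreteness, the following is a minimal sketch of how the above PSNR formula could be evaluated for a pair of frames, assuming 8-bit samples held in NumPy arrays; the function name and data layout are our own.

import numpy as np

def psnr(original: np.ndarray, decoded: np.ndarray, bits_per_sample: int = 8) -> float:
    """PSNR in dB between an original frame and its decoded version."""
    orig = original.astype(np.float64)
    dec = decoded.astype(np.float64)
    mse = np.mean((orig - dec) ** 2)           # Mean Square Error over all samples
    if mse == 0:
        return float("inf")                    # identical frames
    peak = (2 ** bits_per_sample - 1) ** 2     # (2^n - 1)^2, e.g. 255^2 for 8-bit video
    return 10.0 * np.log10(peak / mse)

# Example: average PSNR of the luma plane over a short sequence
# frames_orig, frames_dec: lists of HxW uint8 arrays (hypothetical data)
# avg_psnr = np.mean([psnr(o, d) for o, d in zip(frames_orig, frames_dec)])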
Original: [Frames 1 to 6 of the raw video sequences]
Figure 4: Plot of average PSNR in dB versus quantization step size for the “akiyo_orig.yuv” sequence
[Plot of average PSNR in dB versus bit rate in kbps]
We encode the video with QP = 1 and QP = 25 (Figure 7b and c).
From earlier research results, it is evident that the most important regions for recognizing the
emotion of a person are the eyes and lips. For emotion-based QP setting of the macroblocks, the
ROI is therefore the eye and lip regions of the person. The macroblocks belonging to these
regions are encoded with a smaller QP value than the rest of the frame. This approach leads to
better quality in the ROI, so the emotion of the person is not lost due to compression.
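The chapter does not fix the exact QP values, so the following sketch only illustrates the idea under assumed numbers (base_qp = 30, roi_qp = 22): macroblocks whose 16×16 luma block overlaps the eye/lip ROI mask receive the smaller QP. The mask format and function name are our own.

import numpy as np

MB_SIZE = 16  # H.264 macroblock size in luma samples

def qp_per_macroblock(roi_mask: np.ndarray, base_qp: int = 30, roi_qp: int = 22) -> np.ndarray:
    """Return a QP value for every macroblock of a frame.

    roi_mask: HxW boolean array, True for pixels inside the eye/lip ROI.
    Macroblocks that overlap the ROI are coded with the smaller QP (better quality),
    the remaining macroblocks with the larger base QP.
    """
    h, w = roi_mask.shape
    mb_rows, mb_cols = h // MB_SIZE, w // MB_SIZE
    qp_map = np.full((mb_rows, mb_cols), base_qp, dtype=np.int32)
    for r in range(mb_rows):
        for c in range(mb_cols):
            block = roi_mask[r*MB_SIZE:(r+1)*MB_SIZE, c*MB_SIZE:(c+1)*MB_SIZE]
            if block.any():           # macroblock touches the ROI
                qp_map[r, c] = roi_qp
    return qp_map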
1.5 Methodology
The two main steps involved in emotion-based H.264 encoding are: (i) generation of a look-up
table that records, for every frame, the recognized emotion and the macroblocks belonging to the
ROI, and (ii) encoding of the video with the QP of each macroblock set according to this
look-up table.
A look-up table needs to be generated for all the frames in the video sequence. This look-up table
comprises information such as the frame number, the emotion expressed in a given frame, and the
macroblock numbers involved in the different ROI portions of the frame. Figure 8 shows the
schematic diagram for the look-up table generation for all the frames in a video sequence.
Figure 8: Schematic diagram for Look-up Table generation for a video sequence (video frames 1 to n pass through Feature Extraction and Emotion Recognition to produce the Look-up Table)
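One possible in-memory form of such a look-up table is sketched below; the field and function names are hypothetical and only mirror the information listed above (frame number, recognized emotion, ROI macroblock numbers).

from dataclasses import dataclass
from typing import Dict, List

@dataclass
class LookupEntry:
    frame_number: int
    emotion: str                # e.g. "anger", "disgust", "happiness", "relax", "fear"
    roi_macroblocks: List[int]  # indices of macroblocks covering the eye and lip regions

def build_lookup_table(entries: List[LookupEntry]) -> Dict[int, LookupEntry]:
    """Index the per-frame entries by frame number for use during encoding."""
    return {e.frame_number: e for e in entries}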
In order to recognize the emotion of a person in a frame, a fuzzy face-space is first constructed,
which comprises the primary and secondary membership curves of 10 known subjects. Considering
5 emotions (anger, disgust, happiness, relax and fear) and 5 facial features, we get 5×5×10 = 250
primary and secondary curves in the fuzzy face-space.
The first step is to separate the skin and non-skin regions of the image. The skin region
detection is performed in the HSV (Hue-Saturation-Value) color model. Two parameters,
namely x and y, are identified based on specific formulae; the computation of x and y is done
for each pixel according to those equations. A pixel is said to be a skin pixel provided the values
of the parameters x, y and H of the pixel satisfy the following inequalities:
140 ≤ x ≤ 195
140 ≤ y ≤ 165
0.01≤ H ≤ 0.1
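Since the formulae for x and y are not reproduced here, the sketch below assumes these two parameters have already been computed per pixel and only applies the thresholds stated above; the array and function names are our own.

import numpy as np

def skin_mask(x: np.ndarray, y: np.ndarray, hue: np.ndarray) -> np.ndarray:
    """Boolean skin mask from per-pixel x, y and HSV hue values.

    x, y : HxW arrays of the two parameters defined in the text (formulae not shown here).
    hue  : HxW array of H values in the range [0, 1].
    A pixel is declared skin only when all three inequalities hold.
    """
    return (
        (x >= 140) & (x <= 195) &
        (y >= 140) & (y <= 165) &
        (hue >= 0.01) & (hue <= 0.1)
    )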
As the skin region detection is based purely on color-value matching, parts of the body other
than the face and neck are also included in the skin regions. For example, Fig. 2(b), obtained by
applying the skin separation procedure to Fig. 2(a), contains portions of both hands apart from
the face and neck region. It is also possible that the skin separation procedure detects some
skin-colored regions in the image background, provided the background color values fall within
the skin range. These regions, along with the non-facial skin portions, do not contribute to the
features required for emotion detection; on the contrary, they may lead to incorrect emotion
inference. In order to achieve high accuracy in emotion detection, the unwanted skin regions and
glitches need to be removed. The next step, 'Face and Neck Region Extraction', is carried out for
that purpose.
In order to filter out the unwanted skin regions, the column-wise sum is obtained for all the
columns. The non-zero-column-sum windows are then marked by grouping adjacent columns
having non-zero column sums. Among these windows, the window with the maximum size
corresponds to the face and neck portion. The remaining windows are discarded by setting the
pixel values in those windows to zero. For example, in Fig. 2(b) we obtain three windows. From
left to right, the windows are: i) the right-hand region window, ii) the face and neck region
window, and iii) the left-hand region window. We select the face and neck region window as it
has the maximum size among the three; the left- and right-hand region windows are thus
eliminated. Similar operations are carried out row-wise on the new image to find the maximum
row window (based on non-zero row sums). In this way, the glitches and unnecessary skin
patches are eliminated. After the face and neck region extraction of the image in Fig. 2(b), we
obtain the image shown in Fig. 2(c).
Function: Face_and_neck_region_extraction
Input: Skin-segmented image P of m × n pixels
Output: Face and neck region extracted image P_face of h × w pixels
Begin
  S := ϕ // a set holding the sum of pixel values of each column
  count := 1 // counter for the number of windows
  For i := 1 to n do Begin
    S(i) := Σ_{k=1}^{m} P(k, i)
  End For
  window := ϕ
  min_col_list := ϕ
  max_col_list := ϕ
  window_size := 0
  min_col := 1
  max_col := n
  For i := 1 to n do Begin
    If S(i) ≠ 0 then
      window_size := window_size + 1
    Else
      If window_size ≠ 0 then
        window(count) := window_size
        min_col_list(count) := min_col
        max_col_list(count) := (i − 1)
        count := count + 1
      End If
      window_size := 0
      min_col := (i + 1)
    End If
  End For
  Find the index i for which window(i) is maximum;
  min_col := min_col_list(i)
  max_col := max_col_list(i)
  Eliminate the other windows by setting their pixel values to 0;
  Repeat the above operations row-wise to determine the row boundaries (min_row and max_row);
  return P_face := P(min_row to max_row, min_col to max_col)
End
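For readers who prefer a runnable form, the following NumPy sketch mirrors the column-window/row-window idea of the pseudocode above: it keeps the largest run of non-zero column sums and then the largest run of non-zero row sums. The function names are our own and edge handling is simplified.

import numpy as np

def largest_nonzero_window(sums: np.ndarray):
    """Return (start, end) indices of the longest run of non-zero entries in a 1-D array."""
    best = (0, -1)
    start = None
    for i, v in enumerate(sums):
        if v != 0 and start is None:
            start = i
        if (v == 0 or i == len(sums) - 1) and start is not None:
            end = i - 1 if v == 0 else i
            if end - start > best[1] - best[0]:
                best = (start, end)
            start = None
    return best

def face_and_neck_region(skin: np.ndarray) -> np.ndarray:
    """Crop a skin-segmented image to its largest column window and then its largest row window."""
    c0, c1 = largest_nonzero_window(skin.sum(axis=0))     # column-wise sums
    cropped = skin[:, c0:c1 + 1]
    r0, r1 = largest_nonzero_window(cropped.sum(axis=1))  # row-wise sums
    return cropped[r0:r1 + 1, :]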
3. Localization of Eye Region Search Area for Left and Right Eye
A sharp change in color is used as the key to localize the eye region. For example, moving down
from the forehead region, the eyebrow region has a distinctly different color from the forehead.
In this approach, the column-wise sum is first computed for each column. The columns whose
sum is less than 50% of the maximum column sum are eliminated; this narrows the search region
by redefining the min_col and max_col values. Next, in order to detect sharp color changes, we
take the row sums enclosed by the column boundaries (say sum_1 to sum_h) and calculate the
gradient between consecutive row sums, i.e., the difference between the sums of the i-th and
(i+1)-th rows. Taking the ten maximum gradients and their corresponding row indices, the least
row index among these gives the row where the eyebrows are located. If the left and right
eyebrows are not aligned in the same row, we get the row of the upper eyebrow. Once the
eyebrows are located, the upper limit of the search area is defined. The lower boundary of the
search area is limited to the upper limit plus half of the column window width (max_col − min_col).
The locations of the left and right boundaries for both eyes are determined as described in the
following algorithm:
Function: Search_area_for_eyes()
Input: P_face of h × w pixels
Output: Left_eye_area of a × b pixels; Right_eye_area of c × d pixels
Begin
  S := ϕ // a set holding the sum of pixel values of each column
  For i := 1 to w do Begin
    S(i) := Σ_{k=1}^{h} P_face(k, i)
  End For;
  Find the maximum of set S and store it in max_col_sum;
  For i := 1 to w do Begin
    If S(i) ≤ (0.5 × max_col_sum) then
      S(i) := 0
    End If
  End For;
  Mark the beginning and end of the non-zero values of S to get the min_col and max_col values;
  Z := ϕ // a set holding the difference of the sums of pixel values of two consecutive rows
  For j := 1 to h−1 do Begin
    Z(j) := Σ_{k=1}^{w} P_face(j, k) − Σ_{k=1}^{w} P_face(j+1, k)
  End For;
  Find the 10 maximum values of Z and store their corresponding row indices in A;
  min_row := min(A); // row location of the eyebrow
A look at Fig. 2 reveals that there is a sharp change in color while moving from the forehead
region to the eyebrow region. Thus, to detect the location of the eyebrow, we take the average
intensity (in the three primary color planes) over each row of the image from the top and identify
the row with the maximum dip in all three planes. This row indicates the top of the eyebrow
region (Fig. 3b). Similarly, we detect the lower eyelid by identifying the row with a sharp dip in
intensity in all three planes while scanning the face upwards from the bottommost row. The
location of the top eyelid is identified by scanning the face upwards from the marked lower
eyelid until a dip in all three color planes is noted.
Function: Estimation_of_eye_features()
Input: Left_eye_area (L_image) of a × b pixels
Output: Estimates of EOL and LEEL
Begin
  S := ϕ // set holding the differences of the row sums, scanning from top to bottom
  For j := 1 to a−1 do Begin
    S(j) := Σ_{k=1}^{b} L_image(j, k) − Σ_{k=1}^{b} L_image(j+1, k)
  End For;
  Find the 10 maximum values of S and store their corresponding row indices in A;
  Eyebrow_row := min(A); // row location of the eyebrow
  S := ϕ // set holding the differences of the row sums, scanning from bottom to top
  For j := a−1 downto 1 do Begin
    S(j) := Σ_{k=1}^{b} L_image(j+1, k) − Σ_{k=1}^{b} L_image(j, k)
  End For;
  Find the 10 maximum values of S and store their corresponding row indices in A;
  Lower_Eyelid_row := max(A); // row location of the lower eyelid
  S := ϕ // set holding the differences of the row sums between the eyebrow and the lower eyelid
  For j := Eyebrow_row to Lower_Eyelid_row do Begin
    S(j) := Σ_{k=1}^{b} L_image(j, k) − Σ_{k=1}^{b} L_image(j+1, k)
  End For;
  Find the 10 maximum values of S and store their corresponding row indices in A;
  Upper_Eyelid_row := min(A); // row location of the upper eyelid
  // Feature list
  return EOL := Lower_Eyelid_row − Upper_Eyelid_row;
  return LEEL := Lower_Eyelid_row − Eyebrow_row;
  // EOR and LEER can be estimated similarly using the right eye search area as the input image
End;
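A compact NumPy rendering of the same row-sum-gradient idea is given below; it assumes the input is a 2-D intensity array of the left (or right) eye search area, that the lower eyelid lies below the eyebrow, and it simplifies the off-by-one row conventions of the pseudocode.

import numpy as np

def eye_features(eye_area: np.ndarray, top_k: int = 10):
    """Estimate EOL and LEEL from a 2-D intensity array of the eye search area."""
    row_sums = eye_area.sum(axis=1).astype(np.float64)
    drop_down = row_sums[:-1] - row_sums[1:]   # intensity drop from row j to row j+1
    drop_up = row_sums[1:] - row_sums[:-1]     # intensity drop from row j+1 to row j

    # Rows with the ten strongest downward drops; the smallest index is the eyebrow.
    eyebrow_row = int(np.sort(np.argsort(drop_down)[-top_k:])[0])
    # Rows with the ten strongest upward drops; the largest index is the lower eyelid.
    lower_eyelid_row = int(np.sort(np.argsort(drop_up)[-top_k:])[-1]) + 1
    if lower_eyelid_row <= eyebrow_row:
        raise ValueError("lower eyelid not found below the eyebrow")

    # Search for the upper eyelid between the eyebrow and the lower eyelid.
    between = drop_down[eyebrow_row:lower_eyelid_row]
    upper_eyelid_row = eyebrow_row + int(np.sort(np.argsort(between)[-top_k:])[0])

    eol = lower_eyelid_row - upper_eyelid_row   # eye opening
    leel = lower_eyelid_row - eyebrow_row       # lower eyelid to eyebrow distance
    return eol, leel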
[Table: step-by-step feature extraction, one row per emotion (HAPPY, DISGUST, FEAR, ANGER, RELAX), showing the original image, skin region, face-neck region, right and left eye search areas, mouth search area, lip cluster, and extracted facial features]
1.6 Experimental Details and Results
The experiment is conducted with two sets of subjects: a) the first set of 10 subjects (n = 10) is
considered for designing the fuzzy face-space, and b) the other set of 30 facial expressions, taken
from 6 unknown subjects, is considered to validate the proposed emotion classification scheme.
The experiment thus consists of two distinct phases, as indicated in the next two sub-sections.
a. Construction of the fuzzy face-space
The type-2 fuzzy face-space contains both primary and secondary membership distributions for
each facial feature. In order to create the primary curves, we consider 10 known subjects. Ten
instances of one subject expressing a given emotion, say anger, are considered. We record the ten
values of a given facial feature, say EOL, from these ten snapshots. The mode of these values is
taken and the second moment about the mode is calculated. A bell-shaped curve is drawn with
its peak at the mode and the second moment about the mode as its standard deviation.
Since we have 5 facial features and the experiment includes 5 distinct emotions of 10
subjects, we obtain 10×5×5 = 250 primary membership curves. These 250 membership curves are
grouped into 25 groups, each containing the 10 membership curves of the ten subjects for a
specific feature representing a given emotion. The primary membership curves for the different
features and emotions are shown in Figure 5.
[Figure 5: Primary membership curves — primary membership plotted against feature value, with one panel per emotion (e.g., FEAR, RELAX)]
[Plots of the secondary membership distributions against feature value and primary membership]
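As a hedged sketch, one primary membership curve could be generated from the ten readings of a single feature for one subject and one emotion as follows, using the mode as the peak and the second moment about the mode as the spread; the Gaussian (bell) shape and the function name are our assumptions.

import numpy as np

def primary_membership(samples, feature_axis):
    """Bell-shaped primary membership curve from ten readings of one facial feature."""
    samples = np.asarray(samples, dtype=np.float64)
    values, counts = np.unique(samples, return_counts=True)
    mode = values[np.argmax(counts)]                    # peak of the curve
    spread = np.sqrt(np.mean((samples - mode) ** 2))    # second moment about the mode
    spread = max(spread, 1e-6)                          # avoid division by zero if all samples coincide
    axis = np.asarray(feature_axis, dtype=np.float64)
    return np.exp(-0.5 * ((axis - mode) / spread) ** 2) # membership in [0, 1], equal to 1 at the mode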
b. Emotion recognition of an unknown person
The process of emotion recognition for an unknown person is divided into two steps, as
outlined below.
1. Feature Extraction
The facial features are extracted as described in Section IV-A. The extracted features are self-
normalized by dividing each feature obtained in a given emotional state by its value in the
relaxed state. This nullifies the effect of the variation in distance during image capture. The step-
by-step feature extraction for the unknown facial image, and for the same person in the relaxed
state, is shown in tabular form. The extracted features for Fig. 7 are listed in Table IB.
As mentioned earlier, we follow two main steps to recognize the emotion of the person.
[Step-by-step feature extraction for the unknown facial image: original image, skin region, face-neck region, right and left eye search areas, mouth search area, lip cluster, extracted facial features]
TABLE IB
Calculated Feature Values for the Unknown Facial Image
EOL    EOR    MO    LEEL    LEER
7      10     19    25      27
The step-by-step facial feature extraction for the same person in the relaxed state is shown in
Table IIA, and the extracted features are tabulated in Table IIB.
TABLE IIA
Step-by-Step Feature Extraction Method for the Same Person in the Relaxed State
[Table: original image, skin region, face-neck region, right and left eye search areas, mouth search area, lip cluster, extracted facial features]
TABLE IIB
Calculated Feature Values for the Same Person in the Relaxed State
EOL    EOR    MO    LEEL    LEER
11     13     20    35      36
The final feature list for the unknown facial image is obtained by dividing the features of the
unknown image by the corresponding features of the same person in the relaxed state. Thus,
dividing the feature values in Table IB by those in Table IIB, we obtain Table III.
TABLE III
Normalized Feature Values for the Unknown Facial Image
EOL    EOR    MO     LEEL    LEER
0.64   0.77   0.95   0.71    0.75
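The division behind Table III can be reproduced directly from the values in Tables IB and IIB; a small sketch:

import numpy as np

features = ["EOL", "EOR", "MO", "LEEL", "LEER"]
unknown_state = np.array([7, 10, 19, 25, 27], dtype=float)   # Table IB
relaxed_state = np.array([11, 13, 20, 35, 36], dtype=float)  # Table IIB

normalized = unknown_state / relaxed_state                   # Table III
for name, value in zip(features, normalized):
    print(f"{name}: {value:.2f}")  # EOL: 0.64, EOR: 0.77, MO: 0.95, LEEL: 0.71, LEER: 0.75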
2. Consulting the Fuzzy face-space for emotion recognition
TABLE IVA
Consulting the Fuzzy Face-Space
Each row lists, for one of the ten known subjects, the primary membership µpri, the secondary membership µsec and their product µpri×µsec of the measured feature value under each emotion:
Feature (value) | Anger: µpri, µsec, µpri×µsec | Disgust: µpri, µsec, µpri×µsec | Happy: µpri, µsec, µpri×µsec | Fear: µpri, µsec, µpri×µsec | Relax: µpri, µsec, µpri×µsec
0.04 0.61 0.02 0.50 0.45 0.23 0.36 0.45 0.16 0.02 0.61 0.01 0 0.64 0
0 0.59 0 0.96 0.91 0.87 0 0.61 0 0 0.67 0 0 0.65 0
0 0.61 0 0.02 0.53 0.01 0.02 0.53 0.01 0.5 0.6 0.3 0 0.67 0
EOL(0.64) 0.16 0.61 0.09 0.01 0.53 0.01 0 0.65 0 0.01 0.6 0.01 0 0.64 0
0 0.62 0 0.02 0.45 0.01 0.01 0.53 0.01 0 0.68 0 0 0.64 0
Dipti_disgust 0.07 0.61 0.04 0.81 0.91 0.74 0.08 0.45 0.04 0 0.68 0 0 0.61 0
0.29 0.58 0.17 0.04 0.53 0.02 0.8 0.91 0.73 0 0.67 0 0 0.66 0
0.01 0.61 0.01 0.26 0.45 0.12 0 0.63 0 0 0.68 0 0 0.66 0
0 0.63 0 0 0.53 0 0 0.66 0 0.01 0.62 0.01 0 0.65 0
0.3 0.6 0.18 0.96 0.9 0.86 0.11 0.64 0.07 0 0.60 0 0 0.6 0
0 0.67 0 0.52 0.44 0.23 0.38 0.47 0.18 0.03 0.58 0.02 0 0.58 0
0 0.69 0 0.88 0.89 0.78 0.01 0.66 0.01 0.01 0.65 0.01 0 0.59 0
0.19 0.63 0.12 0.02 0.56 0.02 0 0.49 0 0.44 0.62 0.27 0 0.65 0
EOR (0.77) 0 0.58 0 0.01 0.45 0 0.85 0.91 0.77 0.02 0.69 0.01 0 0.63 0
0.03 0.60 0.02 0.83 0.89 0.74 0 0.61 0 0 0.62 0 0 0.67 0
0.25 0.59 0.15 0.27 0.46 0.12 0 0.69 0 0 0.64 0 0 0.69 0
0 0.62 0 0 0.52 0 0.15 0.62 0.09 0.01 0.69 0.01 0 0.61 0
0.03 0.62 0.02 0.98 0.91 0.89 0.01 0.45 0 0 0.62 0 0 0.60 0
0 0.69 0. 0.01 0.56 0.02 0.09 0.53 0.05 0 0.60 0 0 0.65 0
0.34 0.61 0.21 0.05 0.56 0.03 0 0.69 0 0 0.68 0 0 0.69 0
0.51 0.72 0.36 1 0.9 0.9 0.94 0.72 0.67 0.77 0.67 0.51 0.39 0.72 0.28
0.85 0.77 0.65 0.49 0.68 0.33 0.94 0.77 0.72 0.93 0.77 0.71 0.32 0.72 0.23
MO 0.9 0.72 0.65 0 0.6 0 1 0.9 0.9 1 0.9 0.9 0.38 0.72 0.27
(0.95) 0.99 0.9 0.89 0.13 0.6 0.08 0.86 0.6 0.51 0.75 0.67 0.51 0.03 0.72 0.02
0.46 0.45 0.21 0.44 0.68 0.3 0.99 0.9 0.89 0.88 0.77 0.67 0.45 0.72 0.3
0.62 0.68 0.42 0.72 0.72 0.51 0.95 0.77 0.73 0.89 0.67 0.59 0 0.72 0
0.84 0.77 0.64 0.36 0.68 0.24 0.92 0.77 0.71 0.92 0.77 0.71 0.18 0.72 0.12
0 0.6 0 0.9 0.9 0.81 0 0.51 0 0 0.6 0 0.01 0.72 0.01
0.02 0.51 0.01 0.9 0.9 0.81 0.96 0.6 0.57 0.16 0.51 0.08 0.05 0.72 0.03
0.81 0.60 0.49 0 0.6 0 0.91 0.77 0.71 1 0.9 0.9 0.21 0.72 0.15
0.14 0.48 0.07 0.9 0.9 0.81 0 0.45 0 0.5 0.48 0.24 0 0.5 0
0 0.58 0 0.01 0.6 0.01 0 0.48 0 0.01 0.5 0.01 0 0.5 0
0.06 0.54 0.03 0 0.64 0 0.57 0.48 0.27 0.09 0.5 0.05 0 0.48 0
0.35 0.48 0.2 0.56 0.8 0.45 0 0.45 0 0.05 0.48 0.02 0 0.49 0
LEEL (0.71) 0.55 0.45 0.25 0.84 0.9 0.76 0 0.48 0 0 0.49 0 0 0.48 0
0.45 0.53 0.24 0 0.45 0 0.86 0.6 0.52 0.01 0.45 0 0 0.5 0
0.03 0.53 0.02 0.97 0.9 0.87 0 0.48 0 0.03 0.49 0.01 0 0.48 0
0 0.48 0 0 0.6 0 0 0.48 0 0 0.49 0 0 0.48 0
0 0.58 0 0.99 0.9 0.89 0 0.53 0 0 0.5 0 0 0.49 0
0.12 0.48 0.06 0.99 0.9 0.89 0 0.48 0 0 0.5 0 0 0.5 0
TABLE IVB
Final Range for Facial Features
Emotion | Range of Features (EOL, EOR, MO, LEEL, LEER) | Range after intersection | Centre Value
Anger 0 – 0.18 0 – 0.21 0 – 0.89 0 – 0.18 0.09
Disgust 0 – 0.87 0 – 0.89 0 – 0.9 0 – 0.87 0.435
Happy 0 – 0.73 0 – 0.77 0 – 0.9 0 – 0.52 0.26
Fear 0 – 0.3 0 – 0.27 0 – 0.9 0 – 0.24 0.12
Relax 0–0 0–0 0 – 0.3 0–0 0
From the last column of Table IVB, it is seen that the emotion 'disgust' has the highest centre
value among all the emotion classes. Thus, it is concluded that the emotion expressed by
unknown facial image 1 is disgust.
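The last two columns of Table IVB follow from the per-feature µpri×µsec products; a sketch of the computation, assuming each feature's range runs from 0 up to the largest product observed for that emotion and that the centre value is the midpoint of the intersected range:

def emotion_ranges(products_per_feature):
    """Intersect per-feature ranges [0, max product] and return (upper bound, centre value).

    products_per_feature: for one emotion, a list of lists of µpri×µsec values,
    one inner list per facial feature (over the ten known subjects).
    """
    upper_bounds = [max(products) for products in products_per_feature]
    intersection_upper = min(upper_bounds)   # all ranges start at 0
    centre = intersection_upper / 2.0
    return intersection_upper, centre

# Example with the 'Anger' maxima read off Table IVB:
# emotion_ranges([[0.18], [0.21], [0.89]])  ->  (0.18, 0.09)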
UNKNOWN IMAGE 1:
Let us consider the unknown facial image shown in Figure 7. The process of emotion recognition
for the unknown person is divided into two steps, as outlined below.
1. Feature Extraction
TABLE VI
Step-by-Step Feature Extraction Method
[Table: original image, skin region, face-neck region, right and left eye search areas, mouth search area, lip cluster, extracted facial features]
TABLE VIII
Calculated Feature Value
EOL EOR MO LEEL LEER
0.636 0.6371 1.77 0.969 0.968
TABLE III
CALCULATED FEATURE RANGES AND CENTRE VALUE FOR EACH EMOTION
1.7 Conclusion
This chapter proposed a simple and time-efficient scheme for emotion recognition from a pre-
constructed type-2 fuzzy face-space. Experiments reveal that the classification accuracy of
emotion when both type-2 primary and secondary memberships are considered is as high as
96.67%. The accuracy falls by more than 8% when only the type-2 primary memberships are
considered. The classical rule-based method for emotion classification depends largely on the
relational matrix used to represent implication relations. In the present context, the emotion
analysis is intentionally performed on the fuzzy encoded measurement space to make the system
performance robust.
1.8 Summary