2 imageAudioVideo
2 imageAudioVideo
Multimedia Basics
and
Representation
Text
• Text is a vital element of multimedia
presentations.
• Words and symbols in any form, spoken or
written, are the most common system of
communication. They deliver the most
widely understood meaning to the greatest
number of people— accurately and in
detail.
…continued
• Text is a visual representation of language,
as well as a graphic element in its own
right.
• The study of how to display text is known
as typography. It concerns the precise shape
of characters, their spacing, the layout of the
lines and paragraphs, etc
Typefaces and fonts
• To display text, we need to have a visual
representation of the characters stored as codes in the
computer.
• A typeface is a family of graphic characters with a
coherent design and usually includes many sizes and
styles.
• A font is a set of graphic characters with a specific
design in a specific size and style.
• For example, the typeface used in this paragraph
is ‘Arial’. The font is ‘Arial 28pt’. Arial may contain
many fonts such as Arial Black, Arial narrow etc
Classification of typeface
• Serif means tick marks that you often see at the
endings of a stroke in some fonts such as:
– Times new roman
– Courier
• Sans serif usually refers to those fonts that do
not have tick marks at the character endings.
Sans means without…Examples include
– Arial
– Helvetica
– Universe 55
• Note:-Serif typefaces may look
beautiful but could be difficult to
read in some environments, such as
on multimedia images etc or for
Example fonts and sizes
• Arial 32: Multimedia systems
• Times New Roman 22: Multimedia systems
• Albertus extra bold 24: Multimedia systems
• Algerian 28: Multimedia systems
• Abyssinica 30: Multimedia systems
• Apple chancery 30: Multimedia systems
• Bauhaus 93 22: Multimedia systems
• W Outline font
Font attributes
Five attributes are often used for specifying a font:
• Family — fonts in the same family have similar design, look and feel.
Here are some of the common families:
– Times, Helvetica, Courier, Garamond, Univers
• Shape — refers to the different appearance within a family.
– normal (upright), italic, SMALL CAP
• Weight — measures the darkness of the characters, or the thickness of
the strokes. The commonly used names are:
– ultra light, extra light, light, semi light, medium, semi bold, bold,
extra bold,etc.
• Width — the amount of expansion or contraction with respect to the
normal or medium in the family.
• Size — unit of measure is point.
– 1 inch = 72.27 point in printing industry.
– 1 inch = 72 point in PostScript systems.
An image
contains
a header and
a bunch of
(integer) numbers.
Types of Digital Images
• Grayscale image
– Usually we use 256 levels for each pixel.
That means, the numerical value for gray
levels range from 0 (for black pixels) to
FF (256) for white. Thus we need 8 bits
to represent each pixel (28 = 256)
– Gray scale ranges from black to grays A 8 bit grayscale
and finally to white. Image.
• Binary Image
– A binary image has only two values (0 or
1). A numerical value may represent
either a black (0) or a white (1) dot/pixel.
– Binary image is quite important in image
analysis and object detection
applications.
RGB Color Model
• To form a color with RGB, three B
separate color signals of one red,
one green, & one blue must be R
mixed. Each of the three signals
can have an arbitrary intensity,
from fully off to fully on, in the
mixture. G
–The RGB color model is an additive
color mixing model using which every
color can be encoded as a combination
of red, green, & blue light.
–Projection of primary color lights on a
screen shows secondary colors where
they overlap; for instance, the Red
combination of all three of red, green,
& blue in appropriate intensities
makes white.
RGB Color Model
• The main purpose of the RGB color
model is for display of images in
electronic systems, such as televisions
and computers.
– Typical RGB input devices are color TV &
video cameras, image scanners, and digital
cameras.
– Typical RGB output devices are TV sets of
various technologies (CRT, LCD, plasma,
etc.), computer and mobile video projectors,
phone displays, etc.
– Color printers, on the other hand, are usually
not RGB devices, but subtractive color
devices (typically CMYK color model).
RGB color model
The figure shows an RGB image, along with its
separate R, G and B components. Note that,
–strong red, green, and blue produces white
color; like wise, strong red and green with little
blue gives brown; strong green with little red or
blue gives dark green; strong blue and
moderately strong red and green provides light
blue sky.
The number of bits used to represent each pixel in RGB space is
called the pixel depth.
–Consider an RGB image in which each of the red, green, and
blue color is an 8-bit representation. Under these conditions
each RGB color pixel have a depth of 24 bits.
–Compute the total number of colors in a 24-bit RGB image ?
Color Image
• Characterization of light is central
to the science of color.
• There are different color models:
RGB, YUV, YIQ, HSV, CMYK
(Cyan, Magenta, Yellow, Black),
etc. color model
R
B
24 bit image
Color Table
Clusters of colors
•Video:
–A series of framed images put together, one after another, to
simulate motion and interactivity. A video can be transmitted by
number of frames per second and/or the amount of time between
switching frames.
–The difference between video and animation is that video is broken
down into individual frames.
Digital Media
• In computers, audio, image and video are stored as files
just like other text files (e.g. DOC, TXT, TEX, etc.).
– For images, these files can have an extension like
• BMP, JPG/JPEG, GIF, TIF, PNG, PPM, …
– For audios, the file extensions include
• WAV, MP3, m4a, AMR, WMA…
– The videos files usually have extensions:
• MOV, AVI, MPEG, MP4, 3gp, …
An Audio Signal
0.8
0.6
0.4
0.2
Amplitude
-0.2
-0.4
Sampling period Ts,
-0.6
fs =1/Ts
-0.8
-1
0 10 20 30 40 50 60 70 80 90 100
t
Sampling on a grid
• Video quantization is
time essentially the same as
Frame N-1
image quantization
• During video
quantization each pixel
is represented by a bit
depth of, say 8-bits
Frame 0
representing luminance
and color information.
Human visual system
• What characteristics of the human visual system can be exploited
in related to compression of color images and video?
• The eye is basically sensitive to color intensity
– Each neuron is either a rod or a cone . Rods are not sensitive to color.
– Cones come in 3 types: red, green and blue.
– Each responds differently --- Non linearly and not equally for RGB
differently to various frequencies of light.
Color System in Video
• Video signals are often transmitted to the receiver over a
single television channel
–In order to encode color, a video signal is decomposed into three
sub-signals: a luminance signal and two color signals.
–Since human vision is more sensitive to brightness than to color, a
more suitable color encoding system separates the luminance from
color information. Such models include YUV, YIQ, etc.
The YUV color model: While RGB model
separates colors, YUV model separates
brightness (luminance) information from
the color information. Y is the luminance
component (brightness) and U and V are
color components
– It is obtained from RGB using the following
equations.
Y = 0.299 R + 0.587 G + 0.144 B
U=B–Y
V=R-Y
Y U V
Color System in Video
YIQ color model
• YIQ color model is a similar encoding system
like YUV.
• It produces the I and Q colors and adds the
modulated signal to the luminance Y.
– It is obtained from RGB using the following
equations.
Y = 0.3 R + 0.59 G + 0.11 B
I = 0.60 R – 0.28 G – 0.32 B
Q = 0.21 R – 0.52 G + 0.31 B
I Q
Video Storing format & compression
• Each video formats support various resolutions and color
presentation. The following are the well-known video formats
• The Color Graphics Adaptor (CGA):
–Has a resolution of 320 x 200 pixels with simultaneous display of four
colors. (320*200 *log42 (2)=16000 byte
–What the necessary storage capacity per frame ?
• The Enhanced Graphics Adaptor (EGA):
–Supports display resolution of 640 x 350 pixels with 16 simultaneous
display colors
–What the necessary storage capacity per frame ?
• The Video Graphics Array (VGA):
–Works mostly with a resolution of 640 x 480 pixels with 256
simultaneous display colors
–What the necessary storage capacity per frame ?
• The Supper Video Graphics Array (VGA):
–Can present 256 colors at a resolution of 1024 x 768 pixels.
–What the necessary storage capacity per frame ?
–Other SVGA modes include 1280 x 1024 pixels and 1600 x 1280 pixels.
Exercise
• Suppose we have 24 bits per pixel available for a color
image. We also note that humans are more sensitive to
red and green colors than to blue, by a factor of
approximately 1.5 times. How may we design a simple
color representation to make use of the bits available?
• Why we use different type of format for specific media?
• Quite a simple scheme:
– Since Blue is less perceptually important use less bits to
represent blue color. Use proportionately more bits for red
and green rather than blue
– Therefore Red and Green use 9 bits each and Blue 6 bits to
represent values
– Need to quantize at different levels for blue and Red/green