mpeg audio
mpeg audio
Audio
Signal to
Mask Ratio
Psychoacoustic
Model
MPEG Approach
• MPEG standardizes only bitstream format and
decoder, not the encoder (“informative part”)
• MPEG-2 Audio
– Low sampling frequencies audio
add 16 - 24 kHz to Layer 1, 2, 3
– Multichannel audio, BC
AT&T (PXFM)
CNET
• Filter bank
– subdividing the input signal into spectral
components
– more lines more coding gain
– longer impulse response pre-echo artifacts
• Layer II
– Frame length: 1152 samples (24 ms@ 48 kHz)
– Frequency resolution: 32 subbands
– Quantization: Block-companding (12 samples)
– Use of Scalefactor select information
Prof. Dr.-Ing. K. Brandenburg, [email protected] Dr.-Ing. G. Schuller, [email protected] Page 9
MPEG Audio - Short Description of the Layers (2)
• Layer III
– Standard frame length: 1152 samples (24 ms @
48 kHz)
– Frequency resolution: 576/192 subbands
– Quantization: non-uniform with Huffman coding
– Use of Scalefactor Select Information
• Specifics of Layer-3:
– Hybrid filter bank (32*18 = 576 subbands or
32*6=192 subbands)
– nonuniform quantization (implicit noise shaping)
with a power law ( ^.75)
– Huffman coding
– Analysis-by-Synthesis structure
– Bitreservoir (short time buffer)
– Support for variable bitrate (mandatory)
32*18=576 bands
3dB
yk
mirrored original
Nyquist frequency fp
passband
6/18 -
bands
32 -
bands
6/18
bands
• Solution:
– Less downsampling in first stage (non-critical
sampling)
– better filter
– Aliasing reduction with subtraction from
neighboring bands (e.g.MP3)
– no cascaded filter bank (e.g. AAC)
• Detailed possibilities:
– Bit allocation between fixed „Worst-Case“ and
„Maximum-SNR“ situations
– Bit allocation is calculated from the Threshold
Estimation
– Direct calculation of the allowed noise (Noise
Allocation)
– Simplified „Noise Allocation“
Prof. Dr.-Ing. K. Brandenburg, [email protected] Dr.-Ing. G. Schuller, [email protected] Page 36
Layer-3 : Outer Loop
• Distortion Loop (control of the distortion)
• “Downward compatibility”:
LS RS Also 3/1, 3/0, 2/2, 2/1, 1/0 supported
• Multi-Lingual capability
Up to 7 channels e.g. for different languages,
commentary channel, “clean dialog” etc.
Prof. Dr.-Ing. K. Brandenburg, [email protected] Dr.-Ing. G. Schuller, [email protected] Page 54
MPEG - 2 Audio Multichannel
Structure of the ISO 13818-3
Layer II multichannel
extension, backwards
compatible with ISO 11172-3
Layer II