Date post: | 07-Apr-2018 |
Category: |
Documents |
Author: | sreedevi-abhilash |
View: | 235 times |
Download: | 0 times |
of 38
8/6/2019 MP3 Tutorial
1/38
8/6/2019 MP3 Tutorial
2/38
Introduction
Motion Picture Experts Group (MPEG)
I
nternational Standards Organization (I
SO) First High Fidelity Audio standard
Part of a multiple standard for
Video compression Audio compression
Audio, Video and Data synchronization at an
aggregate rate of1.5 Mbit/sec
8/6/2019 MP3 Tutorial
3/38
MPEG Audio Physically Lossy compression algorithm
Perceptually lossless, transparent algorithm Exploits perceptual properties of human ear
Psychoacoustic modeling
8/6/2019 MP3 Tutorial
4/38
Medium quality audio
compression Code Excited Linear Prediction
for speech coding
-law
Adaptive Differential Pulse Code
Modulation
8/6/2019 MP3 Tutorial
5/38
The MPEG Audio standard
Ensures inter-operability
Defines coded bitstream syntax
Defines decoding process
Guarantees decoders accuracy
8/6/2019 MP3 Tutorial
6/38
MPEG audio acceptance Wide acceptance
Large number of MPEG audio codecs
produced
Stand-alone, Mobile phone add-ons etc
8/6/2019 MP3 Tutorial
7/38
MPEG audio features No assumptions about the nature of the
audio source
Exploitation of human auditory system
perceptual limitations
Removal of perceptually irrelevant parts of
audio signal
8/6/2019 MP3 Tutorial
8/38
MPEG audio sampling rates 32 kHz
44.1 kHz
48 kHz
8/6/2019 MP3 Tutorial
9/38
MPEG audio supports One or two audio channels in
a monophonic mode for a single audio channel
a dual monophonic mode for two independent
audio channels
a stereo mode with sharing of bits
a joint stereo mode based on the correlation orthe phase difference between channels
8/6/2019 MP3 Tutorial
10/38
MPEG audio supports Several predefined fixed bit rates ranging
from 32 to 224 kbits/sec per channel
Free bit rate other than the predefined rates
8/6/2019 MP3 Tutorial
11/38
MPEG audio offers Three independent layers of compression
A wide range of tradeoffs between codec
complexity and compressed audio quality
8/6/2019 MP3 Tutorial
12/38
8/6/2019 MP3 Tutorial
13/38
MPEG Audio LayerII
Intermediate complexity
Bit rates around 128 kbits/sec per channel
Digital Audio Broadcasting (DAB)
Synchronized Video and Audio on CD-
ROM
Full motion CD-I
Video-CD
8/6/2019 MP3 Tutorial
14/38
MPEG Audio LayerIII
Most complex coding
Best audio quality
Bit rates around 64 kbits/sec per channel
Suitable for audio overISDN
8/6/2019 MP3 Tutorial
15/38
MPEG Audio extras All three layers allow single chip real-time
decoder implementation
Optional Cyclic Redundancy Check (CRC)
error detection
Ancillary data may be included in the bit
stream
8/6/2019 MP3 Tutorial
16/38
Overview Quantization, the key to MPEG audio
compression
Transparent, perceptually lossless
compression
No distinction between original and 6-to-1
compressed audio clips
stereo, 16 bit/sample, sampled at 48 kHz,
compressed at 256 kbits/sec
8/6/2019 MP3 Tutorial
17/38
8/6/2019 MP3 Tutorial
18/38
The Polyphase Filter Bank
Key component common to all layers
Divides the audio signal into 32 equal-width
frequency subbands
The filters provide good time and
reasonable frequency resolution
Critical bands associated with
psychoacoustic models
8/6/2019 MP3 Tutorial
19/38
8/6/2019 MP3 Tutorial
20/38
8/6/2019 MP3 Tutorial
21/38
8/6/2019 MP3 Tutorial
22/38
Psychoacoustics The aim is to remove acoustically irrelevant
parts of the audio signal
The human auditory system is unable to
hear quantization noise under conditions of
auditory masking
Masking occurs whenever a strong signalmakes a neighborhood of weaker audio
signals imperceptible
8/6/2019 MP3 Tutorial
23/38
Critical bands The human auditory system has a limited,
frequency dependent resolution
This frequency dependence is expressed in
the form of critical band widths, less then
100 Hz for low and more then 4kHz for
high frequencies The human ear blurs the various signal
components inside a critical band
8/6/2019 MP3 Tutorial
24/38
Noise masking threshold Human ear resolving power is frequency
dependent
Noise masking threshold, at any frequency,
depends only on the signal energy within a
limited bandwidth neighborhood that
frequency
8/6/2019 MP3 Tutorial
25/38
8/6/2019 MP3 Tutorial
26/38
8/6/2019 MP3 Tutorial
27/38
8/6/2019 MP3 Tutorial
28/38
The Psychoacoustic Model
Analyzes the audio signal and computes the
amount of noise masking as a function of
frequency
The encoder decides how best to represent
the input signal with a minimum number of
bits
8/6/2019 MP3 Tutorial
29/38
Basic Steps Time align audio data Convert audio to frequency domain
representation Process spectral values into tonal and non-
tonal components
Apply a spreading function Set a lower bound for threshold values
Find the threshold values for each subband
Calculate the signal to mask ratio
8/6/2019 MP3 Tutorial
30/38
8/6/2019 MP3 Tutorial
31/38
8/6/2019 MP3 Tutorial
32/38
8/6/2019 MP3 Tutorial
33/38
8/6/2019 MP3 Tutorial
34/38
MPEG LayerIII
coding Based on LayerI&II filter banks
Compensation of filter deficiencies by
processing outputs with a Modified Discrete
Cosine Transform
8/6/2019 MP3 Tutorial
35/38
8/6/2019 MP3 Tutorial
36/38
LayerIII
enhancements Alias reduction
Non uniform quantization
Scalefactor bands
Entropy coding of data values
Use of a bit reservoir
8/6/2019 MP3 Tutorial
37/38
8/6/2019 MP3 Tutorial
38/38