IEE 5037 Multimedia CommunicationsLecture 12: MPEG-4
Dep
t. Electro
nics E
ng
ineerin
g,N
ational Chiao T
ung University Adapted from Prof. Hang’s slides
MPEG-4 Video Coding
Part 2: Object-oriented codingFGS – Scalable coding
To be dropped from the standard?
Overview of MPEG-4
MPEG-4
ISO/IEC …/WG11: A standard for multimedia applicationsHistory: (Rao & Hwang, Chap. 12)— Nov. 1992: MPEG started new work item— Nov. 1994: Call for proposals — many submitted— Nov. 1995: Subjective testing and tool evaluation— Jan. 1996: Define Verification Model (VM1) (encoder)— July 1996: Evaluate SNHC proposals— Nov. 1996: Working draft (WD)— Apr. 1997: Video VM 7.0 (WD 3.0)— Nov. 1997: Committee draft (CD): IS/IEC 14496— Apr. 1999: International Standard (IS)— (Now) Working on newer versions with additional features
MPEG – 4 Documents
Part 1 Systems Part 2 VisualPart 3 AudioPart 4 ConformancePart 5 Reference SoftwarePart 6 DMIF - Delivery
Multimedia Integration Framework
Part 7 Optimized SoftwarePart 8 MPEG 4 on IPPart 9 Reference Hardware
Part 10 Advanced Video Coding (AVC) (JVT, H.264)
Part 11 Scene DescriptionPart 12 ISO base media file formatPart 13 IPMP extensionsPart 14 MP4 File formatPart 15 AVC File formatPart 16 Multimedia Animation
FrameworkPart 17 Streaming Text FormatPart 18 Font Compression and
Streaming
MPEG - 4 1998 Coding of audio-visual objects
MPEG-4 Goals
Content-based interactivity: Content-based manipulation and endingUniversal access: Robustness in error-prone environments; content-based scalabilityCoding of natural and synthetic data: Merging pixel-based video/audio with synthesized graphics /audio/ speech in highly flexible way.High compression: Improved coding efficiency for particularly low rate applicationsFlexible syntax and toolsTexture coding based on H.263
MPEG-4 Audio
Three core coders and some additional tools:— Parametric coder (PARA) — 2 to 16 kbs— CELP-based speech coder — 4 to 24 kbs— Time/frequency mapping based coder — 16 to 64 kbs— SNHC audio tools — text-to-speech, structured audio…Features:
— Improved coding efficiency— Time-scale change, pitch change (Karaoke)— Scalability: bitrate, bandwidth, …— Error resilience
MPEG-1 and MPEG-2MPEG-1 and MPEG-2
1992: MPEG-1 Standard
CD-ROM
(1.5 Mbit/s)
1994: MPEG-2 Standard
Digital Television (SDTV/HDTV)
(4 Mbit/s - 24 Mbit/s)
♦ Video Compression ♦ Audio Compression ♦ Systems (Multipl.)
MPEG-4MPEG-4
1999/2000: MPEG-4 Standard
Flexible Multimedia Communications
(5 kbits/s - 50 Mbit/s)
♦ Video Object Compression ♦ Audio Object Compression ♦ Synthetic Audio/Speech and Video ♦ Systems (Multiplex and flexible Composition)
First Things FirstFirst Things First
Top-quality MPEG-4 audio and video coders for streaming conventional speech, audio and video
•excellent AV compression•excellent robustness against packet loss •scalability of bitrate vs quality•MPEG-4 File Format
MPEG-4 attempts to become THEstandard for streaming AV media on the Internet and via wireless networks
But MPEG-4 Vision Goes MUCH Further
MPEG-4 attempts to provide a bridge between the www and conventional AV
media
But MPEG-4 Vision Goes MUCH Further
MPEG-4 attempts to provide a bridge between the www and conventional AV
media
Interactivities in MPEG-4
Example: MPEG-4 audio-visual SceneExample: MPEGExample: MPEG--4 4 audioaudio--visual Scenevisual Scene
2D Background2D Background
3D Furniture3D Furniture
SpeechSpeech
Video ObjectVideo Object
AV PresentationAV Presentation
MPEG-4 Systems: BIFS-Composition of Scenes
MPEGMPEG--4 Systems: 4 Systems: BIFSBIFS--Composition of ScenesComposition of Scenes
Scene
Person Audio-visualPresentation
2D Background Furniture
Globe TableSpeech Video
Integration of Natural and SyntheticContent
Integration of Natural and SyntheticContent
Application: Augmented Reality
Application: Telepresence
MPEG-4 New FunctionalitiesMPEG-4 New Functionalities
Streaming AV over mobile networks of much interest
More freedom to flexibly interact with what is within scenes
Support integration of natural and syntheticAV media (“Virtual Playground”)
Identification, Protection of intellectual property and rights on content
MPEG-4: Coding of AV Objects
AV scenes consist of ‘objects’Objects can be both natural or/and synthetic (A&V, Text & Graphics, animated faces, arbitrarily shaped or rectangular)A ‘compositor’ composes objects in a scene (A&V, 2&3D)Binary Format for Scene Description : ‘BIFS’Independent of Bitrate!
Object ManipulationObject Manipulation
Original Decoded Decoded and Manipulated
MPEG-4 Part II. Visual
CompressionCompression
Error ResilienceError Resilience
ScalabilityScalability
Content-based CodingContent-based Coding
Baseline Extended
Conventional coding Object coding
Still Texture CodingStill Texture Coding
MPEGMPEG--4 Video4 Video
MPEG-4 Video Standard
MPEG-4 Video Provides Tools for a Number of Functionalities
Many tools are not used
Integrated Approach (Baseline and Extensions)
Based on DCT Technology(except for Still Texture Coding) – DWTbased
MPEGMPEG--44Baseline and ExtensionsBaseline and Extensions
Compatibility Issues of MPEG-4 Video Standard
MPEG-4 Video is Compatible to Baseline H.263
And Almost Compatible to MPEG-1
And almost compatible to MPEG-2
Basic Structure for Video Standard
EntropyCoding
Scaling & Inv.
Transform
Motion-Compensation
ControlData
Quant.Transf. coeffs
MotionData
Intra/Inter
CoderControl
Decoder
MotionEstimation
Transform/Scal./Quant.-
InputVideoSignal
Split intoMacroblocks16x16 pixels
OutputVideoSignal
Baseline:Rectangular VOP (Conventional Coding)
EntropyCoding
Scaling & Inv.
Transform
Motion-Compensation
ControlData
Quant.Transf. coeffs
MotionData
Intra/Inter
CoderControl
Decoder
MotionEstimation
Transform/Scal./Quant.-
InputVideoSignal
Split intoMacroblocks16x16 pixels
OutputVideoSignal
8x8 DCT TransformAccuracy problem
(MPEG-2/4)Q: H.263 or MPEG-2 type
Intra DC/AC prediction (MPEG-4)
A
B C D
X MacroblockY
or or
Motion vector accuracy 1/4 (6-tap filter)
(MPEG-4)
0
16x16MB
Types
8x80 1
2 3
Scan:— Alternate-horizontal— Alternate-vertical— Zig-zagAdaptive DC prediction; adaptive AC predictionInverse Quantizer:— Quantization method 1 - similar to that of H.263— Quantization method 2 - similar to that of MPEG-2— Optimized nonlinear quantization for DC coeff.
(can be used together with previous two methods)
DCT and Quantization
Adaptive Intra-DC prediction
A
B C D
X Macroblock(16x16)
Y
or or
Block(8x8)
Choose best DC predictor based on gradients of the DC values (side info. not transmitted)
if (|QDCA - QDCB| < |QDCB - QDCC|) QDCX’ = QDCC
else QDCX’ = QDCA
Adaptive Intra-AC prediction
A
B
X
DC
or
Macroblock
Y
or
Shaded coefficients are predicted from previous coded blocks.The best direction is chosen based on the DC prediction.On/off Mblkbasis --transmitted
Functionality-Baseline
Similar to MPEG-2/H.263 structure and algorithms
8x8 DCT/Q/MC/ME/VLC
50% bit rate reduction compared to MPEG-2Intra DC/AC prediction, 8x8 ME, better VLC table
Widely used in current consumer marketMobile phoneDVDivX
Syntax
Inside the Bit Stream
VS1
VOL1
VO1
GOV1
VOPk
VS1VS1
VS1VS1
VO2
VS1VS1
VOL2
VS1VOPk+1VS1
VOP1
VS1GOV2
VOP1 VS1VOP2
Video session(VS)
Video Object(VO)
Video Object Layer(VOL)
Group Of VOPs(GOV)
Video Object Plane(VOP)
VS1…VSN
VO1…VON
VOL1…VOLN
GOV1…GOVN
VOP1…VOPk VOP1…VOPNVOPk+1…VOPN
Layer 1 Layer 2
SyntaxVideo-object Sequence (VS)
delivers the complete MPEG-4 visual scene, which may contain 2-D or 3-D natural or synthetic objects.
Video Object (VO)a particular object in the scene, which can be of arbitrary (non-rectangular) shape corresponding to an object or background of the scene.
Video Object Layer (VOL)facilitates a way to support (multi-layered) scalable coding. A VO can have multiple VOLs under scalable coding, or have a single VOL under non-scalable coding.
Group of Video Object Planes (GOV)groups Video Object Planes together (optional level).
Video Object Plane (VOP)a snapshot of a VO at a particular moment.
Syntax (1)
Syntax (2)Video_object_layer_start_code
(long Header)Video Object Layer
Video_plane_with_short_header(short Header)
Header User DataVideo Object
Plane (Optional)
Group_of_VideoObjectPlane
(optional)
Video ObjectPlane
Video ObjectPlane
Video ObjectPlane
Header Gob_layer Gob_layer Gob_layerShort_video_start
_markerShort_video_end
_marker
Gob_layer Header(Optional)
Macroblock Macroblock Macroblock
Video Object Plane Vop_start_code Header Sprite DataMotion_shapre_t
extureVideo_packet_he
aderMotion_shapre_t
exture
MacroblockHeader
Shape Data Motion Vector BlockMacroblock
HeaderShape Data Motion Vector BlockMacroblock
Differential DCCoefficient
Run-Level VLC Run-Level VLC End_of_blockBlock
Important Header Information (1)VOL
video_object_layer_shapevol_widthvol_heightinterlacedvol_quant_typenot_8_bitshort headerquarter_sample
VOPvop_coding_type (vop_prediction_type)vop_codedintra_dc_vlc_thrvop_quant
Important Header Information (2)
Macroblocknot_codedmcbpc
VLC to derive the macroblock type and coded block pattern for chrominanceTable B-6, -7 (Also Table B-1~2)
mcsselFor S-VOP
ac_pred_flagAC prediction
cbpyVLC for the pattern of non-transparent Y blocksTable B-8 ~11
Object based Video Coding
MPEG-4 Visual Standards
Video Object: 2-D representation of natural video — MPEG-1/2, H.263 + shape)Face Object: 3-D representation of human face — facial animation parameters; model-based codingMesh Object: 2-D deformable geometric shape (triangle)Still-texture: Wavelet-based still image coding using zero-tree technique
MPEG-4 Visual Decoding
Video object decoding
MPEG-4 Video
— Based on Verification Model 9 (April 1997)Video Object Plane (VOP)Motion / texture coding derived from
MPEG-1/2 & H.263 Polygon matching for motion estimationPadding for motion estimation / texture
codingShape coding: binary and gray-scaleSprite coding: extended background scene
Video Object Coding
Video Object Plane (VOP)
— An arbitrarily shaped image region
VOP Codec Structure
VOP DecoderVOP Decoder
ShapeDecoding
TextureDecoding
Shape InformationDEMULTIPLEXER
Motion Compensation
Bit
stre
am MotionDecoding
VOPMemory
Reconstructed VOP
CompositorVideo Out
Compositing script
Conventional decoding + shape capability
VOP-based v.s. Frame-based
VOP-based CodingMPEG-4 VOP-based coding also employs the Motion Compensation technique:
An Intra-frame coded VOP is called an I-VOP.The Inter-frame coded VOPs are called P-VOPs if only forward prediction is employed, or B-VOPs if bi-directional predictions are employed.
The new difficulty for VOPs: may have arbitrary shapes, shape information must be coded in addition to the texture of the VOP.
Note: texture here actually refers to the visual content, that is the gray-level (or chroma) values of the pixels in the VOP.
VOP-based Coding
1. Motion compensation codingMC + shape capability
By padding process to convert non-rectangular MBs (boundary MB) into rectangular MC and applying conventional ME
2. Texture coding8x8 DCT with zero padding or shape adaptive DCT + Q + VLC
3. Shape codingMC: binary ME or gray scale MEContext adaptive arithmetic coding (CAE)
1. VOP-based Motion CompensationMC-based VOP coding in MPEG-4 again involves three steps:
(a) Motion Estimation.(b) MC-based Prediction.(c) Coding of the prediction error.
Only pixels within the VOP of the current (Target) VOP are considered for matching in MC.
To facilitate MC, each VOP is divided into many macroblocks (MBs). MBs are by default 16x16 in luminance images and 8x8 in chrominance images.
Padding steps for MB processingTo help matching every pixel in the target VOP and meet the mandatory requirement of rectangular blocks in transform codine (e.g., DCT), a pre-processing step of padding is applied to the Reference VOPs prior to motion estimation.
VOP Formulation
— Minimize the number of MBs to be retained
Video ObjectPlane
bounding box
shapeblock(Binary Alpha Block)
Padding
Motion Compensation Tools
time
I-VOP
P-VOP
B-VOP
-- Motion compensated coding modes (I, B, P) (similar to MPEG-1/2 and H.263)
Motion Computation
modified block(polygon) matching
conventionalblock matching
nomatching
referenceP-VOP orI-VOP
padded referencepixels for blockmatching
reference VOPpixels for blockmatching
P-VOP orB-VOP
padded referencepixels forunrestrictedblock matching
boundingbox
advancedpredictionmode (four8x8 blocks)
Only pixels within the VOP of the current (Target) VOP are considered for matching in MC
Motion Vector CodingLet C(x + k; y +l) be pixels of the MB in Target VOP, and R(x+i+k; y+j+l) be pixels of the MB in Reference VOP.A Sum of Absolute Difference (SAD) for measuring
the difference between the two MBs can be defined as
2. Texture Coding Tools
macroblockentirely insideVOP(coded byconventionalDCT scheme)
VOP
macoblockpartially outside VOP(blocks partially outside the VOP are coded by DCT after padding)
macroblockentirely ousideVOP (not coded)
Texture Coding Tools (2/2)
VariableLengthDecoding
MotionCompen-sation
InverseScan
InverseQuantiz-ation
InverseDCT
VOPMemory
ReconstructedVOP
DecodedShape
CodedData
QFS[n] SQF[v][u]
F[v][u] f[y][x] d[y][x]
DecodedPels
InverseAC/DCprediction
QF[v][u]
Adaptive Intra-DC prediction
A
B C D
X Macroblock(16x16)
Y
or or
Block(8x8)
Choose best DC predictor based on gradients of the DC values (side info. not transmitted)
if (|QDCA - QDCB| < |QDCB - QDCC|) QDCX’ = QDCC
else QDCX’ = QDCA
Adaptive Intra-AC prediction
A
B
X
DC
or
Macroblock
Y
or
Shaded coefficients are predicted from previous coded blocks.The best direction is chosen based on the DC prediction.On/off Mblkbasis --transmitted
Boundary blocks: (DCT based)— Inter blocks — Padded with zeros— Intra blocks — Lowpass extrapolation padding
Step 1: Assign the mean value of object pels(inside MB) to the outside pels;
Step 2: f(I,j)=1/4[f(I,j-1) + f(I-1,j) + f(I,j+1) + f(I+1,j)]
starting from the top left corner. If any of the reference 4 pels is outside the block, do not include it and adjust the 1/4 factor accodingly.
Texture Coding
Scan:— Alternate-horizontal— Alternate-vertical— Zig-zagAdaptive DC prediction; adaptive AC predictionInverse Quantizer:— Quantization method 1 - similar to that of H.263— Quantization method 2 - similar to that of MPEG-2— Optimized nonlinear quantization for DC coeff.
(can be used together with previous two methods)
DCT and Quantization
Shape adaptive DCT for Boundary MBShape Adaptive DCT (SA-DCT) is another texture coding method for boundary MBs.Due to its effectiveness, SA-DCT has been adopted for coding boundary MBs in MPEG-4 Version 2.It uses the 1D DCT-N transform and its inverse, IDCT-N:
SA-DCT Flow
3. Shape Coding
The shape information is called alpha planesBinary alpha plane — Code the boundaries usingcontext-based arithmetic encoding (CAE)Gray scale alpha plane — Consists of support and alpha values (texture)
— Support is coded using CAE (as binary alpha plane)— Alpha values (texture) are coded using motion compensated DCT (similar to the texture coding)Motion compensation for shape — similar to that of texture but simpler
Shape CodingShape Coding
binary
arbitrary
X
CAE
Context-based Arithmetic Encoding(CAE) — Predict the current pel value (1 or 0) based on the conditional probability (table)
Other parts
Others
Scalability:— Object scalability— Temporal scalability— Spatial scalability
Error resilience: H.263 marker, MPEG-4 marker, …Sprite codingSNHC visual: Face and bodyDynamic 2-D meshesScalability still texture: Wavelet with zero-tree
MPEG-4 Visual Decoding
Video object decoding
Sprite CodingA sprite is a graphic image that can freely move around within a larger graphic image or a set of images.
To separate the foreground object from the background, we introduce the notion of a sprite panorama: a still image that describes the static background over a sequence of video frames.
The large sprite panoramic image can be encoded and sent to the decoder only once at the beginning of the video sequence.When the decoder receives separately coded foreground objects and parameters describing the camera movements thus far, it can reconstruct the scene in an efficient manner.
Sprite Coding
+Sprite Foreground
Object
DecodedFrame
2-D Mesh Coding
Objects are represented by 2-D polygons.
Node positions and motion vectors are coded.
3-D Face Animation
A 3-D face model is defined in terms of 68 Face Animation Parameters (FAPs)
FGS
Fine Granularity Scalability (FGS)
Amendment 2 (2001)Technique: Base layer + Enhancement layer
Enhance layer bit plane coding“Tuned” Huffman coding
ApplicationsInternet streamingBroadcastingUnicast with/without feedback control.Resource sharingWireless communications
Bandwidth Scalability
I P/B P/B P/B
MPEG-4 base layer
Fine-granular scalable enhancement layer
P/B
Wireless Applications
Ethernet
FGS Advantages
Channel Bandwidth
ReceivedQuality
TraditionalSourceCoding
NewObjective
Good
Moderate
Bad
HighLow
TraditionalDistortion-Rate
Curve
FGS Principles
Base layer: MPEG-4 motion compensated DCT codingEnhancement layer: DCT residuals (the quantization errors of the base layer) are bit-plane-coded. Enhancement layer bitstream can be truncated into any number of bits per frameDecoder may ignore some enhancement bitsReconstructed video quality is proportional to number of decoded bits
FGS Encoder
DCT Q
Q-1
IDCT
MotionCompensation
MotionEstimation
FrameMemory
VLCInput Video
Base LayerBitstream
Bit-planeShift
FindMaximum
Bit-planeVLC Enhancement
Bitstream
Enhancement Layer Encoding
Clipping
DCT
FGS Decoder
VLD Q-1 IDCT
MotionCompensation
FrameMemory
Bit-planeVLD IDCT
Enhancement Layer Decoding
Base LayerBitstream
EnhancementBitstream
Base Layer Video(optional output)
Enhancement VideoClipping
Clipping
Bit-planeShift
Bitplane Coding
+-
+- +
1
0
0+
0
0
0
0
1
111
1
1
1
MSB
LSB
Bit-Plane
A block of 8x8 DCT coefficient differences
Zigzag ordering of a block of 8x8 DCTcoefficient differences
+ - +
1
0
0
0
1
11
MSB
LSB
Bit-Plane
A block of 8x8 DCT coefficient differences after zigzag ordering
+
0
+
0
-
0
1
10 0 0 0 0 0 0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0 1
0
0
0
0 0
0
0
0
0
18 zeros 12 zeros 20 zeros
MSB
LSB
Bit-Plane
(RUN, EOP) symbols for a blockof 8x8 DCT coefficient differences
after zigzag ordering
(0, 1)
(28, 1)
(6, 0)
(0, 0) (0, 0) (26, 1)
(2, 0) (31, 1)
Profiles in Version 1Simple Profile -- Basic tools of I/P VOP, AC/DC Prediction and 4 MV unrestrictedCore Profile -- Simple + Binary Shape, Quantization Method 1/2 and B-VOPMain Profile -- Core + Grey Shape, Interlace and SpriteSimple Scalable Profile -- Simple + Spatial and temporal scalabilityand B-VOPN-Bit Profile -- Core + N-BitAnimated 2D Mesh -- Core + Scalable Still Texture, 2D dynamicMeshBasic Animated Texture -- Banary Shape, Scalable Still Texture and 2D Dynamic MeshStill Scalable Texture -- Scalable Still TextureSimple Face -- Face Animation Parameters
Profiles in Version 2
Advanced Real Time Simple Profile -- Simple + Advanced errorresilience + improved temporal scalabilityCore Scalable Profile -- Simple scalable + Core + SNR, Spatila/Temporal Scalability for Region or Object of interestAdvanced Coding Efficiency Profile -- Tools for improving codingefficiency for both rectangular and arbitrary shaped objectsAdvanced Scalable Texture Profile -- Tools for decoding arbitraryshaped texture and still image including scalable shape codingAdvanced Core Profile -- Core Profile + Tools for decodingarbitrary shaped video objects and arbitrary shaped scalable stillimageSimple Face and Body Animation Profile -- Simple face animation + body animation
Additional ProfilesAdvanced Simple Profile -- Simple Profile + efficient coding tools: B-frames, 1/4 pel MC, … Fine Granularity Scalable Profile
Advanced Simple Profile as base layerFine granularity scalability (FGS)Fine granularity scalability - temporal (FGST)
Simple Studio ProfileI-frames onlyArbitrary shapeMultiple alpha channelsUp to 2 Gbps
Core Studio Profile -- Simple Studio Profile + P-frames
MPEG-4 Video Profiles
Spatial &
TemporalScalability
ArbitraryShape
RectangularFrame
NoScalability
Quality &
TemporalScalability
AdditionalTools
HigherError
Resilience
Simple
Core
SimpleScalable
CoreScalable
Main
AdvancedSimple
AdvancedCoding
Efficiency
Fine Granularity
Scalable
AdvancedRealtimeSimple
SimpleStudio
CoreStudio
AdditionalTools
ISAMD-1AMD-2
Profiles limit the set of tools in a decoding deviceLevels specify parameter ranges (limit complexity)
LevelsVisualProfile
Level Typical VisualSession Size(indicative)
Maximumtotal numberof objects 1
Maximumnumber per
type
Maximumnumberdifferent
QuantizationTables
Max. totalReferencememory
(MB units)2
Maximumnumber of
MB/sec
Costfunction
equivalentI-MB/sec5
Maximumvbv_buffer_size (unitsof 16384
bits)
Max. videopacketlength(bits)6
Max spritesize (MB
units)
Waveletrestrictions
Maxbitrate
Max.enhancement
layers perobject
Main L4 1920 x 1088 32 32 x Main orCore orSimple
4 16320 489600 1290100 380 16384 65280 1 tapsdefaultinteger filter
38.4Mbit/s
1 temporal, 2spatial
Main L3 CCIR 601 32 32 x Main orCore orSimple
4 3240 97200 256200 160 16384 6480 1 tapsdefaultinteger filter
15 Mbit/s 1
Main L2 CIF 16 16 x Main orCore orSimple
4 792 23760 62700 40 8192 1584 1 tapsdefaultinteger filter
2 Mbit/s 1
Core L2 CIF 16 16 x Core orSimple
4 792 23760 62700 40 8192 N. A. N. A. 2 Mbit/s 1
Core L1 QCIF 4 4 x Core orSimple
4 198 5940 15700 8 4096 N. A. N. A. 384
kbit/s
1
SimpleScalable
L2 CIF 4 4 x Simpleor SimpleScalable
1 792 23760 N. A. 20 4096 N. A. N. A. 256 kbit/s 1 spatial ortemporalenhancementlayer
SimpleScalable
L1 CIF 4 4 x Simpleor SimpleScalable
1 495 7425 N. A. 20 2048 N. A. N. A. 128 kbit/s 1 spatial ortemporalenhancementlayer
Simple L3 CIF 4 4 x Simple 1 396 11880 N. A. 20 8192 N. A. N. A. 384 kbit/s N. A.
Simple L2 CIF 4 4 x Simple 1 396 5940 N. A. 20 4096 N. A. N. A. 128 kbit/s N. A.
Simple L1 QCIF 4 4 x Simple 1 99 1485 N. A. 5 2048 N. A. N. A. 64 kbit/s N. A.
F. Pereira & T. Ebrahimi, The MPEG-4 Book, Prentice-Hall, 2002A. Puri and T. Chen, ed., Multimedia Systems, Standards, and Networks, Marcel Dekker, 2000.ISO/IEC JTC1/SC29/WG11/Doc.N1869: MPEG-4 Video Verification Model Version 9.0, Oct. 1997.Image Communication: Tutorial Issue on MPEG-4, Jan. 2000.Weiping Li, “The Overview of fine granularity scalability in MPEG-4 video standard,” IEEE Trans. on Circuits and Systems for Video Tech., pp.301-317, March 2001.Weiping Li and et al., “Fine granularity scalability in MPEG-4 for streaming video,” IEEE ISCAS 2000, pp. 299–302.
References
H.M. Radha and et al., “MPEG-4 fine-grained scalable video coding method for multimedia streaming over IP,”IEEE Trans. on Multimedia, pp.53 –68, March 2001.F. Wu and et al., “A framework for efficient progressive fine granularity scalable video coding,” IEEE Trans. on Circuits and Systems for Video Tech., vol. 11, no. 3, March 2001.ISO/IEC MPEG and ITU-T VCEG, Joint Committee Draft (CD), JVT-C167, May. 2002.ISO/IEC MPEG and ITU-T VCEG, Low Complexity Transform and Quantization, JVT-B038, Feb. 2002.ITU-T VCEG, H.26L Test Model Long-Term Number 9 (TML-9) draft 0, VCEG-N83d1, Dec. 2001.ITU-T VCEG, New Intra Prediction Modes, VCEG-N54, Sept. 2001.