+ All Categories
Home > Documents > Recent, Current and Future Developments in Video...

Recent, Current and Future Developments in Video...

Date post: 05-Jul-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
49
Inst. of Commun. Engineering RWTH Aachen University J.-R. Ohm Recent, Current and Future Developments in Video Coding Jens-Rainer Ohm Institut für Nachrichtentechnik RWTH Aachen University
Transcript
Page 1: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

Inst. of Commun. Engineering RWTH Aachen UniversityJ.-R. Ohm

Recent, Current and Future Developments in Video Coding

Jens-Rainer Ohm

Institut für NachrichtentechnikRWTH Aachen University

Page 2: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

2 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Outline

Recent and current activities in MPEG Video and JVT Scalable Video CodingMultiview Video Coding and 3D VideoMPEG-C Reconfigurable Video Coding

The Future: Increasing Compression Performance?Will Video data rates and formats continue increasing?Examples of tools for improved compression

Page 3: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

Inst. of Commun. Engineering RWTH Aachen UniversityJ.-R. Ohm

Scalable Video Coding

Page 4: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

4 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Scalable Media Networking – The Idea

Universal Media Access: code once and then customize the stream to access content

“Anytime”from “Anywhere” (i.e. using any access network -wireless, internet etc.)and by “Anyone” (i.e. with any terminal complexity)

Compatibility of different formats/resolutions

Terminal capabilities & Network characteristics feedback

..0100000110110101011..

Scalable Coded Content Terminal

Network

MANE

Page 5: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Scalability of Video - Modalities

Temporal: change of frame rate

Spatial: change of frame size

Fidelity: change of quality (a.k.a. SNR)

30 Hz15 Hz7.5 Hz

TV

CIF

QCIF

HDTV

High rate Low rate

Page 6: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

6 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Example: Format Enhancements in Mobile TV

Backwards-compatible introductionof VGA services

Spatial and temporal format extensionIntroduction of QVGA services

Page 7: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

7 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Hierarchical prediction structures

Non-dyadic temporal scalability

Low-delay prediction structure (structural delay is 0)

I0/P0 I0/P0B3B3 B3B3 B2B2 B3 B3B1 B3B3

I0/P0 B1B2B3 I0/P0 I0/P0B3 B3 B3B3 B3 B3 B3B2 B2 B2B1I0/P0 B1BB I /P I /PB B BB B B BB B BB

Page 8: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

8 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Layered Coding for Spatial Scalability

Layered codingOversampled pyramid for each resolution:e.g. QCIF, CIF, 4CIF, 16CIFMC prediction structures of all layers are aligned

Inter-layer prediction:Switchable prediction(with upsampling)

Prediction of intra macroblocksPrediction of partitioning and motion information Prediction of residual data

Layer n

Layer n+1

Page 9: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

9 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Intra Prediction

4-tap filter: [-3, 19, 19, -3]

F B

E H

A

D

CL

GF I

full sample positionshalf sample positions

Up-sampling

Page 10: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

10 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Spatial Prediction of Data

Spatial Scalability: Foreman, 150 pics, QCIF 15Hz @ 48 kbit/s -> CIF 15 Hz

26

27

28

29

30

31

32

33

34

35

0 48 96 144 192

Y-PSNR [dB]QCIF Base Layer

CIF Single Layer

QCIF & CIF Simulcast

Bit rate used for Intra in QCIF Base Layer

Target RD curve for Intra prediction

Achieved RD curve for Intra prediction

bit-rate [kbit/s]

Page 11: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

11 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Spatial Prediction of Motion Data

Upsample macroblock partitioning as switchable partitioning predictorMultiply motion vectors by 2 and use them as switchablepredictors (keep list 0, list 1, bi-predictive and reference indices information)

16x1616x16

16x1616x16

Intra-BLIntra-BL

Intra-BLIntra-BL16x16 16x8

8x16 8x8

8x8Direct,16x16,16x8,8x16

8x4

4x8 4x4Intra

Page 12: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

12 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Spatial Prediction of Data

26

27

28

29

30

31

32

33

34

35

0 48 96 144 192

QCIF Base LayerCIF Single Layer

QCIF & CIF Simulcast

Bit rate used for Motion Data

Target RD curve for Motion Data prediction

Achieved RD curve for Motion Data prediction

Spatial Scalability: Foreman, 150 pics, QCIF 15Hz @ 48 kbit/s -> CIF 15 HzY-PSNR [dB]

bit-rate [kbit/s]

Page 13: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

13 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Residual Prediction

Block-wise bi-linear up-sampling filterBlock-wise: block boundaries of 4x4 or 8x8 blocksbi-linear: small 4x4 block size

Up-sampling

Page 14: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

14 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Spatial Prediction of Data

26

27

28

29

30

31

32

33

34

35

0 48 96 144 192

QCIF Base LayerCIF Single Layer

QCIF & CIF Simulcast

Bit rate used for Residuals

Achieved RD curve for Residual prediction

Target RD curve for Residual prediction

Spatial Scalability: Foreman, 150 pics, QCIF 15Hz @ 48 kbit/s -> CIF 15 HzY-PSNR [dB]

bit-rate [kbit/s]

Page 15: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

15 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Spatial Prediction of Data

26

27

28

29

30

31

32

33

34

35

0 48 96 144 192

QCIF Base LayerCIF Single Layer

QCIF & CIF Simulcast

Overall Performance of SVC

Spatial Scalability: Foreman, 150 pics, QCIF 15Hz @ 48 kbit/s -> CIF 15 HzY-PSNR [dB]

bit-rate [kbit/s]

Page 16: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

16 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Coarse-grain SNR scalability (CGS)

Extreme case of (extended) spatial scalabilityresolution ratio is equal to 1, no croppingRequantization of residualno upsampling (motion, texture) requiredsinge-loop decoding !

Multiplex Scalablebit-stream

H.264/AVC-compatiblebase layer bit-streamH.264/AVC MCP &

Intra predictionBase layer

coding

texture

motionH.264/AVC compatible encoder

Inter-layer prediction of Intra, motion, and residual

Hierarchical MCP &Intra prediction

Base layercoding

texture

motion

Hierarchical MCP &Intra prediction

Base layercoding

texture

motion

Inter-layer prediction of Intra, motion, and residual

Page 17: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

17 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Single-loop Decoding

Unlike MPEG-2, H.263, MPEG-4 scalability, the current SVC uses

inter-layer intra prediction is restricted to base layer macroblock that are coded in intra modesingle motion compensation loop (including deblocking) is sufficient at decoder sideonly pictures of highest layer are stored in the decoded picture buffer

Impactadditionally required complexity for supporting spatial and SNR scalability is smallMinor impact on compression performance (0 – 0.5 dB)

Page 18: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

18 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Subjective Testing

Subjective Performance Evaluation of SVCSeveral test cases for spatial, quality and combinedscalabilityThree application scenarios: TV broadcasting, Conversational, Movie ProductionThree profiles tested: Scalable Baseline, ScalableHigh, Scalable High IntraSVC compared to AVC either at 10% additional bitrate or at the same bit rateVisual quality evaluated in subjective tests

Page 19: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

19 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

0

1

2

3

4

5

6

7

8

9

10

Crew CrowdRun Seeking Soccer

Vis

ual Q

ualit

y

AVC

SVC

Results of Subjective Testing

Baseline Broadcasting SNR Scalability QVGA - QVGA

Uncertainty Area

BL

EL

Page 20: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

Inst. of Commun. Engineering RWTH Aachen UniversityJ.-R. Ohm

Multiview and 3D Video Coding

Page 21: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

21 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Multi-View Coding

Goal: Joint compression of an entire set of video views (captured by multiple cameras

Possible applications:Free-viewpoint systems (e.g. interactive navigation)3D TV, multiview displays

Challenges:How much compression performance can be gained by utilizing inter-view redundanciesThere may be more than one usage scenario (e.g. different types of displays), which would require different types of optimization

Page 22: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

22 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

MVC Applications

3D displays have largely improved recently, and are targeting consumer markets

High-end displays require many views Autostereoscopic displays for several users and with view correction are comingLow cost stereoscopic e.g. for mobiles In these cases, all available views need to be decoded

Adaptation of view directionSupport head motion parallax viewingIntegral imagingIn these cases, only some views are selectively decoded (all must be available for choice)

Page 23: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

23 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

MVC Reference Model

• Fully compatible to MPEG4-AVC at slice layer and below• Can be seen as reorganization of input images into a

single stream prior to encoding• Uses hierarchical B-pictures combined in temporal and

inter-view dimension

Page 24: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

24 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Objective Results Ballroom

30

31

32

33

34

35

36

37

192 256 320 384 448 512 576 640Avg. Rate [kbps]

Avg

. PS

NR

Y [d

B]

Inter-view PredicitionSimulcastAnchor

(hier. B)(IBBP)

Page 25: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

25 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Overall Observations and Results

Without any changes at slice layer and below, roughly 20% bit rate reduction can be achieved allowing inter-view prediction

Sophisticated temporal+inter-view prediction structures can be implemented based on flexible reference picture indexing of AVCNo distinction between motion vectors and disparity vectors in this case

Efficient methods for buffer and view access management have been defined, including possibilities for low delay and random access

Page 26: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

26 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Overall Observations and Results

Using additional tools (changing AVC at macroblocklevel), only 12% additional bitrate saving have been achieved so far, in particular using

Illumination compensation

Combined motion/disparity vector coding (e.g. disparity-based motion skip)

It appears that these achievements are not (yet) mature for standardization, first phase MVC standard (ready July 2008) will not include them

Page 27: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

27 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

3D Video

MVC (in JVT) targets joint compression of a set of multiple views, typically dense camera setup3D Video is a complementary technology to enable generating a continuum of views from a much more sparse view set

Most probably requiring depth map representation/ compression and interpolation/rendering methodMost probably MVC as video compression for the sparse views

Focus of the work: Upcoming 3D (M-view) displays in consumer applications

Goal: Much less data rate than direct compression of all M views by MVCGood subjective quality, but certainly not measurable by PSNR

Still unclear whether high-quality view synthesis (interpolation) needs to be normative

Page 28: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

28 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

3D Video

Basic principle:Advancedstereoscopicand multiviewdisplays need support to present various views

Maximum angle between leftmost and rightmost position expected to be around 20 degrees for the upcoming generations of displays

Page 29: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

29 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

3D Video

High-level view of 3DV technology chain

K

N c

aptu

reca

mer

as

Var

ious

sourc

esof

3D

conte

nt

Main

focu

son M

-view, co

mpatib

lew

ithoth

erdisp

laysOnly sparse set

of K viewsencoded

Page 30: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

Inst. of Commun. Engineering RWTH Aachen UniversityJ.-R. Ohm

Reconfigurable Video Coding

Page 31: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

31 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Approach: Break down monolithic standards into more elementary building blocksPossible applications:

Simplified, more flexible standards developmentTailoring of existing standards for specific applications (small modifications simple)Devices could "learn" updates of standards

Current status: Both parts expected for October 200821001-4 Codec Configuration Description (useful for non-video and non-MPEG devices) 21002-4 Video Tool Library (including only video tools defined by MPEG)

Reconfigurable Video Coding

Page 32: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

32 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Reconfigurable Video Coding

Decoder

Receiver

Decoder Description

DecoderDecoder Description

Encoded Video Data

Toolbox

Decoding Solution Decoded Video Data

Decoder FormulationMechanism

23001-4 23002-4

Page 33: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

33 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Status of RVC Standardization

Codec Configuration Description (23001-4) Based on subset of formal description language CAL (RVC-CAL) – data-flow oriented, timing and synchronization fully supported Bitstream syntax expressed using MPEG-21 BSDL schema (RVC-BSDL), which enables efficient representation by XML-based description

Straightforward to use this throughout the entire design flow of development of video devices

Tools for automatic generation of C code and VHDL are availableBeyond the "conventional" C simulation, formal description provides rigid testing of timing behaviour, data flow etc.

Page 34: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

34 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Status of RVC Standardization

Video Tool Library (23002-4)Currently building blocks from existing standards defined as "functional units" (FUs) are implementedPseudocode formulation in style of old MPEG standards & CAL based reference software implementationFirst version will support MPEG-4 Simple Profile, MPEG-2 Main Profile and AVC Baseline ProfileFuture versions for more MPEG profiles and new toolsFor more advanced entropy decoders (CABAC, CAVLC) dedicated FUs need to be defined

Page 35: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

Inst. of Commun. Engineering RWTH Aachen UniversityJ.-R. Ohm

Future Video Coding –Further Improving Compression?

Page 36: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

36 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Until now, we have seen a continuous tendency with new standards appearing mostly for the purpose of increased compression performance

Additional functionality seems "nice to have“, but not necessarily mandatoryFor consumer applications, mainly the "low-to-medium" rate points where motion compensation is effective (not too much distortion, sufficiently far from lossless)

Higher compression will certainly be urged by demand for higher resolution

Beyond HD & stereo/multiview for the homeMPEG-4 AVC was mainly developed for CIF/QCIF at medium/low rates, increasing usage for (low-rate) HD

Increasing Compression Performance

Page 37: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

37 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Today's video codecs (such as AVC) still working by same principle as developed around 1990

Separating texture and motion codingHave however become much more intelligent in encoder decisions (mode switching, rate/distortion based decisions in AVC)Paradigm of normative bitstream & decoder (and non-normative encoder) is traditionally used in video

Towards lower rates, high percentage of entire bit allocation goes into mode and motion info

Dramatic break-down of quality below certain rate points (sequence dependent)

Saving data rate in motion/mode and texture (intra in particular) seems to be key to improve compression

Increasing Compression Performance

Page 38: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

38 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Possible improvements could be possible by:Texture synthesis (spatially, temporally)

Including structured and noisy texturesImproved intra prediction & codingImproved motion models, including receiver-side motion updateAdaptive transforms or other basis functionsImproved motion compensation by adaptive filtersCloser integration of motion compensation loop and texture coding?

Most of these require higher complexity, more "intelligence" also at the decoder sideAnother approach to go could be to decrease complexity while achieving same good results as AVC

Increasing Compression Performance

Page 39: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

39 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Improved intra prediction by texture modeling

Examples from RWTH Work

Page 40: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

40 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Displacement intra prediction

Examples from RWTH Work

Page 41: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

41 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Displacement intra prediction – example of prediction image for "spincalendar" sequence

Examples from RWTH Work

Page 42: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

42 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Markovian texture prediction by "template matching" Examples from RWTH Work

Page 43: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

43 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

New intra texture prediction - resultsExamples from RWTH Work

Page 44: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

44 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

New intra texture prediction - resultsExamples from RWTH Work

Page 45: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

45 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Extension of template matching to inter predictionBasic principle: Motion estimation at decoder, comparing neighboring template of current block against reference frame

Template can be derived from the decoded signal or from the prediction signalSmall search range around predictor (±1..4 pixels)

Examples from RWTH Work

Page 46: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

46 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Advantages of decoder-side motion vector derivation using template prediction

If selected (signaled by an additional mode definition) it is not necessary to transmit motion vectorIf combined with multi-frame prediction, it is not even necessary to signal the reference frame index Weighted averaging of several (best, second best etc.) matches becomes appropriate (unlike conventional coding, where a motion vectors would need to be encoded for each

Examples from RWTH Work

Page 47: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

47 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Combination of template inter prediction, multi-frame and weighted averaging – HD (720p) results

Examples from RWTH Work

0 5000 10000 1500030

32

34

36

38

40

42

Bit t [kbit/ ]

PSNR [dB]

Bigships

JM13.2DMVD 1 HypDMVD 8 Hyp AllRefs Trunc

0 0.5 1 1.5 2 2.5 328

30

32

34

36

38

40

42

Bit t [kbit/ ]

PSNR [dB]

City

JM13.2DMVD 1 HypDMVD 8 Hyp AllRefs Trunc

0 2000 4000 6000 8000 10000 12000 1400034

35

36

37

38

39

40

41

42

Bit t [kbit/ ]

PSNR [dB]

Crew

JM13.2DMVD 1 HypDMVD 8 Hyp AllRefs Trunc

0 0.5 1 1.5 2 2.5 330

32

34

36

38

40

42

Bit t [kbit/ ]

PSNR [dB]

Night

JM13.2DMVD 1 HypDMVD 8 Hyp AllRefs Trunc

0 1000 2000 3000 4000 5000 600034

35

36

37

38

39

40

41

42

43

44

Bit t [kbit/ ]

PSNR [dB]

Shuttlestart

JM13.2DMVD 1 HypDMVD 8 Hyp AllRefs Trunc

Page 48: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

48 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Examples from RWTH Work

Template inter prediction – overall results in bit rate reduction

-8.6112-8.3693-5.1655-3.3090Overall-11.8249-11.1096-6.4639-4.0747720p-4.5941-4.9441-3.5424-2.3520CIFaverage

-10.9685-9.4766-5.0851-2.8586720p 60Hzshuttlestart

-7.7954-7.3306-4.6359-1.8803720p 60Hznight

-7.6865-7.3916-5.4089-4.1780720p 60Hzcrew

-19.1836-17.1830-9.0854-5.4663720p 60Hzcity

-13.4904-14.1659-8.1042-5.9903720p 60Hzbigships

-4.8984-4.4590-2.7669-1.6167CIF 30Hztempete

-1.2819-2.0272-1.6025-1.0622CIF 15Hzparis

-5.4051-5.4924-3.2016-1.5080CIF 30Hzmobile

-6.7912-7.7978-6.5985-5.2209CIF 30Hzforeman

DMVD 8 HypDMVD 4 HypDMVD 2 HypDMVD 1 HypResolutionSequence

Page 49: Recent, Current and Future Developments in Video Codingwiamis2008.itec.uni-klu.ac.at/keynotes/ohm.pdf · 3D TV, multiview displays Challenges: How much compression performance can

49 | May 09, 2008 | Recent, Current and Future Developments in Video Coding – WIAMIS 2008, Klagenfurt

Institut für Nachrichtentechnik RWTH Aachen UniversityJ.-R. Ohm

Continuing efforts made to resolve previously open problems in video coding – example SVC

Not more than 10% rate increase for scalability functionality has been achieved, otherwise it would not be acceptable

Increasing compression stays most important factor of development – looking for increasing resolution and multiview

Desire for higher pixel rates grows faster than affordable network bandwidth (both for wireless and wired)Bit rate must not scale up with resolution

Current paradigms in video standardization may need revisitsJust some examples: non-SNR quality criteria, role of RVC, roles of encoder/bitstream/decoder/postprocessing

Summary


Recommended