+ All Categories
Home > Documents > Datedocbox.etsi.org/Workshop/2012/201211_STQWORKSHO… ·  · 2012-11-28Date EVS –Enhanced Voice...

Datedocbox.etsi.org/Workshop/2012/201211_STQWORKSHO… ·  · 2012-11-28Date EVS –Enhanced Voice...

Date post: 02-Apr-2018
Category:
Upload: hoanganh
View: 238 times
Download: 6 times
Share this document with a friend
13
Date EVS Enhanced Voice Services EVS Enhanced Voice Services Next Generation in Speech Quality ETSI STQ Workshop, Nov 2012 Dr. Imre Varga Qualcomm Inc.
Transcript
Page 1: Datedocbox.etsi.org/Workshop/2012/201211_STQWORKSHO… ·  · 2012-11-28Date EVS –Enhanced Voice Services Next Generation in Speech ... 4.75 kbps 12.2 6.6 kbps 23.85 5.9 kbps 128

Date

EVS – Enhanced Voice ServicesEVS – Enhanced Voice Services

Next Generation in Speech Quality

ETSI STQ Workshop, Nov 2012

Dr. Imre VargaQualcomm Inc.

Page 2: Datedocbox.etsi.org/Workshop/2012/201211_STQWORKSHO… ·  · 2012-11-28Date EVS –Enhanced Voice Services Next Generation in Speech ... 4.75 kbps 12.2 6.6 kbps 23.85 5.9 kbps 128

kbps4.75 12.2

6.6 23.85kbps

5.9 kbps 128

AMR

AMR-WB

EVS

EVS – Next Gen 3GPP Speech Coding for Improved User Experience

Internal Use Only 2

AMR

AMR-WB

EVSQu

ali

ty

1995 2002 2013

Page 3: Datedocbox.etsi.org/Workshop/2012/201211_STQWORKSHO… ·  · 2012-11-28Date EVS –Enhanced Voice Services Next Generation in Speech ... 4.75 kbps 12.2 6.6 kbps 23.85 5.9 kbps 128

What is EVS?

� EVS – Enhanced Voice Services� Next generation 3GPP speech coding � Following the successful FR, HR, EFR, AMR, AMR-WB codecs� Designed for packet-switched networks / mobile VoIP� VoLTE is a key target application� Application in other networks� AMR-WB interoperable mode� Rel-12 Work Item in 3GPP preceded by a Study Item TR 22.813

� Key features

Internal Use Only 3

� Key features� Super-wideband speech (32 kHz sampling) – improved speech quality � Source-controlled variable bit-rate operation – improved capacity� Designed for VoIP – improved robustness� Improved music performance� Wide bit-rate range and all bandwidths for maximum flexibility� Backward interoperable mode to AMR-WB

� Standardization process� Qualification phase – currently on-going rigorous testing� Selection phase� Characterization phase

Page 4: Datedocbox.etsi.org/Workshop/2012/201211_STQWORKSHO… ·  · 2012-11-28Date EVS –Enhanced Voice Services Next Generation in Speech ... 4.75 kbps 12.2 6.6 kbps 23.85 5.9 kbps 128

3GPP EVS is the Next Generation Speech Coder –Value for the Ecosystem

� Speech quality determines user experience

� Operators are very concerned about voice quality� Ensuring voice quality on new VoLTE deployments

� EVS addresses all networks – mobile VoIP with QoS, best effort VoIP, CS

� 3GPP goals of Enhanced Voice Services (EVS) standardization� Feature-rich coder

� Designed for VoIP applications such as MTSI in TS 26.114

� It is further desirable that the codec fulfills needs in other networks such as CS

Internal Use Only 4

� It is further desirable that the codec fulfills needs in other networks such as CS

� NB, WB, SWB bandwidths, FB and stereo optional, high robustness mode

� Bit rates: 7.2, 8, 9.6, 13.2, 16.4, and 24.4 kb/s gross rates that comply with LTE TBSs; 32, 48, 64, 96, 128 kb/s net source rates.

� Quality improvements – improving user experience

� Better quality in VoLTE and UMTS (with no new RAB)

� Evolution path: EVS provides SWB at around 13 kbps – lower rate and lower delay SWB than other industry coders without sacrificing quality

� Better quality for music and mixed content in conversational applications

� Capacity improvements – increasing system efficiency

� VBR at 5.9 kbps provides high capacity mode

� Robustness improvements – optimized behavior in VoIP applications

� More robust NB/WB through significantly better error resilience

� High robustness mode

Page 5: Datedocbox.etsi.org/Workshop/2012/201211_STQWORKSHO… ·  · 2012-11-28Date EVS –Enhanced Voice Services Next Generation in Speech ... 4.75 kbps 12.2 6.6 kbps 23.85 5.9 kbps 128

EVS FeaturesJitter Buffer Management

NB WB SWB FB Stereo

New EVS

Modes(CBR)

New EVS

Modes (VBR)

AMR-WB

InteropModes

New EVS Modes (CBR)7.2-128 kb/s

New EVS

Modes (VBR)

New EVS Modes

7.2-128 kb/s

New EVS Modes

(optional)New EVS Modes

(optional)

Speech Speech Speech Speech Music Speech Speech Music Speech Music Speech Music

7.2-13.2kb/s

5.9kb/s average

6.6-23.85 kb/s

7.2-128 kb/s

12-128kb/s

5.9 kb/s average

13.2-128kb/s

12-128kb/s

7.2-128 kb/s

7.2-128 kb/s

7.2-128 kb/s

7.2-128 kb/s

Internal Use Only 5

Wh

y D

ep

loy

EV

S?

kb/s kb/s kb/s kb/s

Better Capacity

Same quality as

legacy NB/WB

Better Music

Near AAC Quality at

much lower delay

Better Quality

Same capacity as

legacy NB/WB

Page 6: Datedocbox.etsi.org/Workshop/2012/201211_STQWORKSHO… ·  · 2012-11-28Date EVS –Enhanced Voice Services Next Generation in Speech ... 4.75 kbps 12.2 6.6 kbps 23.85 5.9 kbps 128

EVS From User’s Perspective

• More natural sounding speech for enhanced user experience during voice call

• Consistent voice quality during call

• Super wideband speech coding• Better wideband and narrowband quality at same bit rates as

legacy coders

Internal Use Only 6

• Better in-call music quality

• Improved error resilience• Customizations for VoIP deployment

• Improved coding of music for better sounding • Ring back and music on hold• Hear what I hear,• Remote music education and collaboration

Page 7: Datedocbox.etsi.org/Workshop/2012/201211_STQWORKSHO… ·  · 2012-11-28Date EVS –Enhanced Voice Services Next Generation in Speech ... 4.75 kbps 12.2 6.6 kbps 23.85 5.9 kbps 128

EVS Design Requirements

Wideband(0-8 kHz) Coding of

Superwideband(0-16 kHz) Coding of

Speech better than

AMR-WB

Improved Error Resilience

for both Circuit

Switched and Source

Controlled

Improved Coding of

Music

Constraints on Frame Length,

Max. Algorithmic

Delay, Complexity,

Internal Use Only 7

Narrowband(0-4 KHz) Coding of

Speech better than

AMR

(0-8 kHz) Coding of

Speech better than

AMR-WB.

Switched and

Packet Switched

Communication

and

VoIP Capability

Controlled Variable Rate

Coding

Music for In-call Music

(Music on hold

and Ringback)

Complexity, JBM, Rate Switching, PLC, RTP Payload Format,

VAD/DTX/CNG

Page 8: Datedocbox.etsi.org/Workshop/2012/201211_STQWORKSHO… ·  · 2012-11-28Date EVS –Enhanced Voice Services Next Generation in Speech ... 4.75 kbps 12.2 6.6 kbps 23.85 5.9 kbps 128

EVS Variable Bit-Rate Operation

� Improved system efficiency through lower average bit-rate

� Illustration of Source-Controlled Variable Bit-Rate Operation in the Figure

12.65 12.81 12.5913.04

5

12

14

Avera

ge D

ata

Rate

(kb

ps)

Speech Quality Active ADR

Internal Use Only 8

7.36

12.65 12.81

7.36

12.5913.04

1

2

3

4

EVRC-WB AM R-WB at

12.65k bps

VM R-WB,

M ode 0

EVRC-WB AM R-WB at

12.65k bps

VM R-WB,

M ode 0

CT1 - Input Level and FER Conditions CT2 - Noise Conditions

Qu

ali

ty

0

2

4

6

8

10

12

Avera

ge D

ata

Rate

(kb

ps)

Page 9: Datedocbox.etsi.org/Workshop/2012/201211_STQWORKSHO… ·  · 2012-11-28Date EVS –Enhanced Voice Services Next Generation in Speech ... 4.75 kbps 12.2 6.6 kbps 23.85 5.9 kbps 128

EVS Requirements in SWB at Low RatesCategory Bitrate

(kbit/s)

FER DTX Requirements

Clean speech

-26,-16,-36dBov

13.2 0% On†/Off NWT G.722.1C @ 32

16.4 NWT G.722.1C @ 48

24.4 NWT G.718B @ 36

Clean speech

-26 dBov

13.2 x=3%,

6%

Off

On† for 13.2

NWT G.722.1C @ 48, x% FER

16.4 NWT G.719 @ 48, x% FER

24.4 NWT G.719 @ 56, x% FER

Internal Use Only 9

Noisy Speech (Car, Office,

Street)

-26 dBov

13.2 0% On‡/Off NWT G.722.1C @ 24 when EVS DTX off

NWT AMR-WB @19.85 DTX on when EVS DTX on

16.4 NWT G.722.1C @ 32 when EVS DTX off

NWT AMR-WB @23.05 DTX on when EVS DTX on

24.4 NWT G.722.1C @ 48 when EVS DTX off

NWT AMR-WB @23.85 DTX on when EVS DTX on

Noisy Speech (Car, Office,

Street)

-26 dBov

13.2 x=3%,

6%

Off

On‡ for 13.2

NWT G.722.1C @ 24, x% FER and DTX off

NWT AMR-WB @19.85, x% FER and DTX on when EVS DTX

on

16.4 NWT G.722.1C @ 32, x% FER

24.4 NWT G.722.1C @ 48, x% FER

Page 10: Datedocbox.etsi.org/Workshop/2012/201211_STQWORKSHO… ·  · 2012-11-28Date EVS –Enhanced Voice Services Next Generation in Speech ... 4.75 kbps 12.2 6.6 kbps 23.85 5.9 kbps 128

EVS Standardization Process

� Requirements phase – design constraints and performance requirements

� Candidate coders� 13 companies submitted a candidate by 16 November 2012

� Ericsson, Fraunhofer, Huawei, Motorola, Nokia, NTT, NTTDoCoMo, Orange, Panasonic,Qualcomm, Samsung, VoiceAge, ZTE

� Standardization by competition

� Qualification phase� Aim is to keep the most promising candidates for selection (at most 5)

� Extensive testing

Internal Use Only 10

� Extensive testing

� 12 experiments, each candidate is tested in-house and in another listening lab

� Global Analysis Lab performs collection and analysis of test results

� Qualification meeting in March 2013

� Selection phase� Aim is to select the best candidate out of the max. 5 kept in qualification

� Codec selection is based on extensive testing in neutral listening labs

� Characterization phase� Aim is to test the coder performance for all conditions and special signals / conditions

� Approval of EVS Specifications and Technical Report

Page 11: Datedocbox.etsi.org/Workshop/2012/201211_STQWORKSHO… ·  · 2012-11-28Date EVS –Enhanced Voice Services Next Generation in Speech ... 4.75 kbps 12.2 6.6 kbps 23.85 5.9 kbps 128

Schedule

� 3GPP Standardization Schedule

BeginningEVS

QualificationEVS

Selection

EVS

characterization

13 Candidates 5 CandidatesWinning

candidateWinning

candidate

Aug ’10 March ’13 2013 June ’14

Internal Use Only 11

� Rel-12 Work Item closing in June 2014

Page 12: Datedocbox.etsi.org/Workshop/2012/201211_STQWORKSHO… ·  · 2012-11-28Date EVS –Enhanced Voice Services Next Generation in Speech ... 4.75 kbps 12.2 6.6 kbps 23.85 5.9 kbps 128

EVS Deployment

� EVS targets – VoLTE and other networks� EVS Rel-12 standardization timeline matches VoLTE mass deployment plans

� Provide EVS in WCDMA and best-effort VoIP at the same time

� VoLTE trials� Goal is to make EVS available for VoLTE trials and deployment

� Pre-commercial phase during 2013

Internal Use Only 12

� Qualcomm is key partner in EVS deployment� Serve ecosystem by making EVS codec available in mobile chipsets

� Pre-standard version available for VoLTE trials and branded voice services

� Deployment of standardized version can begin at availability of specifications

� Qualcomm supports best-in-class voice quality for IMS based voice service deployments on LTE with a complete suite of tools and features, including EVS, IMS client, and voice enhancement.

Page 13: Datedocbox.etsi.org/Workshop/2012/201211_STQWORKSHO… ·  · 2012-11-28Date EVS –Enhanced Voice Services Next Generation in Speech ... 4.75 kbps 12.2 6.6 kbps 23.85 5.9 kbps 128

THANK YOU!

Internal Use Only 13

THANK YOU!


Recommended