+ All Categories
Home > Documents > On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan...

On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan...

Date post: 27-Oct-2020
Category:
Upload: others
View: 3 times
Download: 0 times
Share this document with a friend
96
On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I 2 R) National University of Singapore (NUS) January 20, 2013 Vincent Tan (I 2 R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 1 / 29
Transcript
Page 1: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

On Third-Order Asymptotics for DMCs

Vincent Y. F. Tan

Institute for Infocomm Research (I2R)National University of Singapore (NUS)

January 20, 2013

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 1 / 29

Page 2: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Acknowledgements

This is joint work with Marco Tomamichel

Centre for Quantum TechnologiesNational University of Singapore

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 2 / 29

Page 3: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Transmission of Information

Shannon’s Figure 1TRANSMITTER

MESSAGE

SIGNAL RECEIVEDSIGNAL

RECEIVER DESTINATION

MESSAGE

NOISESOURCE

INFORMATIONSOURCE

Shannon abstracted away information meaning, “semantics”• treat all data equally — bits as a “universal currency”• crucial abstraction for modern communication and computing systems

Also relaxed computation and delay constraints to discover a fundamental limit: capacity, providing a goal-post to work toward

Saturday, June 11, 2011

Shannon’s Figure 1

Information theory ≡ Finding fundamental limits for reliableinformation transmission

Channel coding: Concerned with the maximum rate ofcommunication in bits/channel use

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 3 / 29

Page 4: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Transmission of Information

Shannon’s Figure 1TRANSMITTER

MESSAGE

SIGNAL RECEIVEDSIGNAL

RECEIVER DESTINATION

MESSAGE

NOISESOURCE

INFORMATIONSOURCE

Shannon abstracted away information meaning, “semantics”• treat all data equally — bits as a “universal currency”• crucial abstraction for modern communication and computing systems

Also relaxed computation and delay constraints to discover a fundamental limit: capacity, providing a goal-post to work toward

Saturday, June 11, 2011

Shannon’s Figure 1

Information theory ≡ Finding fundamental limits for reliableinformation transmission

Channel coding: Concerned with the maximum rate ofcommunication in bits/channel use

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 3 / 29

Page 5: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Channel Coding (One-Shot)

- - - -M X Ye W d M̂

A code is an triple C = {M, e, d} whereM is the message set

The average error probability perr(C) is

perr(C) := Pr [M̂ 6= M]

where M is uniform onM

ε-Error Capacity is

M∗(W, ε) := sup{

m ∈ N∣∣ ∃ C s.t. m = |M|, perr(C) ≤ ε

}

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 4 / 29

Page 6: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Channel Coding (One-Shot)

- - - -M X Ye W d M̂

A code is an triple C = {M, e, d} whereM is the message set

The average error probability perr(C) is

perr(C) := Pr [M̂ 6= M]

where M is uniform onM

ε-Error Capacity is

M∗(W, ε) := sup{

m ∈ N∣∣ ∃ C s.t. m = |M|, perr(C) ≤ ε

}

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 4 / 29

Page 7: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Channel Coding (One-Shot)

- - - -M X Ye W d M̂

A code is an triple C = {M, e, d} whereM is the message set

The average error probability perr(C) is

perr(C) := Pr [M̂ 6= M]

where M is uniform onM

ε-Error Capacity is

M∗(W, ε) := sup{

m ∈ N∣∣ ∃ C s.t. m = |M|, perr(C) ≤ ε

}Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 4 / 29

Page 8: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Channel Coding (n-Shot)

- - - -M Xn Yn

e Wn d M̂

Consider n independent uses of a channel

Assume W is a discrete memoryless channel

For vectors x = (x1, . . . , xn) ∈ X n and y := (y1, . . . , yn) ∈ Yn,

Wn(y|x) =

n∏i=1

W(yi|xi)

Blocklength n, ε-Error Capacity is

M∗(Wn, ε)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 5 / 29

Page 9: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Channel Coding (n-Shot)

- - - -M Xn Yn

e Wn d M̂

Consider n independent uses of a channel

Assume W is a discrete memoryless channel

For vectors x = (x1, . . . , xn) ∈ X n and y := (y1, . . . , yn) ∈ Yn,

Wn(y|x) =

n∏i=1

W(yi|xi)

Blocklength n, ε-Error Capacity is

M∗(Wn, ε)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 5 / 29

Page 10: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Channel Coding (n-Shot)

- - - -M Xn Yn

e Wn d M̂

Consider n independent uses of a channel

Assume W is a discrete memoryless channel

For vectors x = (x1, . . . , xn) ∈ X n and y := (y1, . . . , yn) ∈ Yn,

Wn(y|x) =

n∏i=1

W(yi|xi)

Blocklength n, ε-Error Capacity is

M∗(Wn, ε)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 5 / 29

Page 11: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Channel Coding (n-Shot)

- - - -M Xn Yn

e Wn d M̂

Consider n independent uses of a channel

Assume W is a discrete memoryless channel

For vectors x = (x1, . . . , xn) ∈ X n and y := (y1, . . . , yn) ∈ Yn,

Wn(y|x) =

n∏i=1

W(yi|xi)

Blocklength n, ε-Error Capacity is

M∗(Wn, ε)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 5 / 29

Page 12: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Main Contribution

Upper bound log M∗(Wn, ε) for n large (converse)

Concerned with the third-order term of the asymptotic expansion

Going beyond the normal approximation terms

Theorem (Tomamichel-Tan (2013))

For all DMCs with positive ε-dispersion Vε,

log M∗(Wn, ε) ≤ nC −√

nVεQ−1(ε) +12

log n + O(1)

where Q(a) :=∫ +∞

a1√2π

exp(− 1

2 x2)

dx

The 12 log n term is our main contribution

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 6 / 29

Page 13: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Main Contribution

Upper bound log M∗(Wn, ε) for n large (converse)

Concerned with the third-order term of the asymptotic expansion

Going beyond the normal approximation terms

Theorem (Tomamichel-Tan (2013))

For all DMCs with positive ε-dispersion Vε,

log M∗(Wn, ε) ≤ nC −√

nVεQ−1(ε) +12

log n + O(1)

where Q(a) :=∫ +∞

a1√2π

exp(− 1

2 x2)

dx

The 12 log n term is our main contribution

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 6 / 29

Page 14: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Main Contribution

Upper bound log M∗(Wn, ε) for n large (converse)

Concerned with the third-order term of the asymptotic expansion

Going beyond the normal approximation terms

Theorem (Tomamichel-Tan (2013))

For all DMCs with positive ε-dispersion Vε,

log M∗(Wn, ε) ≤ nC −√

nVεQ−1(ε) +12

log n + O(1)

where Q(a) :=∫ +∞

a1√2π

exp(− 1

2 x2)

dx

The 12 log n term is our main contribution

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 6 / 29

Page 15: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Main Contribution

Upper bound log M∗(Wn, ε) for n large (converse)

Concerned with the third-order term of the asymptotic expansion

Going beyond the normal approximation terms

Theorem (Tomamichel-Tan (2013))

For all DMCs with positive ε-dispersion Vε,

log M∗(Wn, ε) ≤ nC −√

nVεQ−1(ε) +12

log n + O(1)

where Q(a) :=∫ +∞

a1√2π

exp(− 1

2 x2)

dx

The 12 log n term is our main contribution

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 6 / 29

Page 16: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Main Contribution

Upper bound log M∗(Wn, ε) for n large (converse)

Concerned with the third-order term of the asymptotic expansion

Going beyond the normal approximation terms

Theorem (Tomamichel-Tan (2013))

For all DMCs with positive ε-dispersion Vε,

log M∗(Wn, ε) ≤ nC −√

nVεQ−1(ε) +12

log n + O(1)

where Q(a) :=∫ +∞

a1√2π

exp(− 1

2 x2)

dx

The 12 log n term is our main contribution

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 6 / 29

Page 17: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Main Contribution: Remarks

Our bound

log M∗(Wn, ε) ≤ nC −√

nVεQ−1(ε) +12

log n + O(1)

Best upper bound till date:

log M∗(Wn, ε) ≤ nC −√

nVεQ−1(ε) +

(|X | − 1

2

)log n + O(1)

V. Strassen (1964) Polyanskiy-Poor-Verdú or PPV (2010)

Requires new converse techniques

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 7 / 29

Page 18: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Main Contribution: Remarks

Our bound

log M∗(Wn, ε) ≤ nC −√

nVεQ−1(ε) +12

log n + O(1)

Best upper bound till date:

log M∗(Wn, ε) ≤ nC −√

nVεQ−1(ε) +

(|X | − 1

2

)log n + O(1)

V. Strassen (1964) Polyanskiy-Poor-Verdú or PPV (2010)

Requires new converse techniques

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 7 / 29

Page 19: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Main Contribution: Remarks

Our bound

log M∗(Wn, ε) ≤ nC −√

nVεQ−1(ε) +12

log n + O(1)

Best upper bound till date:

log M∗(Wn, ε) ≤ nC −√

nVεQ−1(ε) +

(|X | − 1

2

)log n + O(1)

V. Strassen (1964) Polyanskiy-Poor-Verdú or PPV (2010)

Requires new converse techniques

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 7 / 29

Page 20: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Outline

1 Background

2 Related work

3 Main result

4 New converse

5 Proof sketch

6 Summary and open problems

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 8 / 29

Page 21: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: Shannon’s Channel Coding Theorem

Shannon’s noisy channel codingtheorem and

Wolfowitz’s strong converse state that

Theorem (Shannon (1949), Wolfowitz (1959))

limn→∞

1n

log M∗(Wn, ε) = C, ∀ ε ∈ (0, 1)

where C is the channel capacity defined as

C = C(W) = maxP

I(P,W)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 9 / 29

Page 22: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: Shannon’s Channel Coding Theorem

Shannon’s noisy channel codingtheorem and

Wolfowitz’s strong converse state that

Theorem (Shannon (1949), Wolfowitz (1959))

limn→∞

1n

log M∗(Wn, ε) = C, ∀ ε ∈ (0, 1)

where C is the channel capacity defined as

C = C(W) = maxP

I(P,W)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 9 / 29

Page 23: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: Shannon’s Channel Coding Theorem

limn→∞

1n

log M∗(Wn, ε) = C bits/channel use

Noisy channel coding theorem is independent of ε ∈ (0, 1)

-

6

CR0

1

limn→∞

perr(C)

Phase transition at capacity

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 10 / 29

Page 24: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: Shannon’s Channel Coding Theorem

limn→∞

1n

log M∗(Wn, ε) = C bits/channel use

Noisy channel coding theorem is independent of ε ∈ (0, 1)

-

6

CR0

1

limn→∞

perr(C)

Phase transition at capacity

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 10 / 29

Page 25: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: Shannon’s Channel Coding Theorem

limn→∞

1n

log M∗(Wn, ε) = C bits/channel use

Noisy channel coding theorem is independent of ε ∈ (0, 1)

-

6

CR0

1

limn→∞

perr(C)

Phase transition at capacity

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 10 / 29

Page 26: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: Shannon’s Channel Coding Theorem

limn→∞

1n

log M∗(Wn, ε) = C bits/channel use

Noisy channel coding theorem is independent of ε ∈ (0, 1)

-

6

CR0

1

limn→∞

perr(C)

Phase transition at capacity

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 10 / 29

Page 27: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: ε-Dispersion

What happens at capacity?

More precisely, what happens when

log |M| ≈ nC + a√

n

for some a ∈ R?

Assume capacity-achieving input distribution (CAID) P∗ is unique

The ε-dispersion is an operational quantity that is equal to

Vε = V(P∗,W) = EP∗

[VarW(·|X)

(log

W(·|X)

Q∗(·)∣∣X)]

where (X,Y) ∼ P∗ ×W and Q∗(y) =∑

x P∗(x)W(y|x)

Since CAID is unique, Vε = V

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 11 / 29

Page 28: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: ε-Dispersion

What happens at capacity?

More precisely, what happens when

log |M| ≈ nC + a√

n

for some a ∈ R?

Assume capacity-achieving input distribution (CAID) P∗ is unique

The ε-dispersion is an operational quantity that is equal to

Vε = V(P∗,W) = EP∗

[VarW(·|X)

(log

W(·|X)

Q∗(·)∣∣X)]

where (X,Y) ∼ P∗ ×W and Q∗(y) =∑

x P∗(x)W(y|x)

Since CAID is unique, Vε = V

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 11 / 29

Page 29: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: ε-Dispersion

What happens at capacity?

More precisely, what happens when

log |M| ≈ nC + a√

n

for some a ∈ R?

Assume capacity-achieving input distribution (CAID) P∗ is unique

The ε-dispersion is an operational quantity that is equal to

Vε = V(P∗,W) = EP∗

[VarW(·|X)

(log

W(·|X)

Q∗(·)∣∣X)]

where (X,Y) ∼ P∗ ×W and Q∗(y) =∑

x P∗(x)W(y|x)

Since CAID is unique, Vε = V

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 11 / 29

Page 30: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: ε-Dispersion

What happens at capacity?

More precisely, what happens when

log |M| ≈ nC + a√

n

for some a ∈ R?

Assume capacity-achieving input distribution (CAID) P∗ is unique

The ε-dispersion is an operational quantity that is equal to

Vε = V(P∗,W) = EP∗

[VarW(·|X)

(log

W(·|X)

Q∗(·)∣∣X)]

where (X,Y) ∼ P∗ ×W and Q∗(y) =∑

x P∗(x)W(y|x)

Since CAID is unique, Vε = V

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 11 / 29

Page 31: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: ε-Dispersion

What happens at capacity?

More precisely, what happens when

log |M| ≈ nC + a√

n

for some a ∈ R?

Assume capacity-achieving input distribution (CAID) P∗ is unique

The ε-dispersion is an operational quantity that is equal to

Vε = V(P∗,W) = EP∗

[VarW(·|X)

(log

W(·|X)

Q∗(·)∣∣X)]

where (X,Y) ∼ P∗ ×W and Q∗(y) =∑

x P∗(x)W(y|x)

Since CAID is unique, Vε = V

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 11 / 29

Page 32: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: ε-Dispersion

Assume rate of the code satisfies

1n

log |M| = C +a√n

-

6

0

0.5

1

a

limn→∞

perr(C)

perr(C) ≈ Φ(

a√V

)

Here, we have fixed a, the second-order coding rate [Hayashi (2009)]

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 12 / 29

Page 33: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: ε-Dispersion

Assume rate of the code satisfies

1n

log |M| = C +a√n

-

6

0

0.5

1

a

limn→∞

perr(C)

perr(C) ≈ Φ(

a√V

)

Here, we have fixed a, the second-order coding rate [Hayashi (2009)]

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 12 / 29

Page 34: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: ε-Dispersion

Assume rate of the code satisfies

1n

log |M| = C +a√n

-

6

0

0.5

1

a

limn→∞

perr(C)

perr(C) ≈ Φ(

a√V

)

Here, we have fixed a, the second-order coding rate [Hayashi (2009)]

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 12 / 29

Page 35: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: ε-Dispersion

Assume rate of the code satisfies

1n

log |M| = C +a√n

-

6

0

0.5

1

a

limn→∞

perr(C)

perr(C) ≈ Φ(

a√V

)

Here, we have fixed a, the second-order coding rate [Hayashi (2009)]

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 12 / 29

Page 36: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: ε-Dispersion

Theorem (Strassen (1964), Hayashi (2009), Polyanskiy-Poor-Verdú(2010))

For every ε ∈ (0, 1), and if Vε > 0, we have

log M∗(Wn, ε) = nC −√

nVQ−1(ε) + O(log n)

V. Strassen(1964)

M. Hayashi(2009) Polyanskiy-Poor-Verdú (2010)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 13 / 29

Page 37: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: ε-Dispersion

Theorem (Strassen (1964), Hayashi (2009), Polyanskiy-Poor-Verdú(2010))

For every ε ∈ (0, 1), and if Vε > 0, we have

log M∗(Wn, ε) = nC −√

nVQ−1(ε) + O(log n)

V. Strassen(1964)

M. Hayashi(2009) Polyanskiy-Poor-Verdú (2010)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 13 / 29

Page 38: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: ε-Dispersion

Berry-Esséen theorem: For independent Xi with zero-mean andvariances σ2

i ,

P

(1√n

n∑i=1

Xi ≥ a

)= Q

( aσ̄

)± 6 B√

n

where σ̄2 = 1n

∑ni=1 σ

2i and B is related to the third moment

PPV showed that the normal approximation

log M∗(Wn, ε) ≈ nC −√

nVQ−1(ε)

is very accurate even at moderate blocklengths of ≈ 100

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 14 / 29

Page 39: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: ε-Dispersion

Berry-Esséen theorem: For independent Xi with zero-mean andvariances σ2

i ,

P

(1√n

n∑i=1

Xi ≥ a

)= Q

( aσ̄

)± 6 B√

n

where σ̄2 = 1n

∑ni=1 σ

2i and B is related to the third moment

PPV showed that the normal approximation

log M∗(Wn, ε) ≈ nC −√

nVQ−1(ε)

is very accurate even at moderate blocklengths of ≈ 100

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 14 / 29

Page 40: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Background: ε-Dispersion for the BSC

For a BSC with crossover probability p = 0.11, the normalapproximation yields:

0 100 200 300 400 500 600 700 800 900 10000.3

0.35

0.4

0.45

0.5

Blocklength n

Bits

per

cha

nnel

use

Normal approximation

Capacityε = 0.01ε = 0.1

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 15 / 29

Page 41: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Third-Order Term

Recall that we are interested in quantifying the third-order term ρn

ρn = log M∗(Wn, ε)−[nC −

√nVQ−1(ε)

]ρn = O(log n) if channel is non-exotic

Motivation 1: ρn may be important at very short blocklengths

Motivation 2: Because we’re information theorists

Wir müssen wissen – wir werden wissen (David Hilbert)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 16 / 29

Page 42: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Third-Order Term

Recall that we are interested in quantifying the third-order term ρn

ρn = log M∗(Wn, ε)−[nC −

√nVQ−1(ε)

]ρn = O(log n) if channel is non-exotic

Motivation 1: ρn may be important at very short blocklengths

Motivation 2: Because we’re information theorists

Wir müssen wissen – wir werden wissen (David Hilbert)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 16 / 29

Page 43: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Third-Order Term

Recall that we are interested in quantifying the third-order term ρn

ρn = log M∗(Wn, ε)−[nC −

√nVQ−1(ε)

]ρn = O(log n) if channel is non-exotic

Motivation 1: ρn may be important at very short blocklengths

Motivation 2: Because we’re information theorists

Wir müssen wissen – wir werden wissen (David Hilbert)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 16 / 29

Page 44: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Third-Order Term

ρn = log M∗(Wn, ε)−[nC −

√nVQ−1(ε)

]For the BSC [PPV10]

ρn =12

log n + O(1)

For the BEC [PPV10]ρn = O(1)

For the AWGN under maximum-power constraints [PPV10]

O(1) ≤ ρn ≤12

log n + O(1)

Our converse technique can be applied to the AWGN channel

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 17 / 29

Page 45: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Third-Order Term

ρn = log M∗(Wn, ε)−[nC −

√nVQ−1(ε)

]For the BSC [PPV10]

ρn =12

log n + O(1)

For the BEC [PPV10]ρn = O(1)

For the AWGN under maximum-power constraints [PPV10]

O(1) ≤ ρn ≤12

log n + O(1)

Our converse technique can be applied to the AWGN channel

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 17 / 29

Page 46: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Third-Order Term

ρn = log M∗(Wn, ε)−[nC −

√nVQ−1(ε)

]For the BSC [PPV10]

ρn =12

log n + O(1)

For the BEC [PPV10]ρn = O(1)

For the AWGN under maximum-power constraints [PPV10]

O(1) ≤ ρn ≤12

log n + O(1)

Our converse technique can be applied to the AWGN channel

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 17 / 29

Page 47: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Third-Order Term

ρn = log M∗(Wn, ε)−[nC −

√nVQ−1(ε)

]For the BSC [PPV10]

ρn =12

log n + O(1)

For the BEC [PPV10]ρn = O(1)

For the AWGN under maximum-power constraints [PPV10]

O(1) ≤ ρn ≤12

log n + O(1)

Our converse technique can be applied to the AWGN channelVincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 17 / 29

Page 48: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Achievability for Third-Order Term

Proposition (Polyanskiy (2010))

Assume that all elements of {W(y|x) : x ∈ X , y ∈ Y} are positive andC > 0. Then,

ρn ≥12

log n + O(1)

This is an achievability result

BEC doesn’t satisfy assumptions

We will not try to improve on it

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 18 / 29

Page 49: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Achievability for Third-Order Term

Proposition (Polyanskiy (2010))

Assume that all elements of {W(y|x) : x ∈ X , y ∈ Y} are positive andC > 0. Then,

ρn ≥12

log n + O(1)

This is an achievability result

BEC doesn’t satisfy assumptions

We will not try to improve on it

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 18 / 29

Page 50: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Achievability for Third-Order Term

Proposition (Polyanskiy (2010))

Assume that all elements of {W(y|x) : x ∈ X , y ∈ Y} are positive andC > 0. Then,

ρn ≥12

log n + O(1)

This is an achievability result

BEC doesn’t satisfy assumptions

We will not try to improve on it

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 18 / 29

Page 51: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Achievability for Third-Order Term

Proposition (Polyanskiy (2010))

Assume that all elements of {W(y|x) : x ∈ X , y ∈ Y} are positive andC > 0. Then,

ρn ≥12

log n + O(1)

This is an achievability result

BEC doesn’t satisfy assumptions

We will not try to improve on it

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 18 / 29

Page 52: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Converse for Third-Order Term

Proposition (Polyanskiy (2010))

If W is weakly input-symmetric

ρn ≤12

log n + O(1)

This is a converse result

Gallager-symmetric channels are weakly input-symmetric

The set of weakly input-symmetric channels is very thin

We dispense of this symmetry assumption

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 19 / 29

Page 53: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Converse for Third-Order Term

Proposition (Polyanskiy (2010))

If W is weakly input-symmetric

ρn ≤12

log n + O(1)

This is a converse result

Gallager-symmetric channels are weakly input-symmetric

The set of weakly input-symmetric channels is very thin

We dispense of this symmetry assumption

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 19 / 29

Page 54: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Converse for Third-Order Term

Proposition (Polyanskiy (2010))

If W is weakly input-symmetric

ρn ≤12

log n + O(1)

This is a converse result

Gallager-symmetric channels are weakly input-symmetric

The set of weakly input-symmetric channels is very thin

We dispense of this symmetry assumption

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 19 / 29

Page 55: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Converse for Third-Order Term

Proposition (Polyanskiy (2010))

If W is weakly input-symmetric

ρn ≤12

log n + O(1)

This is a converse result

Gallager-symmetric channels are weakly input-symmetric

The set of weakly input-symmetric channels is very thin

We dispense of this symmetry assumption

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 19 / 29

Page 56: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Converse for Third-Order Term

Proposition (Polyanskiy (2010))

If W is weakly input-symmetric

ρn ≤12

log n + O(1)

This is a converse result

Gallager-symmetric channels are weakly input-symmetric

The set of weakly input-symmetric channels is very thin

We dispense of this symmetry assumption

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 19 / 29

Page 57: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Converse for Third-Order Term

Proposition (Strassen (1964), PPV (2010))

If W is a DMC with positive ε-dispersion,

ρn ≤(|X | − 1

2

)log n + O(1)

Every code can be partitioned into no more than (n + 1)|X |−1

constant-composition subcodes

M∗P(Wn, ε): Max size of a constant-composition code with type P

As such,

M∗(Wn, ε) ≤ (n + 1)|X |−1 maxP∈Pn(X )

M∗P(Wn, ε)

This is where the dependence on |X | comes in

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 20 / 29

Page 58: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Converse for Third-Order Term

Proposition (Strassen (1964), PPV (2010))

If W is a DMC with positive ε-dispersion,

ρn ≤(|X | − 1

2

)log n + O(1)

Every code can be partitioned into no more than (n + 1)|X |−1

constant-composition subcodes

M∗P(Wn, ε): Max size of a constant-composition code with type P

As such,

M∗(Wn, ε) ≤ (n + 1)|X |−1 maxP∈Pn(X )

M∗P(Wn, ε)

This is where the dependence on |X | comes in

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 20 / 29

Page 59: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Converse for Third-Order Term

Proposition (Strassen (1964), PPV (2010))

If W is a DMC with positive ε-dispersion,

ρn ≤(|X | − 1

2

)log n + O(1)

Every code can be partitioned into no more than (n + 1)|X |−1

constant-composition subcodes

M∗P(Wn, ε): Max size of a constant-composition code with type P

As such,

M∗(Wn, ε) ≤ (n + 1)|X |−1 maxP∈Pn(X )

M∗P(Wn, ε)

This is where the dependence on |X | comes in

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 20 / 29

Page 60: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Converse for Third-Order Term

Proposition (Strassen (1964), PPV (2010))

If W is a DMC with positive ε-dispersion,

ρn ≤(|X | − 1

2

)log n + O(1)

Every code can be partitioned into no more than (n + 1)|X |−1

constant-composition subcodes

M∗P(Wn, ε): Max size of a constant-composition code with type P

As such,

M∗(Wn, ε) ≤ (n + 1)|X |−1 maxP∈Pn(X )

M∗P(Wn, ε)

This is where the dependence on |X | comes in

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 20 / 29

Page 61: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Related Work: Converse for Third-Order Term

Proposition (Strassen (1964), PPV (2010))

If W is a DMC with positive ε-dispersion,

ρn ≤(|X | − 1

2

)log n + O(1)

Every code can be partitioned into no more than (n + 1)|X |−1

constant-composition subcodes

M∗P(Wn, ε): Max size of a constant-composition code with type P

As such,

M∗(Wn, ε) ≤ (n + 1)|X |−1 maxP∈Pn(X )

M∗P(Wn, ε)

This is where the dependence on |X | comes inVincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 20 / 29

Page 62: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Main Result: Tight Third-Order Term

Theorem (Tomamichel-Tan (2013))

If W is a DMC with positive ε-dispersion,

ρn ≤12

log n + O(1)

The 12 cannot be improved without further assumptions

For BSCρn =

12

log n + O(1)

We can dispense of the positive ε-dispersion assumption as well

No need for unique CAID

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 21 / 29

Page 63: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Main Result: Tight Third-Order Term

Theorem (Tomamichel-Tan (2013))

If W is a DMC with positive ε-dispersion,

ρn ≤12

log n + O(1)

The 12 cannot be improved without further assumptions

For BSCρn =

12

log n + O(1)

We can dispense of the positive ε-dispersion assumption as well

No need for unique CAID

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 21 / 29

Page 64: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Main Result: Tight Third-Order Term

Theorem (Tomamichel-Tan (2013))

If W is a DMC with positive ε-dispersion,

ρn ≤12

log n + O(1)

The 12 cannot be improved without further assumptions

For BSCρn =

12

log n + O(1)

We can dispense of the positive ε-dispersion assumption as well

No need for unique CAID

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 21 / 29

Page 65: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Main Result: Tight Third-Order Term

Theorem (Tomamichel-Tan (2013))

If W is a DMC with positive ε-dispersion,

ρn ≤12

log n + O(1)

The 12 cannot be improved without further assumptions

For BSCρn =

12

log n + O(1)

We can dispense of the positive ε-dispersion assumption as well

No need for unique CAID

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 21 / 29

Page 66: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Main Result: Tight Third-Order Term

Theorem (Tomamichel-Tan (2013))

If W is a DMC with positive ε-dispersion,

ρn ≤12

log n + O(1)

The 12 cannot be improved without further assumptions

For BSCρn =

12

log n + O(1)

We can dispense of the positive ε-dispersion assumption as well

No need for unique CAID

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 21 / 29

Page 67: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Main Result: Tight Third-Order Term

All cases are covered

����>

ZZZZ~

Yes

No

Vε > 0

≤nC−√

nVεQ−1(ε)+ 12 log n+O(1)

����>

ZZZZ~

Yes

No

not exoticor ε< 1

2

≤nC+O(1)

����>

ZZZZ~

Yes

No

exoticand ε= 1

2

≤nC+ 12 log n+O(1)

≤nC+O(n

13)

[PPV10]

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 22 / 29

Page 68: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique for Tight Third-Order Term

For the regular case, ρn ≤ 12 log n + O(1)

The type-counting trick and upper bounds on M∗P(Wn, ε) are notsufficiently tight

We need a new converse bound for general DMCs

Information spectrum divergence

Dεs (P‖Q) := sup

{R ∈ R

∣∣P(logP(X)

Q(X)≤ R

)≤ ε}

“Information Spectrum Methods in Information Theory”by T. S. Han (2003)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 23 / 29

Page 69: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique for Tight Third-Order Term

For the regular case, ρn ≤ 12 log n + O(1)

The type-counting trick and upper bounds on M∗P(Wn, ε) are notsufficiently tight

We need a new converse bound for general DMCs

Information spectrum divergence

Dεs (P‖Q) := sup

{R ∈ R

∣∣P(logP(X)

Q(X)≤ R

)≤ ε}

“Information Spectrum Methods in Information Theory”by T. S. Han (2003)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 23 / 29

Page 70: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique for Tight Third-Order Term

For the regular case, ρn ≤ 12 log n + O(1)

The type-counting trick and upper bounds on M∗P(Wn, ε) are notsufficiently tight

We need a new converse bound for general DMCs

Information spectrum divergence

Dεs (P‖Q) := sup

{R ∈ R

∣∣P(logP(X)

Q(X)≤ R

)≤ ε}

“Information Spectrum Methods in Information Theory”by T. S. Han (2003)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 23 / 29

Page 71: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique for Tight Third-Order Term

For the regular case, ρn ≤ 12 log n + O(1)

The type-counting trick and upper bounds on M∗P(Wn, ε) are notsufficiently tight

We need a new converse bound for general DMCs

Information spectrum divergence

Dεs (P‖Q) := sup

{R ∈ R

∣∣P(logP(X)

Q(X)≤ R

)≤ ε}

“Information Spectrum Methods in Information Theory”by T. S. Han (2003)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 23 / 29

Page 72: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: Information Spectrum Divergence

Dεs (P‖Q) := sup

{R ∈ R

∣∣P(logP(X)

Q(X)≤ R

)≤ ε}

t t -

“Density” of log P(X)Q(X)

R∗

ε 1− ε

If Xn is i.i.d. P, the central limit theorem yields

Dεs (Pn‖Qn) ≈ nD(P‖Q)−

√nV(P‖Q)Q−1(ε)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 24 / 29

Page 73: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: Information Spectrum Divergence

Dεs (P‖Q) := sup

{R ∈ R

∣∣P(logP(X)

Q(X)≤ R

)≤ ε}

t t -

“Density” of log P(X)Q(X)

R∗

ε

1− ε

If Xn is i.i.d. P, the central limit theorem yields

Dεs (Pn‖Qn) ≈ nD(P‖Q)−

√nV(P‖Q)Q−1(ε)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 24 / 29

Page 74: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: Information Spectrum Divergence

Dεs (P‖Q) := sup

{R ∈ R

∣∣P(logP(X)

Q(X)≤ R

)≤ ε}

t t -

“Density” of log P(X)Q(X)

R∗

ε 1− ε

If Xn is i.i.d. P, the central limit theorem yields

Dεs (Pn‖Qn) ≈ nD(P‖Q)−

√nV(P‖Q)Q−1(ε)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 24 / 29

Page 75: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: Information Spectrum Divergence

Dεs (P‖Q) := sup

{R ∈ R

∣∣P(logP(X)

Q(X)≤ R

)≤ ε}

t t -

“Density” of log P(X)Q(X)

R∗

ε 1− ε

If Xn is i.i.d. P, the central limit theorem yields

Dεs (Pn‖Qn) ≈ nD(P‖Q)−

√nV(P‖Q)Q−1(ε)

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 24 / 29

Page 76: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: The New Converse Bound

Lemma (Tomamichel-Tan (2013))

For every channel W, every ε ∈ (0, 1) and δ ∈ (0, 1− ε), we have

log M∗(W, ε) ≤ minQ∈P(Y)

maxx∈X

Dε+δs (W(·|x)‖Q) + log

When DMC is used n times,

log M∗(Wn, ε) ≤ minQ(n)∈P(Yn)

maxx∈X n

Dε+δs (Wn(·|x)‖Q(n)) + log

Choose δ = n−12 so log 1

δ = 12 log n

Since all x within a type class result in the same Dε+δs (if Q(n) is

permutation invariant), it’s really a max over types Px ∈ Pn(X )

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 25 / 29

Page 77: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: The New Converse Bound

Lemma (Tomamichel-Tan (2013))

For every channel W, every ε ∈ (0, 1) and δ ∈ (0, 1− ε), we have

log M∗(W, ε) ≤ minQ∈P(Y)

maxx∈X

Dε+δs (W(·|x)‖Q) + log

When DMC is used n times,

log M∗(Wn, ε) ≤ minQ(n)∈P(Yn)

maxx∈X n

Dε+δs (Wn(·|x)‖Q(n)) + log

Choose δ = n−12 so log 1

δ = 12 log n

Since all x within a type class result in the same Dε+δs (if Q(n) is

permutation invariant), it’s really a max over types Px ∈ Pn(X )

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 25 / 29

Page 78: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: The New Converse Bound

Lemma (Tomamichel-Tan (2013))

For every channel W, every ε ∈ (0, 1) and δ ∈ (0, 1− ε), we have

log M∗(W, ε) ≤ minQ∈P(Y)

maxx∈X

Dε+δs (W(·|x)‖Q) + log

When DMC is used n times,

log M∗(Wn, ε) ≤ minQ(n)∈P(Yn)

maxx∈X n

Dε+δs (Wn(·|x)‖Q(n)) + log

Choose δ = n−12 so log 1

δ = 12 log n

Since all x within a type class result in the same Dε+δs (if Q(n) is

permutation invariant), it’s really a max over types Px ∈ Pn(X )

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 25 / 29

Page 79: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: The New Converse Bound

Lemma (Tomamichel-Tan (2013))

For every channel W, every ε ∈ (0, 1) and δ ∈ (0, 1− ε), we have

log M∗(W, ε) ≤ minQ∈P(Y)

maxx∈X

Dε+δs (W(·|x)‖Q) + log

When DMC is used n times,

log M∗(Wn, ε) ≤ minQ(n)∈P(Yn)

maxx∈X n

Dε+δs (Wn(·|x)‖Q(n)) + log

Choose δ = n−12 so log 1

δ = 12 log n

Since all x within a type class result in the same Dε+δs (if Q(n) is

permutation invariant), it’s really a max over types Px ∈ Pn(X )

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 25 / 29

Page 80: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: Choice of Output Distribution

log M∗(Wn, ε) ≤ maxx∈X n

Dε+δs (Wn(·|x)‖Q(n)) + log

1δ, ∀Q(n) ∈ P(Yn)

Q(n)(y): invariant to permutations of the n channel uses

Q(n)(y) :=12

∑k∈K

λ(k)Qnk(y) +

12

∑P∈Pn(X )

1|Pn(X )|

(PW)n(y)

First term: Qk’s and λ(k)’s designed to form an n−12 -cover of P(Y):

∀Q ∈ P(Y), ∃k ∈ K s.t. ‖Q− Qk‖2 ≤ n−12 .

Second term: Mixture over output distributions induced by inputtypes [Hayashi (2009)]

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 26 / 29

Page 81: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: Choice of Output Distribution

log M∗(Wn, ε) ≤ maxx∈X n

Dε+δs (Wn(·|x)‖Q(n)) + log

1δ, ∀Q(n) ∈ P(Yn)

Q(n)(y): invariant to permutations of the n channel uses

Q(n)(y) :=12

∑k∈K

λ(k)Qnk(y) +

12

∑P∈Pn(X )

1|Pn(X )|

(PW)n(y)

First term: Qk’s and λ(k)’s designed to form an n−12 -cover of P(Y):

∀Q ∈ P(Y), ∃k ∈ K s.t. ‖Q− Qk‖2 ≤ n−12 .

Second term: Mixture over output distributions induced by inputtypes [Hayashi (2009)]

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 26 / 29

Page 82: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: Choice of Output Distribution

log M∗(Wn, ε) ≤ maxx∈X n

Dε+δs (Wn(·|x)‖Q(n)) + log

1δ, ∀Q(n) ∈ P(Yn)

Q(n)(y): invariant to permutations of the n channel uses

Q(n)(y) :=12

∑k∈K

λ(k)Qnk(y) +

12

∑P∈Pn(X )

1|Pn(X )|

(PW)n(y)

First term: Qk’s and λ(k)’s designed to form an n−12 -cover of P(Y):

∀Q ∈ P(Y), ∃k ∈ K s.t. ‖Q− Qk‖2 ≤ n−12 .

Second term: Mixture over output distributions induced by inputtypes [Hayashi (2009)]

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 26 / 29

Page 83: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: Choice of Output Distribution

log M∗(Wn, ε) ≤ maxx∈X n

Dε+δs (Wn(·|x)‖Q(n)) + log

1δ, ∀Q(n) ∈ P(Yn)

Q(n)(y): invariant to permutations of the n channel uses

Q(n)(y) :=12

∑k∈K

λ(k)Qnk(y) +

12

∑P∈Pn(X )

1|Pn(X )|

(PW)n(y)

First term: Qk’s and λ(k)’s designed to form an n−12 -cover of P(Y):

∀Q ∈ P(Y), ∃k ∈ K s.t. ‖Q− Qk‖2 ≤ n−12 .

Second term: Mixture over output distributions induced by inputtypes [Hayashi (2009)]

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 26 / 29

Page 84: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: Choice of Output Distribution

Q(n)(y) :=12

∑k∈K

λ(k)Qnk(y) +

12

∑P∈Pn(X )

1|Pn(X )|

(PW)n(y)

-

6

Q(0)

Q(1)

(0, 1)

(1, 0) P(Y)���)@@@@@@@@@@@

sQ∗ s s s

sssQ[−1,1]

Q[1,−1]

Q[2,−2]

Q[−2,2]

1√2n

1√2n

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 27 / 29

Page 85: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: Choice of Output Distribution

Q(n)(y) :=12

∑k∈K

λ(k)Qnk(y) +

12

∑P∈Pn(X )

1|Pn(X )|

(PW)n(y)

-

6

Q(0)

Q(1)

(0, 1)

(1, 0) P(Y)���)@@@@@@@@@@@

sQ∗

s s s

sssQ[−1,1]

Q[1,−1]

Q[2,−2]

Q[−2,2]

1√2n

1√2n

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 27 / 29

Page 86: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: Choice of Output Distribution

Q(n)(y) :=12

∑k∈K

λ(k)Qnk(y) +

12

∑P∈Pn(X )

1|Pn(X )|

(PW)n(y)

-

6

Q(0)

Q(1)

(0, 1)

(1, 0) P(Y)���)@@@@@@@@@@@

sQ∗ s s s

sss

Q[−1,1]

Q[1,−1]

Q[2,−2]

Q[−2,2]

1√2n

1√2n

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 27 / 29

Page 87: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: Choice of Output Distribution

Q(n)(y) :=12

∑k∈K

λ(k)Qnk(y) +

12

∑P∈Pn(X )

1|Pn(X )|

(PW)n(y)

-

6

Q(0)

Q(1)

(0, 1)

(1, 0) P(Y)���)@@@@@@@@@@@

sQ∗ s s s

sssQ[−1,1]

Q[1,−1]

Q[2,−2]

Q[−2,2]

1√2n

1√2n

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 27 / 29

Page 88: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: Choice of Output Distribution

Q(n)(y) :=12

∑k∈K

λ(k)Qnk(y) +

12

∑P∈Pn(X )

1|Pn(X )|

(PW)n(y)

-

6

Q(0)

Q(1)

(0, 1)

(1, 0) P(Y)���)@@@@@@@@@@@

sQ∗ s s s

sssQ[−1,1]

Q[1,−1]

Q[2,−2]

Q[−2,2]

1√2n

1√2n

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 27 / 29

Page 89: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: Summary

Q(n)(y) :=12

∑k∈K

λ(k)Qnk(y) +

12

∑P∈Pn(X )

1|Pn(X )|

(PW)n(y)

This construction ensures that for every type Px near the CAID iswell-approximated by by a Qk(x)

Well in the sense that the loss is

− logλ(k) = O(1)

for every x such that Px is near the CAID

For types Px far from the CAID, use the second part and

I(Px,W) ≤ C′ < C

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 28 / 29

Page 90: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: Summary

Q(n)(y) :=12

∑k∈K

λ(k)Qnk(y) +

12

∑P∈Pn(X )

1|Pn(X )|

(PW)n(y)

This construction ensures that for every type Px near the CAID iswell-approximated by by a Qk(x)

Well in the sense that the loss is

− logλ(k) = O(1)

for every x such that Px is near the CAID

For types Px far from the CAID, use the second part and

I(Px,W) ≤ C′ < C

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 28 / 29

Page 91: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Proof Technique: Summary

Q(n)(y) :=12

∑k∈K

λ(k)Qnk(y) +

12

∑P∈Pn(X )

1|Pn(X )|

(PW)n(y)

This construction ensures that for every type Px near the CAID iswell-approximated by by a Qk(x)

Well in the sense that the loss is

− logλ(k) = O(1)

for every x such that Px is near the CAID

For types Px far from the CAID, use the second part and

I(Px,W) ≤ C′ < C

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 28 / 29

Page 92: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Summary and Food for Thought

We showed that for DMCs with positive ε-dispersion,

log M∗(Wn, ε) ≤ nC −√

nVεQ−1(ε) +12

log n + O(1)

How important is the assumption of discreteness?

Does our uniform quantization technique extend to lossy sourcecoding? [Ingber-Kochman (2010), Kostina-Verdú (2012)]

Alternate proof using Bahadur-Ranga Rao [Moulin (2012)]?

P

(1n

n∑i=1

Xi ≥ c

)= Θ

(exp(−nI(c))√

n

)

This result has been used to refine the sphere-packing bound[Altug-Wagner (2012)]

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 29 / 29

Page 93: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Summary and Food for Thought

We showed that for DMCs with positive ε-dispersion,

log M∗(Wn, ε) ≤ nC −√

nVεQ−1(ε) +12

log n + O(1)

How important is the assumption of discreteness?

Does our uniform quantization technique extend to lossy sourcecoding? [Ingber-Kochman (2010), Kostina-Verdú (2012)]

Alternate proof using Bahadur-Ranga Rao [Moulin (2012)]?

P

(1n

n∑i=1

Xi ≥ c

)= Θ

(exp(−nI(c))√

n

)

This result has been used to refine the sphere-packing bound[Altug-Wagner (2012)]

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 29 / 29

Page 94: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Summary and Food for Thought

We showed that for DMCs with positive ε-dispersion,

log M∗(Wn, ε) ≤ nC −√

nVεQ−1(ε) +12

log n + O(1)

How important is the assumption of discreteness?

Does our uniform quantization technique extend to lossy sourcecoding? [Ingber-Kochman (2010), Kostina-Verdú (2012)]

Alternate proof using Bahadur-Ranga Rao [Moulin (2012)]?

P

(1n

n∑i=1

Xi ≥ c

)= Θ

(exp(−nI(c))√

n

)

This result has been used to refine the sphere-packing bound[Altug-Wagner (2012)]

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 29 / 29

Page 95: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Summary and Food for Thought

We showed that for DMCs with positive ε-dispersion,

log M∗(Wn, ε) ≤ nC −√

nVεQ−1(ε) +12

log n + O(1)

How important is the assumption of discreteness?

Does our uniform quantization technique extend to lossy sourcecoding? [Ingber-Kochman (2010), Kostina-Verdú (2012)]

Alternate proof using Bahadur-Ranga Rao [Moulin (2012)]?

P

(1n

n∑i=1

Xi ≥ c

)= Θ

(exp(−nI(c))√

n

)

This result has been used to refine the sphere-packing bound[Altug-Wagner (2012)]

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 29 / 29

Page 96: On Third-Order Asymptotics for DMCs · On Third-Order Asymptotics for DMCs Vincent Y. F. Tan Institute for Infocomm Research (I2R) National University of Singapore (NUS) January 20,

Summary and Food for Thought

We showed that for DMCs with positive ε-dispersion,

log M∗(Wn, ε) ≤ nC −√

nVεQ−1(ε) +12

log n + O(1)

How important is the assumption of discreteness?

Does our uniform quantization technique extend to lossy sourcecoding? [Ingber-Kochman (2010), Kostina-Verdú (2012)]

Alternate proof using Bahadur-Ranga Rao [Moulin (2012)]?

P

(1n

n∑i=1

Xi ≥ c

)= Θ

(exp(−nI(c))√

n

)

This result has been used to refine the sphere-packing bound[Altug-Wagner (2012)]

Vincent Tan (I2R and NUS) Third-Order Asymptotics for DMCs HKTW Workshop 2013 29 / 29


Recommended