+ All Categories
Home > Documents > Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains...

Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains...

Date post: 25-Jul-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
89
Spectral Theory of Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet Energy References Spectral Theory of Finite Markov Chains Austin Eide University Of Nebraska – Lincoln Spring 2020 1 / 25
Transcript
Page 1: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Spectral Theory of Finite Markov Chains

Austin Eide

University Of Nebraska – Lincoln

Spring 2020

1 / 25

Page 2: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Markov chains

Definition (Markov Chain)

A Markov chain on state space X is a sequence of X -valuedr.v.’s (X0, X1, . . . ) satisfying the Markov property :

P(Xt+1 = y|(Xt, . . . , X0)) = P(Xt+1 = y|Xt = x) =: P (x, y)

A chain is thus entirely described by an initial distributionµ0 ∈ R|X | for X0 and a |X | × |X | row-stochastic matrix Pwhich stores transition probabilities.

2 / 25

Page 3: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Markov chains

Definition (Markov Chain)

A Markov chain on state space X is a sequence of X -valuedr.v.’s (X0, X1, . . . ) satisfying the Markov property :

P(Xt+1 = y|(Xt, . . . , X0)) = P(Xt+1 = y|Xt = x) =: P (x, y)

A chain is thus entirely described by an initial distributionµ0 ∈ R|X | for X0 and a |X | × |X | row-stochastic matrix Pwhich stores transition probabilities.

2 / 25

Page 4: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

The transition matrix

If today’s distribution (i.e., the distribution on Xt) is µt, thentomorrow’s distribution is µt+1 = µtP.

Given the initial distribution µ0, inductively we have µt = µ0Pt.

Note: almost always, we’ll think of µ0 as a point mass onsome state x ∈ X .

3 / 25

Page 5: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

The transition matrix

If today’s distribution (i.e., the distribution on Xt) is µt, thentomorrow’s distribution is µt+1 = µtP.

Given the initial distribution µ0, inductively we have µt = µ0Pt.

Note: almost always, we’ll think of µ0 as a point mass onsome state x ∈ X .

3 / 25

Page 6: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

The transition matrix

If today’s distribution (i.e., the distribution on Xt) is µt, thentomorrow’s distribution is µt+1 = µtP.

Given the initial distribution µ0, inductively we have µt = µ0Pt.

Note: almost always, we’ll think of µ0 as a point mass onsome state x ∈ X .

3 / 25

Page 7: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

The transition matrix

If today’s distribution (i.e., the distribution on Xt) is µt, thentomorrow’s distribution is µt+1 = µtP.

Given the initial distribution µ0, inductively we have µt = µ0Pt.

Note: almost always, we’ll think of µ0 as a point mass onsome state x ∈ X .

3 / 25

Page 8: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Irreducibility & Aperiodicity

For x, y ∈ X and t ≥ 0, P t(x, y) is the probability of travelingfrom x to y in t steps.

Definition (I. & A.)

A chain is irreducible if ∀ pairs x, y ∈ X , ∃ integer t withP t(x, y) > 0.

A chain is aperiodic if

gcdt ≥ 1 : P t(x, x) > 0 = 1.

(For example, a “bipartite” chain is periodic, since then theabove quantity is 2.)

4 / 25

Page 9: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Irreducibility & Aperiodicity

For x, y ∈ X and t ≥ 0, P t(x, y) is the probability of travelingfrom x to y in t steps.

Definition (I. & A.)

A chain is irreducible if ∀ pairs x, y ∈ X , ∃ integer t withP t(x, y) > 0.

A chain is aperiodic if

gcdt ≥ 1 : P t(x, x) > 0 = 1.

(For example, a “bipartite” chain is periodic, since then theabove quantity is 2.)

4 / 25

Page 10: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Irreducibility & Aperiodicity

For x, y ∈ X and t ≥ 0, P t(x, y) is the probability of travelingfrom x to y in t steps.

Definition (I. & A.)

A chain is irreducible if ∀ pairs x, y ∈ X , ∃ integer t withP t(x, y) > 0.

A chain is aperiodic if

gcdt ≥ 1 : P t(x, x) > 0 = 1.

(For example, a “bipartite” chain is periodic, since then theabove quantity is 2.)

4 / 25

Page 11: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Irreducibility & Aperiodicity

For x, y ∈ X and t ≥ 0, P t(x, y) is the probability of travelingfrom x to y in t steps.

Definition (I. & A.)

A chain is irreducible if ∀ pairs x, y ∈ X , ∃ integer t withP t(x, y) > 0.

A chain is aperiodic if

gcdt ≥ 1 : P t(x, x) > 0 = 1.

(For example, a “bipartite” chain is periodic, since then theabove quantity is 2.)

4 / 25

Page 12: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Irreducibility & Aperiodicity

For x, y ∈ X and t ≥ 0, P t(x, y) is the probability of travelingfrom x to y in t steps.

Definition (I. & A.)

A chain is irreducible if ∀ pairs x, y ∈ X , ∃ integer t withP t(x, y) > 0.

A chain is aperiodic if

gcdt ≥ 1 : P t(x, x) > 0 = 1.

(For example, a “bipartite” chain is periodic, since then theabove quantity is 2.)

4 / 25

Page 13: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Perron-Frobenius

Theorem

If P is irreducible and aperiodic, then ∃! distribution π suchthat πP = π, and moreover for any µ0 we have µ0P

t → π.

Proof.

I + A =⇒ P t > 0 for all t sufficiently large.

Easy to show that σ(P ) ≤ 1, and that 1 is an eigenvalue.

Thus, by Perron-Frobenius (and a corollary thereof):

P has a unique, strictly positive left eigenvector π witheigenvalue 1—the stationary distribution of P

For any distribution µ0 on X , µ0Pt → π

5 / 25

Page 14: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Perron-Frobenius

Theorem

If P is irreducible and aperiodic, then ∃! distribution π suchthat πP = π, and moreover for any µ0 we have µ0P

t → π.

Proof.

I + A =⇒ P t > 0 for all t sufficiently large.

Easy to show that σ(P ) ≤ 1, and that 1 is an eigenvalue.

Thus, by Perron-Frobenius (and a corollary thereof):

P has a unique, strictly positive left eigenvector π witheigenvalue 1—the stationary distribution of P

For any distribution µ0 on X , µ0Pt → π

5 / 25

Page 15: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Perron-Frobenius

Theorem

If P is irreducible and aperiodic, then ∃! distribution π suchthat πP = π, and moreover for any µ0 we have µ0P

t → π.

Proof.

I + A =⇒ P t > 0 for all t sufficiently large.

Easy to show that σ(P ) ≤ 1, and that 1 is an eigenvalue.

Thus, by Perron-Frobenius (and a corollary thereof):

P has a unique, strictly positive left eigenvector π witheigenvalue 1—the stationary distribution of P

For any distribution µ0 on X , µ0Pt → π

5 / 25

Page 16: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Perron-Frobenius

Theorem

If P is irreducible and aperiodic, then ∃! distribution π suchthat πP = π, and moreover for any µ0 we have µ0P

t → π.

Proof.

I + A =⇒ P t > 0 for all t sufficiently large.

Easy to show that σ(P ) ≤ 1, and that 1 is an eigenvalue.

Thus, by Perron-Frobenius (and a corollary thereof):

P has a unique, strictly positive left eigenvector π witheigenvalue 1—the stationary distribution of P

For any distribution µ0 on X , µ0Pt → π

5 / 25

Page 17: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Perron-Frobenius

Theorem

If P is irreducible and aperiodic, then ∃! distribution π suchthat πP = π, and moreover for any µ0 we have µ0P

t → π.

Proof.

I + A =⇒ P t > 0 for all t sufficiently large.

Easy to show that σ(P ) ≤ 1, and that 1 is an eigenvalue.

Thus, by Perron-Frobenius (and a corollary thereof):

P has a unique, strictly positive left eigenvector π witheigenvalue 1—the stationary distribution of P

For any distribution µ0 on X , µ0Pt → π

5 / 25

Page 18: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Perron-Frobenius

Theorem

If P is irreducible and aperiodic, then ∃! distribution π suchthat πP = π, and moreover for any µ0 we have µ0P

t → π.

Proof.

I + A =⇒ P t > 0 for all t sufficiently large.

Easy to show that σ(P ) ≤ 1, and that 1 is an eigenvalue.

Thus, by Perron-Frobenius (and a corollary thereof):

P has a unique, strictly positive left eigenvector π witheigenvalue 1—the stationary distribution of P

For any distribution µ0 on X , µ0Pt → π

5 / 25

Page 19: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Convergence in...?

Usually,

Definition (Total Variation Distance)

For probability distributions µ, ν ∈ R|X | on X , define

‖µ− ν‖TV =1

2

∑x∈X|µ(x)− ν(x)| = 1

2‖µ− ν‖1 .

Equivalent to ‖µ− ν‖TV = maxA⊆Ω |µ(A)− ν(A)|.

6 / 25

Page 20: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Convergence in...?

Usually,

Definition (Total Variation Distance)

For probability distributions µ, ν ∈ R|X | on X , define

‖µ− ν‖TV =1

2

∑x∈X|µ(x)− ν(x)| = 1

2‖µ− ν‖1 .

Equivalent to ‖µ− ν‖TV = maxA⊆Ω |µ(A)− ν(A)|.

6 / 25

Page 21: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Convergence in...?

Usually,

Definition (Total Variation Distance)

For probability distributions µ, ν ∈ R|X | on X , define

‖µ− ν‖TV =1

2

∑x∈X|µ(x)− ν(x)| = 1

2‖µ− ν‖1 .

Equivalent to ‖µ− ν‖TV = maxA⊆Ω |µ(A)− ν(A)|.

6 / 25

Page 22: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Mixing times

For x ∈ X , let µx ∈ R|X | be the point-mass distribution at x.

Define d(t) := maxx∥∥µxP t − π∥∥TV .

For ε > 0, define tmix(ε) = mint ∈ Z≥0 : d(t) < ε.

tmix(ε) is the (ε)-mixing time of the chain.

7 / 25

Page 23: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Mixing times

For x ∈ X , let µx ∈ R|X | be the point-mass distribution at x.

Define d(t) := maxx∥∥µxP t − π∥∥TV .

For ε > 0, define tmix(ε) = mint ∈ Z≥0 : d(t) < ε.

tmix(ε) is the (ε)-mixing time of the chain.

7 / 25

Page 24: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Mixing times

For x ∈ X , let µx ∈ R|X | be the point-mass distribution at x.

Define d(t) := maxx∥∥µxP t − π∥∥TV .

For ε > 0, define tmix(ε) = mint ∈ Z≥0 : d(t) < ε.

tmix(ε) is the (ε)-mixing time of the chain.

7 / 25

Page 25: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Mixing times

For x ∈ X , let µx ∈ R|X | be the point-mass distribution at x.

Define d(t) := maxx∥∥µxP t − π∥∥TV .

For ε > 0, define tmix(ε) = mint ∈ Z≥0 : d(t) < ε.

tmix(ε) is the (ε)-mixing time of the chain.

7 / 25

Page 26: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Reversibility & r.w.’s on graphs

Henceforth, we’ll restrict attention to chains which are randomwalks on edge-weighted graphs

Proceed by P (x, yi) = wi∑wi. What do we get when all edges

have weight 1?

8 / 25

Page 27: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Reversibility & r.w.’s on graphs

Henceforth, we’ll restrict attention to chains which are randomwalks on edge-weighted graphs

Proceed by P (x, yi) = wi∑wi. What do we get when all edges

have weight 1?

8 / 25

Page 28: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Reversibility & r.w.’s on graphs

Henceforth, we’ll restrict attention to chains which are randomwalks on edge-weighted graphs

Proceed by P (x, yi) = wi∑wi.

What do we get when all edgeshave weight 1?

8 / 25

Page 29: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Reversibility & r.w.’s on graphs

Henceforth, we’ll restrict attention to chains which are randomwalks on edge-weighted graphs

Proceed by P (x, yi) = wi∑wi. What do we get when all edges

have weight 1?8 / 25

Page 30: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Reversibility & r.w.’s on graphs

Reversibility is the core property relating general chains tor.w.’s on graphs.

Definition (Reversibility)

A Markov chain is reversible with respect to stationarydistribution π if ∀x, y ∈ X ,

π(x)P (x, y) = π(y)P (y, x).

reversible chains P ⇐⇒ weighted graphs

P 7→ GP where V (GP ) = X ,edge weights π(x)P (x, y) = π(y)P (y, x)

9 / 25

Page 31: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Reversibility & r.w.’s on graphs

Reversibility is the core property relating general chains tor.w.’s on graphs.

Definition (Reversibility)

A Markov chain is reversible with respect to stationarydistribution π if ∀x, y ∈ X ,

π(x)P (x, y) = π(y)P (y, x).

reversible chains P ⇐⇒ weighted graphs

P 7→ GP where V (GP ) = X ,edge weights π(x)P (x, y) = π(y)P (y, x)

9 / 25

Page 32: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Reversibility & r.w.’s on graphs

Reversibility is the core property relating general chains tor.w.’s on graphs.

Definition (Reversibility)

A Markov chain is reversible with respect to stationarydistribution π if ∀x, y ∈ X ,

π(x)P (x, y) = π(y)P (y, x).

reversible chains P ⇐⇒ weighted graphs

P 7→ GP where V (GP ) = X ,edge weights π(x)P (x, y) = π(y)P (y, x)

9 / 25

Page 33: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Reversibility aka “Detailed Balance”

10 / 25

Page 34: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

The π-inner product

If P is irreducible and reversible w.r.t. π, then〈·, ·〉π : R|X | → R by

〈f, g〉π =∑x∈X

f(x)g(x)π(x)

is an inner product on R|X |, which is a Hilbert space withrespect to 〈·, ·〉π.

So...

11 / 25

Page 35: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

The π-inner product

If P is irreducible and reversible w.r.t. π, then〈·, ·〉π : R|X | → R by

〈f, g〉π =∑x∈X

f(x)g(x)π(x)

is an inner product on R|X |, which is a Hilbert space withrespect to 〈·, ·〉π.

So...

11 / 25

Page 36: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

The spectral representation of the chain

Lemma

Let P be aperiodic, irreducible, and reversible with respect toπ. Then:

1 P is a self-adjoint operator on (R|X |, 〈·, ·〉π).

2 1 has multiplicity 1 as an eigenvalue of P , and thecorresponding (right) eigenspace is spanned by the all 1’svector 1.

3 −1 is not an eigenvalue of P.

Let λ∗ = max|λ| : λ ∈ spec(P ), λ 6= 1. By the above andfact σ(P ) = 1, have 0 ≤ λ∗ < 1.

12 / 25

Page 37: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

The spectral representation of the chain

Lemma

Let P be aperiodic, irreducible, and reversible with respect toπ. Then:

1 P is a self-adjoint operator on (R|X |, 〈·, ·〉π).

2 1 has multiplicity 1 as an eigenvalue of P , and thecorresponding (right) eigenspace is spanned by the all 1’svector 1.

3 −1 is not an eigenvalue of P.

Let λ∗ = max|λ| : λ ∈ spec(P ), λ 6= 1.

By the above andfact σ(P ) = 1, have 0 ≤ λ∗ < 1.

12 / 25

Page 38: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

The spectral representation of the chain

Lemma

Let P be aperiodic, irreducible, and reversible with respect toπ. Then:

1 P is a self-adjoint operator on (R|X |, 〈·, ·〉π).

2 1 has multiplicity 1 as an eigenvalue of P , and thecorresponding (right) eigenspace is spanned by the all 1’svector 1.

3 −1 is not an eigenvalue of P.

Let λ∗ = max|λ| : λ ∈ spec(P ), λ 6= 1. By the above andfact σ(P ) = 1, have 0 ≤ λ∗ < 1.

12 / 25

Page 39: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time with λ∗

Recalld(t) = max

x∈X

∥∥µxP t − π∥∥TV .

λ∗ controls the asymptotic (in t) rate of convergence of d(t) to0, i.e., for some c and C which depend on P we have

cλt∗ ≤ d(t) ≤ Cλt∗.

13 / 25

Page 40: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time with λ∗

Recalld(t) = max

x∈X

∥∥µxP t − π∥∥TV .λ∗ controls the asymptotic (in t) rate of convergence of d(t) to0, i.e., for some c and C which depend on P we have

cλt∗ ≤ d(t) ≤ Cλt∗.

13 / 25

Page 41: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

A statistical perspective: think of vector f ∈ R|X | as a function(“statistic”) on X .

Distinguish the distributions µxPt and π using the statistic f .

14 / 25

Page 42: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

A statistical perspective: think of vector f ∈ R|X | as a function(“statistic”) on X .

Distinguish the distributions µxPt and π using the statistic f .

14 / 25

Page 43: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

A statistical perspective: think of vector f ∈ R|X | as a function(“statistic”) on X .

Distinguish the distributions µxPt and π using the statistic f .

14 / 25

Page 44: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Theorem (Spectral Lower Bound)

For P as before, ε > 0:

tmix(ε) ≥(

1

1− λ∗− 1

)log

1

2ε.

Proof.

For any f ∈ R|X | and x ∈ X ,

|EµxP t(f)− Eπ(f)| =

∣∣∣∣∣∣∑y∈X

(µxPt(y)− π(y))f(y)

∣∣∣∣∣∣≤ ‖f‖∞ 2d(t)

where Eν(·) is expected value taken against distribution ν.

15 / 25

Page 45: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Theorem (Spectral Lower Bound)

For P as before, ε > 0:

tmix(ε) ≥(

1

1− λ∗− 1

)log

1

2ε.

Proof.

For any f ∈ R|X | and x ∈ X ,

|EµxP t(f)− Eπ(f)| =

∣∣∣∣∣∣∑y∈X

(µxPt(y)− π(y))f(y)

∣∣∣∣∣∣≤ ‖f‖∞ 2d(t)

where Eν(·) is expected value taken against distribution ν.

15 / 25

Page 46: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Theorem (Spectral Lower Bound)

For P as before, ε > 0:

tmix(ε) ≥(

1

1− λ∗− 1

)log

1

2ε.

Proof.

For any f ∈ R|X | and x ∈ X ,

|EµxP t(f)− Eπ(f)| =

∣∣∣∣∣∣∑y∈X

(µxPt(y)− π(y))f(y)

∣∣∣∣∣∣≤ ‖f‖∞ 2d(t)

where Eν(·) is expected value taken against distribution ν.

15 / 25

Page 47: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Theorem (Spectral Lower Bound)

For P as before, ε > 0:

tmix(ε) ≥(

1

1− λ∗− 1

)log

1

2ε.

Proof.

For any f ∈ R|X | and x ∈ X ,

|EµxP t(f)− Eπ(f)| =

∣∣∣∣∣∣∑y∈X

(µxPt(y)− π(y))f(y)

∣∣∣∣∣∣

≤ ‖f‖∞ 2d(t)

where Eν(·) is expected value taken against distribution ν.

15 / 25

Page 48: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Theorem (Spectral Lower Bound)

For P as before, ε > 0:

tmix(ε) ≥(

1

1− λ∗− 1

)log

1

2ε.

Proof.

For any f ∈ R|X | and x ∈ X ,

|EµxP t(f)− Eπ(f)| =

∣∣∣∣∣∣∑y∈X

(µxPt(y)− π(y))f(y)

∣∣∣∣∣∣≤ ‖f‖∞ 2d(t)

where Eν(·) is expected value taken against distribution ν.15 / 25

Page 49: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Proof.

We have |EµxP t(f)− Eπ(f)| ≤ ‖f‖∞ 2d(t). So any lowerbound on the LHS gives a lower bound on d(t).

If f is an eigenvector of P with eigenvalue λ 6= 1, we know twothings:

1 Eπ(f) = πf = πPf = λπf = λEπ(f) =⇒ Eπ(f) = 0.

2 EµxP t(f) = µxPtf = µxλ

tf = λtf(x).

So |λtf(x)| ≤ ‖f‖∞ 2d(t) for any x and eigenvalue λ 6= 1.

16 / 25

Page 50: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Proof.

We have |EµxP t(f)− Eπ(f)| ≤ ‖f‖∞ 2d(t). So any lowerbound on the LHS gives a lower bound on d(t).

If f is an eigenvector of P with eigenvalue λ 6= 1, we know twothings:

1 Eπ(f) = πf = πPf = λπf = λEπ(f) =⇒ Eπ(f) = 0.

2 EµxP t(f) = µxPtf = µxλ

tf = λtf(x).

So |λtf(x)| ≤ ‖f‖∞ 2d(t) for any x and eigenvalue λ 6= 1.

16 / 25

Page 51: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Proof.

We have |EµxP t(f)− Eπ(f)| ≤ ‖f‖∞ 2d(t). So any lowerbound on the LHS gives a lower bound on d(t).

If f is an eigenvector of P with eigenvalue λ 6= 1, we know twothings:

1 Eπ(f) =

πf = πPf = λπf = λEπ(f) =⇒ Eπ(f) = 0.

2 EµxP t(f) = µxPtf = µxλ

tf = λtf(x).

So |λtf(x)| ≤ ‖f‖∞ 2d(t) for any x and eigenvalue λ 6= 1.

16 / 25

Page 52: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Proof.

We have |EµxP t(f)− Eπ(f)| ≤ ‖f‖∞ 2d(t). So any lowerbound on the LHS gives a lower bound on d(t).

If f is an eigenvector of P with eigenvalue λ 6= 1, we know twothings:

1 Eπ(f) = πf =

πPf = λπf = λEπ(f) =⇒ Eπ(f) = 0.

2 EµxP t(f) = µxPtf = µxλ

tf = λtf(x).

So |λtf(x)| ≤ ‖f‖∞ 2d(t) for any x and eigenvalue λ 6= 1.

16 / 25

Page 53: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Proof.

We have |EµxP t(f)− Eπ(f)| ≤ ‖f‖∞ 2d(t). So any lowerbound on the LHS gives a lower bound on d(t).

If f is an eigenvector of P with eigenvalue λ 6= 1, we know twothings:

1 Eπ(f) = πf = πPf =

λπf = λEπ(f) =⇒ Eπ(f) = 0.

2 EµxP t(f) = µxPtf = µxλ

tf = λtf(x).

So |λtf(x)| ≤ ‖f‖∞ 2d(t) for any x and eigenvalue λ 6= 1.

16 / 25

Page 54: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Proof.

We have |EµxP t(f)− Eπ(f)| ≤ ‖f‖∞ 2d(t). So any lowerbound on the LHS gives a lower bound on d(t).

If f is an eigenvector of P with eigenvalue λ 6= 1, we know twothings:

1 Eπ(f) = πf = πPf = λπf =

λEπ(f) =⇒ Eπ(f) = 0.

2 EµxP t(f) = µxPtf = µxλ

tf = λtf(x).

So |λtf(x)| ≤ ‖f‖∞ 2d(t) for any x and eigenvalue λ 6= 1.

16 / 25

Page 55: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Proof.

We have |EµxP t(f)− Eπ(f)| ≤ ‖f‖∞ 2d(t). So any lowerbound on the LHS gives a lower bound on d(t).

If f is an eigenvector of P with eigenvalue λ 6= 1, we know twothings:

1 Eπ(f) = πf = πPf = λπf = λEπ(f)

=⇒ Eπ(f) = 0.

2 EµxP t(f) = µxPtf = µxλ

tf = λtf(x).

So |λtf(x)| ≤ ‖f‖∞ 2d(t) for any x and eigenvalue λ 6= 1.

16 / 25

Page 56: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Proof.

We have |EµxP t(f)− Eπ(f)| ≤ ‖f‖∞ 2d(t). So any lowerbound on the LHS gives a lower bound on d(t).

If f is an eigenvector of P with eigenvalue λ 6= 1, we know twothings:

1 Eπ(f) = πf = πPf = λπf = λEπ(f) =⇒ Eπ(f) = 0.

2 EµxP t(f) = µxPtf = µxλ

tf = λtf(x).

So |λtf(x)| ≤ ‖f‖∞ 2d(t) for any x and eigenvalue λ 6= 1.

16 / 25

Page 57: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Proof.

We have |EµxP t(f)− Eπ(f)| ≤ ‖f‖∞ 2d(t). So any lowerbound on the LHS gives a lower bound on d(t).

If f is an eigenvector of P with eigenvalue λ 6= 1, we know twothings:

1 Eπ(f) = πf = πPf = λπf = λEπ(f) =⇒ Eπ(f) = 0.

2 EµxP t(f) =

µxPtf = µxλ

tf = λtf(x).

So |λtf(x)| ≤ ‖f‖∞ 2d(t) for any x and eigenvalue λ 6= 1.

16 / 25

Page 58: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Proof.

We have |EµxP t(f)− Eπ(f)| ≤ ‖f‖∞ 2d(t). So any lowerbound on the LHS gives a lower bound on d(t).

If f is an eigenvector of P with eigenvalue λ 6= 1, we know twothings:

1 Eπ(f) = πf = πPf = λπf = λEπ(f) =⇒ Eπ(f) = 0.

2 EµxP t(f) = µxPtf =

µxλtf = λtf(x).

So |λtf(x)| ≤ ‖f‖∞ 2d(t) for any x and eigenvalue λ 6= 1.

16 / 25

Page 59: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Proof.

We have |EµxP t(f)− Eπ(f)| ≤ ‖f‖∞ 2d(t). So any lowerbound on the LHS gives a lower bound on d(t).

If f is an eigenvector of P with eigenvalue λ 6= 1, we know twothings:

1 Eπ(f) = πf = πPf = λπf = λEπ(f) =⇒ Eπ(f) = 0.

2 EµxP t(f) = µxPtf = µxλ

tf =

λtf(x).

So |λtf(x)| ≤ ‖f‖∞ 2d(t) for any x and eigenvalue λ 6= 1.

16 / 25

Page 60: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Proof.

We have |EµxP t(f)− Eπ(f)| ≤ ‖f‖∞ 2d(t). So any lowerbound on the LHS gives a lower bound on d(t).

If f is an eigenvector of P with eigenvalue λ 6= 1, we know twothings:

1 Eπ(f) = πf = πPf = λπf = λEπ(f) =⇒ Eπ(f) = 0.

2 EµxP t(f) = µxPtf = µxλ

tf = λtf(x).

So |λtf(x)| ≤ ‖f‖∞ 2d(t) for any x and eigenvalue λ 6= 1.

16 / 25

Page 61: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Proof.

We have |EµxP t(f)− Eπ(f)| ≤ ‖f‖∞ 2d(t). So any lowerbound on the LHS gives a lower bound on d(t).

If f is an eigenvector of P with eigenvalue λ 6= 1, we know twothings:

1 Eπ(f) = πf = πPf = λπf = λEπ(f) =⇒ Eπ(f) = 0.

2 EµxP t(f) = µxPtf = µxλ

tf = λtf(x).

So |λtf(x)| ≤ ‖f‖∞ 2d(t) for any x and eigenvalue λ 6= 1.

16 / 25

Page 62: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Proof.

Optimizing over x and λ gives λt∗2 ≤ d(t).

Setting the LHS to be at least ε and solving for t yields

tmix(ε) ≥(

1

1− λ∗− 1

)log

1

2ε.

This can be understood as a “first moment” bound, i.e.,relying only on expectations. If variances are computable,better bounds sometimes exist.

17 / 25

Page 63: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Proof.

Optimizing over x and λ gives λt∗2 ≤ d(t).

Setting the LHS to be at least ε and solving for t yields

tmix(ε) ≥(

1

1− λ∗− 1

)log

1

2ε.

This can be understood as a “first moment” bound, i.e.,relying only on expectations. If variances are computable,better bounds sometimes exist.

17 / 25

Page 64: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time below with λ∗

Proof.

Optimizing over x and λ gives λt∗2 ≤ d(t).

Setting the LHS to be at least ε and solving for t yields

tmix(ε) ≥(

1

1− λ∗− 1

)log

1

2ε.

This can be understood as a “first moment” bound, i.e.,relying only on expectations. If variances are computable,better bounds sometimes exist.

17 / 25

Page 65: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time above with λ∗

Theorem (Spectral Upper Bound)

P as before, ε > 0:

tmix(ε) ≤1

1− λ∗log

1

επmin

Proof.

A bit more technical, uses the diagonalization of P .

18 / 25

Page 66: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Bounding mixing time above with λ∗

Theorem (Spectral Upper Bound)

P as before, ε > 0:

tmix(ε) ≤1

1− λ∗log

1

επmin

Proof.

A bit more technical, uses the diagonalization of P .

18 / 25

Page 67: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Remarks

(1

1− λ∗− 1

)log

1

2ε≤ tmix(ε) ≤

1

1− λ∗log

1

επmin

For a fixed chain, these bounds are quite tight...

But common to have |X | = n and n→∞. Here, you paya price for the log 1

πmin.

In many chains like this, a cutoff phenomenon is observed:as n→∞, d(t) approaches a step function which jumpsfrom 1 (completely unmixed) to 0 (completely mixed) at acritical threshold t∗ = t∗(n).

19 / 25

Page 68: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Remarks

(1

1− λ∗− 1

)log

1

2ε≤ tmix(ε) ≤

1

1− λ∗log

1

επmin

For a fixed chain, these bounds are quite tight...

But common to have |X | = n and n→∞. Here, you paya price for the log 1

πmin.

In many chains like this, a cutoff phenomenon is observed:as n→∞, d(t) approaches a step function which jumpsfrom 1 (completely unmixed) to 0 (completely mixed) at acritical threshold t∗ = t∗(n).

19 / 25

Page 69: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Remarks

(1

1− λ∗− 1

)log

1

2ε≤ tmix(ε) ≤

1

1− λ∗log

1

επmin

For a fixed chain, these bounds are quite tight...

But common to have |X | = n and n→∞.

Here, you paya price for the log 1

πmin.

In many chains like this, a cutoff phenomenon is observed:as n→∞, d(t) approaches a step function which jumpsfrom 1 (completely unmixed) to 0 (completely mixed) at acritical threshold t∗ = t∗(n).

19 / 25

Page 70: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Remarks

(1

1− λ∗− 1

)log

1

2ε≤ tmix(ε) ≤

1

1− λ∗log

1

επmin

For a fixed chain, these bounds are quite tight...

But common to have |X | = n and n→∞. Here, you paya price for the log 1

πmin.

In many chains like this, a cutoff phenomenon is observed:as n→∞, d(t) approaches a step function which jumpsfrom 1 (completely unmixed) to 0 (completely mixed) at acritical threshold t∗ = t∗(n).

19 / 25

Page 71: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Remarks

(1

1− λ∗− 1

)log

1

2ε≤ tmix(ε) ≤

1

1− λ∗log

1

επmin

For a fixed chain, these bounds are quite tight...

But common to have |X | = n and n→∞. Here, you paya price for the log 1

πmin.

In many chains like this, a cutoff phenomenon is observed:as n→∞, d(t) approaches a step function which jumpsfrom 1 (completely unmixed) to 0 (completely mixed) at acritical threshold t∗ = t∗(n).

19 / 25

Page 72: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

The Cycle

Random walk on the (odd) n-cycle has eigenvalues

cos 2πjn

n−12

j=0 .

So λ∗ = cos 2πn = 1− 2π2

n2 +O(n−4).

Since stationary is uniform, our bounds give

π2n2

2log

1

2ε. tmix(ε) .

π2n2

2log

n

ε

20 / 25

Page 73: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

The Cycle

Random walk on the (odd) n-cycle has eigenvalues

cos 2πjn

n−12

j=0 . So λ∗ = cos 2πn = 1− 2π2

n2 +O(n−4).

Since stationary is uniform, our bounds give

π2n2

2log

1

2ε. tmix(ε) .

π2n2

2log

n

ε

20 / 25

Page 74: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

The Cycle

Random walk on the (odd) n-cycle has eigenvalues

cos 2πjn

n−12

j=0 . So λ∗ = cos 2πn = 1− 2π2

n2 +O(n−4).

Since stationary is uniform, our bounds give

π2n2

2log

1

2ε. tmix(ε) .

π2n2

2log

n

ε20 / 25

Page 75: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Card Shuffling

Think of Sn as set of orderings of an n-card deck, laidside-by-side on a table.

Consider the Markov chain on Sn obtained by iterating thefollowing rule: pick a random pair of cards and transpose them.

Theorem (Diaconis & Shashashani ’81)

For this chain, for any ε > 0

tmix(ε) ∼1

2n log n

(independent of ε).

21 / 25

Page 76: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Card Shuffling

Think of Sn as set of orderings of an n-card deck, laidside-by-side on a table.

Consider the Markov chain on Sn obtained by iterating thefollowing rule: pick a random pair of cards and transpose them.

Theorem (Diaconis & Shashashani ’81)

For this chain, for any ε > 0

tmix(ε) ∼1

2n log n

(independent of ε).

21 / 25

Page 77: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

Card Shuffling

Think of Sn as set of orderings of an n-card deck, laidside-by-side on a table.

Consider the Markov chain on Sn obtained by iterating thefollowing rule: pick a random pair of cards and transpose them.

Theorem (Diaconis & Shashashani ’81)

For this chain, for any ε > 0

tmix(ε) ∼1

2n log n

(independent of ε).

21 / 25

Page 78: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

A combinatorial optimization problem

Let P be reversible with respect to π.

Challenge: pick a function f ∈ ±1|X | minimizing E(f)

‖f‖22, where

E(f) :=1

2

∑x,y∈X

[f(x)− f(y)]2π(x)P (x, y) = 〈(I − P )f, f〉π

s.t. f · 1 = 0.

If we identify P with it’s edge weighted graph GP , this isequivalent to finding a balanced labeling of the vertices of GPwith ±1 minimizing the above.

22 / 25

Page 79: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

A combinatorial optimization problem

Let P be reversible with respect to π.

Challenge: pick a function f ∈ ±1|X | minimizing E(f)

‖f‖22, where

E(f) :=1

2

∑x,y∈X

[f(x)− f(y)]2π(x)P (x, y) = 〈(I − P )f, f〉π

s.t. f · 1 = 0.

If we identify P with it’s edge weighted graph GP , this isequivalent to finding a balanced labeling of the vertices of GPwith ±1 minimizing the above.

22 / 25

Page 80: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

A combinatorial optimization problem

Let P be reversible with respect to π.

Challenge: pick a function f ∈ ±1|X | minimizing E(f)

‖f‖22, where

E(f) :=1

2

∑x,y∈X

[f(x)− f(y)]2π(x)P (x, y)

= 〈(I − P )f, f〉π

s.t. f · 1 = 0.

If we identify P with it’s edge weighted graph GP , this isequivalent to finding a balanced labeling of the vertices of GPwith ±1 minimizing the above.

22 / 25

Page 81: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

A combinatorial optimization problem

Let P be reversible with respect to π.

Challenge: pick a function f ∈ ±1|X | minimizing E(f)

‖f‖22, where

E(f) :=1

2

∑x,y∈X

[f(x)− f(y)]2π(x)P (x, y) = 〈(I − P )f, f〉π

s.t. f · 1 = 0.

If we identify P with it’s edge weighted graph GP , this isequivalent to finding a balanced labeling of the vertices of GPwith ±1 minimizing the above.

22 / 25

Page 82: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

A combinatorial optimization problem

Let P be reversible with respect to π.

Challenge: pick a function f ∈ ±1|X | minimizing E(f)

‖f‖22, where

E(f) :=1

2

∑x,y∈X

[f(x)− f(y)]2π(x)P (x, y) = 〈(I − P )f, f〉π

s.t. f · 1 = 0.

If we identify P with it’s edge weighted graph GP , this isequivalent to finding a balanced labeling of the vertices of GPwith ±1 minimizing the above.

22 / 25

Page 83: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

A combinatorial optimization problem

Let P be reversible with respect to π.

Challenge: pick a function f ∈ ±1|X | minimizing E(f)

‖f‖22, where

E(f) :=1

2

∑x,y∈X

[f(x)− f(y)]2π(x)P (x, y) = 〈(I − P )f, f〉π

s.t. f · 1 = 0.

If we identify P with it’s edge weighted graph GP , this isequivalent to finding a balanced labeling of the vertices of GPwith ±1 minimizing the above.

22 / 25

Page 84: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

The Dirichlet Energy

Minimizing E(f) is analogous to the problem of minimizing

∫Ω‖∇u‖2 dx

over u : Ω ⊆ Rn → R s.t. some boundary conditions.

To solve the continuous version, one solves Laplace’s Equation∆u = 0.

23 / 25

Page 85: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

The Dirichlet Energy

Minimizing E(f) is analogous to the problem of minimizing∫Ω‖∇u‖2 dx

over u : Ω ⊆ Rn → R s.t. some boundary conditions.

To solve the continuous version, one solves Laplace’s Equation∆u = 0.

23 / 25

Page 86: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

The Dirichlet Energy

Minimizing E(f) is analogous to the problem of minimizing∫Ω‖∇u‖2 dx

over u : Ω ⊆ Rn → R s.t. some boundary conditions.

To solve the continuous version, one solves Laplace’s Equation∆u = 0.

23 / 25

Page 87: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

The (Discrete) Dirichlet Energy

We can relax our combinatorial problem to minimizing E(f)

‖f‖22over any f ∈ R|X | s.t. 〈f,1〉π = 0 (and f 6= 0).

Theorem

Let P have eigenvalues λ1 ≥ λ2 ≥ · · · ≥ λ|X | with eigenvectorsf1, f2, . . . , f|X |. The above optimization problem is solved bytaking f = 1− f2, and thus has minimum value γ = 1− λ2.

24 / 25

Page 88: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

The (Discrete) Dirichlet Energy

We can relax our combinatorial problem to minimizing E(f)

‖f‖22over any f ∈ R|X | s.t. 〈f,1〉π = 0 (and f 6= 0).

Theorem

Let P have eigenvalues λ1 ≥ λ2 ≥ · · · ≥ λ|X | with eigenvectorsf1, f2, . . . , f|X |. The above optimization problem is solved bytaking f = 1− f2, and thus has minimum value γ = 1− λ2.

24 / 25

Page 89: Spectral Theory of Finite Markov Chainsaeide2/talks/SpectralMarkovTalk.pdf · Finite Markov Chains Austin Eide Preliminaries Spectral Rep. Chains Examples Intuition: the Dirichlet

SpectralTheory of

Finite MarkovChains

Austin Eide

Preliminaries

Spectral Rep.Chains

Examples

Intuition: theDirichletEnergy

References

References

1 Markov Chains and Mixing Times. Levin, D., and Peres,Y.

25 / 25


Recommended