+ All Categories
Home > Documents > ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations...

ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations...

Date post: 19-Oct-2020
Category:
Upload: others
View: 4 times
Download: 0 times
Share this document with a friend
89
ON SOME FACTORIZATIONS OF RANDOM WORDS PHILIPPE CHASSAING INSTITUT ELIE CARTAN & ELAHE ZOHOORIAN-AZAD DAMGHAN UNIVERSITY Maresias, AofA’08
Transcript
Page 1: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

ON SOME FACTORIZATIONS OF

RANDOM WORDS

PHILIPPE CHASSAING INSTITUT ELIE CARTAN

&

ELAHE ZOHOORIAN-AZADDAMGHAN UNIVERSITY

Maresias, AofA’08

Page 2: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GLOSSARY

Alphabet

n-letters long words

Language

U is a factor of w

U is a Prefix of w

U is a Suffix of wRotation

Necklace, circular word

Primitive word

Page 3: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GLOSSARY

Alphabet

n-letters long words

Language

U is a factor of w

U is a Prefix of w

U is a Suffix of wRotation

Necklace, circular word

Primitive word

Page 4: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GLOSSARY

Alphabet

n-letters long words

Language

U is a factor of w

U is a Prefix of w

U is a Suffix of wRotation

Necklace, circular word

Primitive word

Page 5: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GLOSSARY

Alphabet

n-letters long words

Language

U is a factor of w

U is a Prefix of w

U is a Suffix of wRotation

Necklace, circular word

Primitive word

Page 6: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GLOSSARY

Alphabet

n-letters long words

Language

U is a factor of w

U is a Prefix of w

U is a Suffix of wRotation

Necklace, circular word

Primitive word

Page 7: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

LYNDON WORDS

Lexicographic Order

Page 8: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

LYNDON WORDS

Lexicographic Order

Page 9: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

LYNDON WORDS

Lexicographic Order

w is a Lyndon word if w is primitive, and is the

smallest word in its necklace

Page 10: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

LYNDON WORDS

Lexicographic Order

w is a Lyndon word if w is primitive, and is the

smallest word in its necklace

cbaa, baac, aacb, acba: ! aacb is a Lyndon word,

Page 11: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

LYNDON WORDS

Lexicographic Order

w is a Lyndon word if w is primitive, and is the

smallest word in its necklace

cbaa, baac, aacb, acba: ! aacb is a Lyndon word,

aabaab, baac ! are not

Page 12: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

FACTORIZATIONS

The standard right factor v of a word w is its smallest proper suffix.

Page 13: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

FACTORIZATIONS

The standard right factor v of a word w is its smallest proper suffix.

The related factorization w=uv is often called the standard factorization of w.

Page 14: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

FACTORIZATIONS

The standard right factor v of a word w is its smallest proper suffix.

The related factorization w=uv is often called the standard factorization of w.

w=abaabbabaabb! u=abaabbab! v=aabb

Page 15: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

FACTORIZATIONS

The standard right factor v of a word w is its smallest proper suffix.

The related factorization w=uv is often called the standard factorization of w.

w=abaabbabaabb! u=abaabbab! v=aabb

w=abaabbabaabb! u’=ab! v’=aabbabaabb!! v<v’

Page 16: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

FACTORIZATIONS

The standard right factor v of a word w is its smallest proper suffix.

The related factorization w=uv is often called the standard factorization of w.

w=abaabbabaabb! u=abaabbab! v=aabb

w=abaabbabaabb! u’=ab! v’=aabbabaabb!! v<v’

Theorem (Lyndon, 1954) Any word w may be written uniquely as a non-increasing product of Lyndon words (by iteration of the standard factorization).

Page 17: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

FACTORIZATIONS

The standard right factor v of a word w is its smallest proper suffix.

The related factorization w=uv is often called the standard factorization of w.

w=abaabbabaabb! u=abaabbab! v=aabb

w=abaabbabaabb! u’=ab! v’=aabbabaabb!! v<v’

Theorem (Lyndon, 1954) Any word w may be written uniquely as a non-increasing product of Lyndon words (by iteration of the standard factorization).

Page 18: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

FACTORIZATIONS

The standard right factor v of a word w is its smallest proper suffix.

The related factorization w=uv is often called the standard factorization of w.

w=abaabbabaabb! u=abaabbab! v=aabb

w=abaabbabaabb! u’=ab! v’=aabbabaabb!! v<v’

Theorem (Lyndon, 1954) Any word w may be written uniquely as a non-increasing product of Lyndon words (by iteration of the standard factorization).

Page 19: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

FACTORIZATIONS

The standard right factor v of a word w is its smallest proper suffix.

The related factorization w=uv is often called the standard factorization of w.

w=abaabbabaabb! u=abaabbab! v=aabb

w=abaabbabaabb! u’=ab! v’=aabbabaabb!! v<v’

Theorem (Lyndon, 1954) Any word w may be written uniquely as a non-increasing product of Lyndon words (by iteration of the standard factorization).

Page 20: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

FACTORIZATIONS

The standard right factor v of a word w is its smallest proper suffix.

The related factorization w=uv is often called the standard factorization of w.

w=abaabbabaabb! u=abaabbab! v=aabb

w=abaabbabaabb! u’=ab! v’=aabbabaabb!! v<v’

Theorem (Lyndon, 1954) Any word w may be written uniquely as a non-increasing product of Lyndon words (by iteration of the standard factorization).

Page 21: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

FACTORIZATIONS

The standard right factor v of a word w is its smallest proper suffix.

The related factorization w=uv is often called the standard factorization of w.

w=abaabbabaabb! u=abaabbab! v=aabb

w=abaabbabaabb! u’=ab! v’=aabbabaabb!! v<v’

Theorem (Lyndon, 1954) Any word w may be written uniquely as a non-increasing product of Lyndon words (by iteration of the standard factorization).

Page 22: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

FACTORIZATIONS

The standard right factor v of a word w is its smallest proper suffix.

The related factorization w=uv is often called the standard factorization of w.

w=abaabbabaabb! u=abaabbab! v=aabb

w=abaabbabaabb! u’=ab! v’=aabbabaabb!! v<v’

Theorem (Lyndon, 1954) Any word w may be written uniquely as a non-increasing product of Lyndon words (by iteration of the standard factorization).

Page 23: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

FACTORIZATIONS

The standard right factor v of a word w is its smallest proper suffix.

The related factorization w=uv is often called the standard factorization of w.

w=abaabbabaabb! u=abaabbab! v=aabb

w=abaabbabaabb! u’=ab! v’=aabbabaabb!! v<v’

Theorem (Lyndon, 1954) Any word w may be written uniquely as a non-increasing product of Lyndon words (by iteration of the standard factorization).

The standard factorization of a Lyndon word is the first step in the construction of some basis of the free Lie algebra over A

Page 24: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

PROBABILISTIC MODEL

Page 25: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

PROBABILISTIC MODEL

Page 26: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

PROBABILISTIC MODEL

Page 27: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

PROBABILISTIC MODEL

Page 28: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

PROBABILISTIC MODEL

WLOG, {i | pi>0} has no gaps and contains 1.

Page 29: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

PROFILE OF THE DECOMPOSITION

Page 30: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

PROFILE OF THE DECOMPOSITION

For a word , setN(w)=(Nk(w))k≥1,

in which Nk(w) is the number of k-letters long factors in the Lyndon decomposition of w.

Page 31: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

PROFILE OF THE DECOMPOSITION

For a word , setN(w)=(Nk(w))k≥1,

in which Nk(w) is the number of k-letters long factors in the Lyndon decomposition of w.

Page 32: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

PROFILE OF THE DECOMPOSITION

For a word , setN(w)=(Nk(w))k≥1,

in which Nk(w) is the number of k-letters long factors in the Lyndon decomposition of w.

N=(2,0,0,2,0,0,1,0,0, ... ).

Page 33: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

UNIFORM CASE

In the uniform case (pi=1/q, 1≤i≤q), Diaconis, McGrath and Pitman (Riffle shuffles, cycles, and descents, 1995) give the exact distribution of the profile

N(w)=(Nk(w))k≥1.

Page 34: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

UNIFORM CASE

In the uniform case (pi=1/q, 1≤i≤q), Diaconis, McGrath and Pitman (Riffle shuffles, cycles, and descents, 1995) give the exact distribution of the profile

N(w)=(Nk(w))k≥1.

Page 35: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

UNIFORM CASE

In the uniform case (pi=1/q, 1≤i≤q), Diaconis, McGrath and Pitman (Riffle shuffles, cycles, and descents, 1995) give the exact distribution of the profile

N(w)=(Nk(w))k≥1.

Page 36: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

UNIFORM CASE

In the uniform case (pi=1/q, 1≤i≤q), Diaconis, McGrath and Pitman (Riffle shuffles, cycles, and descents, 1995) give the exact distribution of the profile

N(w)=(Nk(w))k≥1.

in which µ is the Moebius function.

Page 37: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

ASYMPTOTICS

Page 38: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

ASYMPTOTICS

pq,n(ξ) converges, as q grows, to

Page 39: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

ASYMPTOTICS

pq,n(ξ) converges, as q grows, to

Page 40: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

ASYMPTOTICS

pq,n(ξ) converges, as q grows, to

in which Ck(w) is the number of k-cycles in the cycle-decomposition of the n-permutation w, and C(w)=(Ck(w))k≥1.

Page 41: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

ASYMPTOTICS

pq,n(ξ) converges, as q grows, to

in which Ck(w) is the number of k-cycles in the cycle-decomposition of the n-permutation w, and C(w)=(Ck(w))k≥1.

As n grows, pn(.) converges to the law of a sequence of independent Poisson random variables (with respective parameters 1/k for Ck).

Page 42: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

RIFFLE SHUFFLE

Page 43: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

RIFFLE SHUFFLE

Page 44: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

RIFFLE SHUFFLE

Page 45: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

RIFFLE SHUFFLE #2

Page 46: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

RIFFLE SHUFFLE #2

RSa* RSb= RSab

Page 47: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

RIFFLE SHUFFLE #2

RSa* RSb= RSabDoing a b-riffle-shuffle, followed by an independent a-riffle-shuffle, results in an ab-riffle-shuffle (not so obvious ...).

Page 48: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

RIFFLE SHUFFLE #2

RSa* RSb= RSabDoing a b-riffle-shuffle, followed by an independent a-riffle-shuffle, results in an ab-riffle-shuffle (not so obvious ...).

Proof:

Page 49: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

RIFFLE SHUFFLE #2

RSa* RSb= RSabDoing a b-riffle-shuffle, followed by an independent a-riffle-shuffle, results in an ab-riffle-shuffle (not so obvious ...).

Proof:

Let {x} be the fractional part of the real number x.

Page 50: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

RIFFLE SHUFFLE #2

RSa* RSb= RSabDoing a b-riffle-shuffle, followed by an independent a-riffle-shuffle, results in an ab-riffle-shuffle (not so obvious ...).

Proof:

Let {x} be the fractional part of the real number x.

Let U=(Uk)1≤k≤n be n random numbers, uniform on [0,1].

Page 51: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

RIFFLE SHUFFLE #2

RSa* RSb= RSabDoing a b-riffle-shuffle, followed by an independent a-riffle-shuffle, results in an ab-riffle-shuffle (not so obvious ...).

Proof:

Let {x} be the fractional part of the real number x.

Let U=(Uk)1≤k≤n be n random numbers, uniform on [0,1].

Map the rank of {aUi} in {aU} to the rank of Ui in U: this is a realisation of an a-riffle-shuffle.

Page 52: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

RIFFLE SHUFFLE #2

RSa* RSb= RSabDoing a b-riffle-shuffle, followed by an independent a-riffle-shuffle, results in an ab-riffle-shuffle (not so obvious ...).

Proof:

Let {x} be the fractional part of the real number x.

Let U=(Uk)1≤k≤n be n random numbers, uniform on [0,1].

Map the rank of {aUi} in {aU} to the rank of Ui in U: this is a realisation of an a-riffle-shuffle.

{a{bx}}={abx}.

Page 53: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

RIFFLE SHUFFLE #2

RSa* RSb= RSabDoing a b-riffle-shuffle, followed by an independent a-riffle-shuffle, results in an ab-riffle-shuffle (not so obvious ...).

Proof:

Let {x} be the fractional part of the real number x.

Let U=(Uk)1≤k≤n be n random numbers, uniform on [0,1].

Map the rank of {aUi} in {aU} to the rank of Ui in U: this is a realisation of an a-riffle-shuffle.

{a{bx}}={abx}.

{aUi} is random uniform on [0,1] and independent of [aUi].

Page 54: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

RIFFLE SHUFFLE: ASYMPTOTICS

Page 55: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

RIFFLE SHUFFLE: ASYMPTOTICS

Bonus:

! RSq ----> uniform permutation,

leading to the convergence of M=(Mk)k≥1 to a Cauchy distribution, for

! (q,n) ----> + ∞,

in which Mk(w) is the number of cycles with length k in the permutation w.

Page 56: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

RIFFLE SHUFFLE: ASYMPTOTICS

Bonus:

! RSq ----> uniform permutation,

leading to the convergence of M=(Mk)k≥1 to a Cauchy distribution, for

! (q,n) ----> + ∞,

in which Mk(w) is the number of cycles with length k in the permutation w.

Birthday paradox: ! DV(RSq,uniform) =O(n2/2q).

Page 57: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

RIFFLE SHUFFLE: ASYMPTOTICS

Bonus:

! RSq ----> uniform permutation,

leading to the convergence of M=(Mk)k≥1 to a Cauchy distribution, for

! (q,n) ----> + ∞,

in which Mk(w) is the number of cycles with length k in the permutation w.

Birthday paradox: ! DV(RSq,uniform) =O(n2/2q).

Bayer & Diaconis (1992):! DV(RSq,uniform) = O(n3/2/q).

Page 58: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GESSEL’S BIJECTION

Page 59: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GESSEL’S BIJECTION

Correspondance

! {random uniform words from a q-letters alphabet}

! <---->

! {RSq-distributed permutations}

Page 60: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GESSEL’S BIJECTION

Correspondance

! {random uniform words from a q-letters alphabet}

! <---->

! {RSq-distributed permutations}

In which cycles are sent on Lyndon factors with the same length,

Page 61: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GESSEL’S BIJECTION

Correspondance

! {random uniform words from a q-letters alphabet}

! <---->

! {RSq-distributed permutations}

In which cycles are sent on Lyndon factors with the same length,

And the profile of the permutation is sent on N.

Page 62: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

NEXT ...

Page 63: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

NEXT ...

Diaconis et al. gives the asymptotic distribution of the lengths of the shortest factors, while the position of these factors is lost.

Page 64: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

NEXT ...

Diaconis et al. gives the asymptotic distribution of the lengths of the shortest factors, while the position of these factors is lost.

What about the lengths of the longest factors ? the lengths of the last factors ?

Page 65: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

NEXT ...

Diaconis et al. gives the asymptotic distribution of the lengths of the shortest factors, while the position of these factors is lost.

What about the lengths of the longest factors ? the lengths of the last factors ?

More general distribution p=(pi)i≥1 on letters ?

Page 66: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

MAIN RESULT

X(1)X(2)X(3)X(4)X(5)

Page 67: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

MAIN RESULT

X(1)X(2)X(3)X(4)X(5)

X20= (1,1,4,9,5,0,0,...)/20

Xn(k) is the renormalised size of the kth Lyndon factor, starting from the end of the word.

For a general alphabet A={ai}, and a general distribution p=(pi), Xn

converges to a p1-sticky GEM(1).

Page 68: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GEM(1)

U12U (1-U )1.....

Page 69: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GEM(1)

U12U (1-U )1..... U12U (1-U )1.....

Page 70: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GEM(1)

U12U (1-U )1..... U12U (1-U )1..... 2U (1-U )1..... U1

Page 71: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GEM(1)

U12U (1-U )1..... U12U (1-U )1..... 2U (1-U )1..... U12U (1-U )1..... U1

Page 72: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GEM(1)

U12U (1-U )1..... U12U (1-U )1..... 2U (1-U )1..... U12U (1-U )1..... U1..... U12U (1-U )1

Page 73: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GEM(1)

U12U (1-U )1..... U12U (1-U )1..... 2U (1-U )1..... U12U (1-U )1..... U1..... U12U (1-U )1 U12U (1-U )1.....

Page 74: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GEM(1)

Terminology: Griffiths-Engen-McClosey r.v. with parameter 1, size-biased reordering of Poisson-Dirichlet(0,1) (population genetics, etc ...), stickbreaking scheme ...

U12U (1-U )1..... U12U (1-U )1..... 2U (1-U )1..... U12U (1-U )1..... U1..... U12U (1-U )1 U12U (1-U )1.....

Page 75: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GEM(1)

Terminology: Griffiths-Engen-McClosey r.v. with parameter 1, size-biased reordering of Poisson-Dirichlet(0,1) (population genetics, etc ...), stickbreaking scheme ...

The sequence of residual sizes after the kth break, Wk, satisfies ! Wk/Wk-1 are independant and uniform on [0,1].

U12U (1-U )1..... U12U (1-U )1..... 2U (1-U )1..... U12U (1-U )1..... U1..... U12U (1-U )1 U12U (1-U )1.....

Page 76: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GEM(1)

Terminology: Griffiths-Engen-McClosey r.v. with parameter 1, size-biased reordering of Poisson-Dirichlet(0,1) (population genetics, etc ...), stickbreaking scheme ...

The sequence of residual sizes after the kth break, Wk, satisfies ! Wk/Wk-1 are independant and uniform on [0,1].

W0=1

U12U (1-U )1..... U12U (1-U )1..... 2U (1-U )1..... U12U (1-U )1..... U1..... U12U (1-U )1 U12U (1-U )1.....

Page 77: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GEM(1)

Terminology: Griffiths-Engen-McClosey r.v. with parameter 1, size-biased reordering of Poisson-Dirichlet(0,1) (population genetics, etc ...), stickbreaking scheme ...

The sequence of residual sizes after the kth break, Wk, satisfies ! Wk/Wk-1 are independant and uniform on [0,1].

W0=1

The size Xk of the kth piece of the stick is given byXk = Wk-Wk-1= U1 U2 ... Uk-1(1-Uk).

U12U (1-U )1..... U12U (1-U )1..... 2U (1-U )1..... U12U (1-U )1..... U1..... U12U (1-U )1 U12U (1-U )1.....

Page 78: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

GEM(1)

Terminology: Griffiths-Engen-McClosey r.v. with parameter 1, size-biased reordering of Poisson-Dirichlet(0,1) (population genetics, etc ...), stickbreaking scheme ...

The sequence of residual sizes after the kth break, Wk, satisfies ! Wk/Wk-1 are independant and uniform on [0,1].

W0=1

The size Xk of the kth piece of the stick is given byXk = Wk-Wk-1= U1 U2 ... Uk-1(1-Uk).

W=(Wk )k≥0 is a Markov chain with transition kernel! p(x,dy)=1[0,x](y)dy/x.

U12U (1-U )1..... U12U (1-U )1..... 2U (1-U )1..... U12U (1-U )1..... U1..... U12U (1-U )1 U12U (1-U )1.....

Page 79: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

STICKY GEM(1)

The a-sticky GEM(1): the residual size Wk is a Markov chain starting from 1, with transition kernel

U12U (1-U )1.....

Page 80: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

STICKY GEM(1)

The a-sticky GEM(1): the residual size Wk is a Markov chain starting from 1, with transition kernel

! p(x,dy)=1[0,x](y)dy/x,! x≠1,

U12U (1-U )1.....

Page 81: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

STICKY GEM(1)

The a-sticky GEM(1): the residual size Wk is a Markov chain starting from 1, with transition kernel

! p(x,dy)=1[0,x](y)dy/x,! x≠1,

! p(1,dy)=aδ1 +(1-a)1[0,1](y)dy.

U12U (1-U )1.....

Page 82: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

STICKY GEM(1)

The a-sticky GEM(1): the residual size Wk is a Markov chain starting from 1, with transition kernel

! p(x,dy)=1[0,x](y)dy/x,! x≠1,

! p(1,dy)=aδ1 +(1-a)1[0,1](y)dy.

W starts with a sequence of S 1’s, P(S=k)=ak-1(1-a), k≥1, rather than with only W0=1.

U12U (1-U )1.....

Page 83: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

STICKY GEM(1)

The a-sticky GEM(1): the residual size Wk is a Markov chain starting from 1, with transition kernel

! p(x,dy)=1[0,x](y)dy/x,! x≠1,

! p(1,dy)=aδ1 +(1-a)1[0,1](y)dy.

W starts with a sequence of S 1’s, P(S=k)=ak-1(1-a), k≥1, rather than with only W0=1.

X starts with a sequence of T 0’s, P(T=k)=ak(1-a), k≥0, rather than with X0>0.

U12U (1-U )1.....

Page 84: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

STICKBREAKING OCCURENCES

! Xk = U1 U2 ... Uk-1(1-Uk).

Rearranging X=(Xk)k≥0 in decreasing order gives the asymptotic

distributions of the normalised sizes of cycles, or of logarithms of

prime factors of integers, or of degrees of prime factors of

polynomials on finite fields.

U12U (1-U )1.....

Page 85: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

STICKBREAKING OCCURENCES

! Xk = U1 U2 ... Uk-1(1-Uk).

Rearranging X=(Xk)k≥0 in decreasing order gives the asymptotic

distributions of the normalised sizes of cycles, or of logarithms of

prime factors of integers, or of degrees of prime factors of

polynomials on finite fields.

The distribution of max Xk is related to the Dickman function:K. Dickman, On the frequency of numbers containing prime factors of a certain relative magnitude.

Ark. Mat. Astronomi och Fysik 22, 1930, 1-14.

U12U (1-U )1.....

Page 86: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

STICKBREAKING OCCURENCES

! Xk = U1 U2 ... Uk-1(1-Uk).

Rearranging X=(Xk)k≥0 in decreasing order gives the asymptotic

distributions of the normalised sizes of cycles, or of logarithms of

prime factors of integers, or of degrees of prime factors of

polynomials on finite fields.

The distribution of max Xk is related to the Dickman function:K. Dickman, On the frequency of numbers containing prime factors of a certain relative magnitude.

Ark. Mat. Astronomi och Fysik 22, 1930, 1-14.

The normalised size of the longest factor in the Lyndon

decomposition converges to the Dickman distribution, regardless

of p=(pi).

U12U (1-U )1.....

Page 87: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

RELATED RESULTS

X(1)X(2)X(3)X(4)X(5)

Page 88: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

RELATED RESULTS

X(1)X(2)X(3)X(4)X(5)

D. Bayer & P. Diaconis, Trailing the Dovetail Shuffle to Its Lair, Ann. Appl. Probability 2, 294-313, 1992.

P. Diaconis, M.J. McGrath & J. Pitman, Riffle shuffles, cycles, and descents, Combinatorica, 15, no. 1, 11-29, 1995.

F. Bassino, J. Clément & C. Nicaud, The standard factorization of Lyndon words: an average point of view, Discrete Mathematics, 290, 1-25, 2005.

R. Marchand & E. Zohoorian-Azad, Limit law of the length of the standard right factor of a Lyndon word, Combinatorics, Probability and Computing, 16, 417-434, 2007.

Page 89: ON SOME FACTORIZATIONS OF RANDOM WORDScris/AofA2008/slides/chassaing.pdf · on some factorizations of random words philippe chassaing institut elie cartan & elahe zohoorian-azad damghan

PROOF OF THE MAIN RESULT

EXERCISES 1 & 2 ???


Recommended