+ All Categories
Home > Documents > Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps...

Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps...

Date post: 07-Jul-2020
Category:
Upload: others
View: 12 times
Download: 0 times
Share this document with a friend
162
Submodular Functions, Optimization, and Applications to Machine Learning — Spring Quarter, Lecture 8 — http://www.ee.washington.edu/people/faculty/bilmes/classes/ee596b_spring_2016/ Prof. Jeff Bilmes University of Washington, Seattle Department of Electrical Engineering http://melodi.ee.washington.edu/ ~ bilmes Apr 25th, 2016 + f (A)+ f (B) f (A B) =f (Ar ) + f ( C ) + f ( B r ) =f (AB) f (A B) =f (Ar ) +2 f ( C ) + f ( B r ) Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F1/40 (pg.1/162)
Transcript
Page 1: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Submodular Functions, Optimization,and Applications to Machine Learning

— Spring Quarter, Lecture 8 —http://www.ee.washington.edu/people/faculty/bilmes/classes/ee596b_spring_2016/

Prof. Jeff Bilmes

University of Washington, SeattleDepartment of Electrical Engineering

http://melodi.ee.washington.edu/~bilmes

Apr 25th, 2016

+f (A) + f (B) f (A ∪ B)

= f (Ar ) +f (C ) + f (Br )

≥= f (A ∩ B)

f (A ∩ B)

= f (Ar ) + 2f (C ) + f (Br )

Clockwise from top left:vLásló Lovász

Jack EdmondsSatoru Fujishige

George NemhauserLaurence Wolsey

András FrankLloyd ShapleyH. NarayananRobert Bixby

William CunninghamWilliam TutteRichard Rado

Alexander SchrijverGarrett Birkho�Hassler Whitney

Richard Dedekind

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F1/40 (pg.1/162)

Page 2: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Logistics Review

Cumulative Outstanding Reading

Read chapters 2 and 3 from Fujishige’s book.

Read chapter 1 from Fujishige’s book.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F2/40 (pg.2/162)

Page 3: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Logistics Review

Announcements, Assignments, and Reminders

Homework 3, available at our assignment dropbox(https://canvas.uw.edu/courses/1039754/assignments), due(electronically) Monday (5/2) at 11:55pm.

Homework 2, available at our assignment dropbox(https://canvas.uw.edu/courses/1039754/assignments), due(electronically) Monday (4/18) at 11:55pm.

Homework 1, available at our assignment dropbox(https://canvas.uw.edu/courses/1039754/assignments), due(electronically) Friday (4/8) at 11:55pm.

Weekly Office Hours: Mondays, 3:30-4:30, or by skype or googlehangout (set up meeting via our our discussion board (https://canvas.uw.edu/courses/1039754/discussion_topics)).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F3/40 (pg.3/162)

Page 4: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Logistics Review

Class Road Map - IT-I

L1(3/28): Motivation, Applications, &Basic Definitions

L2(3/30): Machine Learning Apps(diversity, complexity, parameter, learningtarget, surrogate).

L3(4/4): Info theory exs, more apps,definitions, graph/combinatorial examples,matrix rank example, visualization

L4(4/6): Graph and CombinatorialExamples, matrix rank, Venn diagrams,examples of proofs of submodularity, someuseful properties

L5(4/11): Examples & Properties, OtherDefs., Independence

L6(4/13): Independence, Matroids,Matroid Examples, matroid rank issubmodular

L7(4/18): Matroid Rank, More onPartition Matroid, System of DistinctReps, Transversals, Transversal Matroid,

L8(4/20): Transversals, Matroid andrepresentation, Dual Matroids, Geometries

L9(4/25):

L10(4/27):

L11(5/2):

L12(5/4):

L13(5/9):

L14(5/11):

L15(5/16):

L16(5/18):

L17(5/23):

L18(5/25):

L19(6/1):

L20(6/6): Final Presentationsmaximization.

Finals Week: June 6th-10th, 2016.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F4/40 (pg.4/162)

Page 5: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Logistics Review

System of Distinct Representatives

Let (V,V) be a set system (i.e., V = (Vk : i ∈ I) where Vi ⊆ V for alli), and I is an index set. Hence, |I| = |V|.A family (vi : i ∈ I) with vi ∈ V is said to be a system of distinctrepresentatives of V if ∃ a bijection π : I ↔ I such that vi ∈ Vπ(i) andvi 6= vj for all i 6= j.

In a system of distinct representatives, there is a requirement for therepresentatives to be distinct. We can re-state (and rename) this as a:

Definition 8.2.2 (transversal)

Given a set system (V,V) and index set I for V as defined above, a setT ⊆ V is a transversal of V if there is a bijection π : T ↔ I such that

x ∈ Vπ(x) for all x ∈ T (8.19)

Note that due to π : T ↔ I being a bijection, all of I and T are“covered” (so this makes things distinct automatically).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F5/40 (pg.5/162)

Page 6: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Logistics Review

When do transversals exist?As we saw, a transversal might not always exist. How to tell?

Given a set system (V,V) with V = (Vi : i ∈ I), and Vi ⊆ V for all i.Then, for any J ⊆ I, let

V (J) = ∪j∈JVj (8.19)

so |V (J)| : 2I → Z+ is the set cover func. (we know is submodular).We have

Theorem 8.2.2 (Hall’s theorem)

Given a set system (V,V), the family of subsets V = (Vi : i ∈ I) has atransversal (vi : i ∈ I) iff for all J ⊆ I

|V (J)| ≥ |J | (8.20)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F6/40 (pg.6/162)

Page 7: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Logistics Review

When do transversals exist?As we saw, a transversal might not always exist. How to tell?

Given a set system (V,V) with V = (Vi : i ∈ I), and Vi ⊆ V for all i.Then, for any J ⊆ I, let

V (J) = ∪j∈JVj (8.19)

so |V (J)| : 2I → Z+ is the set cover func. (we know is submodular).Hall’s theorem (∀J ⊆ I, |V (J)| ≥ |J |) as a bipartite graph.

V I

1

2

3

4

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F6/40 (pg.7/162)

Page 8: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Logistics Review

When do transversals exist?As we saw, a transversal might not always exist. How to tell?

Given a set system (V,V) with V = (Vi : i ∈ I), and Vi ⊆ V for all i.Then, for any J ⊆ I, let

V (J) = ∪j∈JVj (8.19)

so |V (J)| : 2I → Z+ is the set cover func. (we know is submodular).Hall’s theorem (∀J ⊆ I, |V (J)| ≥ |J |) as a bipartite graph.

V I

1

2

3

4

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F6/40 (pg.8/162)

Page 9: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Logistics Review

When do transversals exist?As we saw, a transversal might not always exist. How to tell?

Given a set system (V,V) with V = (Vi : i ∈ I), and Vi ⊆ V for all i.Then, for any J ⊆ I, let

V (J) = ∪j∈JVj (8.19)

so |V (J)| : 2I → Z+ is the set cover func. (we know is submodular).Moreover, we have

Theorem 8.2.3 (Rado’s theorem (1942))

If M = (V, r) is a matroid on V with rank function r, then the family ofsubsets (Vi : i ∈ I) of V has a transversal (vi : i ∈ I) that is independent inM iff for all J ⊆ I

r(V (J)) ≥ |J | (8.21)

Note, a transversal T independent in M means that r(T ) = |T |.Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F6/40 (pg.9/162)

Page 10: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Logistics Review

Application’s of Hall’s theorem

Consider a set of jobs I and a set of applicants V to the jobs. If anapplicant v ∈ V is qualified for job i ∈ I, we add edge (v, i) to thebipartite graph G = (V, I, E).

We wish all jobs to be filled, and hence Hall’s condition(∀J ⊆ I, |V (J)| ≥ |J |) is a necessary and sufficient condition for thisto be possible.

Note if |V | = |I|, then Hall’s theorem is the Marriage Theorem(Frobenious 1917), where an edge (v, i) in the graph indicatecompatibility between two individuals v ∈ V and i ∈ I coming fromtwo separate groups V and I.

If ∀J ⊆ I, |V (J)| ≥ |J |, then all individuals in each group can bematched with a compatible mate.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F7/40 (pg.10/162)

Page 11: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Logistics Review

Application’s of Hall’s theorem

Consider a set of jobs I and a set of applicants V to the jobs. If anapplicant v ∈ V is qualified for job i ∈ I, we add edge (v, i) to thebipartite graph G = (V, I, E).

We wish all jobs to be filled, and hence Hall’s condition(∀J ⊆ I, |V (J)| ≥ |J |) is a necessary and sufficient condition for thisto be possible.

Note if |V | = |I|, then Hall’s theorem is the Marriage Theorem(Frobenious 1917), where an edge (v, i) in the graph indicatecompatibility between two individuals v ∈ V and i ∈ I coming fromtwo separate groups V and I.

If ∀J ⊆ I, |V (J)| ≥ |J |, then all individuals in each group can bematched with a compatible mate.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F7/40 (pg.11/162)

Page 12: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Logistics Review

Application’s of Hall’s theorem

Consider a set of jobs I and a set of applicants V to the jobs. If anapplicant v ∈ V is qualified for job i ∈ I, we add edge (v, i) to thebipartite graph G = (V, I, E).

We wish all jobs to be filled, and hence Hall’s condition(∀J ⊆ I, |V (J)| ≥ |J |) is a necessary and sufficient condition for thisto be possible.

Note if |V | = |I|, then Hall’s theorem is the Marriage Theorem(Frobenious 1917), where an edge (v, i) in the graph indicatecompatibility between two individuals v ∈ V and i ∈ I coming fromtwo separate groups V and I.

If ∀J ⊆ I, |V (J)| ≥ |J |, then all individuals in each group can bematched with a compatible mate.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F7/40 (pg.12/162)

Page 13: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Logistics Review

Application’s of Hall’s theorem

Consider a set of jobs I and a set of applicants V to the jobs. If anapplicant v ∈ V is qualified for job i ∈ I, we add edge (v, i) to thebipartite graph G = (V, I, E).

We wish all jobs to be filled, and hence Hall’s condition(∀J ⊆ I, |V (J)| ≥ |J |) is a necessary and sufficient condition for thisto be possible.

Note if |V | = |I|, then Hall’s theorem is the Marriage Theorem(Frobenious 1917), where an edge (v, i) in the graph indicatecompatibility between two individuals v ∈ V and i ∈ I coming fromtwo separate groups V and I.

If ∀J ⊆ I, |V (J)| ≥ |J |, then all individuals in each group can bematched with a compatible mate.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F7/40 (pg.13/162)

Page 14: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Logistics Review

More general conditions for existence of transversals

Theorem 8.2.2 (Polymatroid transversal theorem)

If V = (Vi : i ∈ I) is a finite family of non-empty subsets of V , andf : 2V → Z+ is a non-negative, integral, monotone non-decreasing, andsubmodular function, then V has a system of representatives (vi : i ∈ I)such that

f(∪i∈J{vi}) ≥ |J | for all J ⊆ I (8.19)

if and only if

f(V (J)) ≥ |J | for all J ⊆ I (8.20)

Given Theorem ??, we immediately get Theorem 8.2.2 by takingf(S) = |S| for S ⊆ V .

We get Theorem ?? by taking f(S) = r(S) for S ⊆ V , the rankfunction of the matroid.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F8/40 (pg.14/162)

Page 15: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversal Matroid

Transversals, themselves, define a matroid.

Theorem 8.3.1

If V is a family of finite subsets of a ground set V , then the collection ofpartial transversals of V is the set of independent sets of a matroidM = (V,V) on V .

This means that the transversals of V are the bases of matroid M .

Therefore, all maximal partial transversals of V have the samecardinality!

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F9/40 (pg.15/162)

Page 16: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversal Matroid

Transversals, themselves, define a matroid.

Theorem 8.3.1

If V is a family of finite subsets of a ground set V , then the collection ofpartial transversals of V is the set of independent sets of a matroidM = (V,V) on V .

This means that the transversals of V are the bases of matroid M .

Therefore, all maximal partial transversals of V have the samecardinality!

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F9/40 (pg.16/162)

Page 17: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversal Matroid

Transversals, themselves, define a matroid.

Theorem 8.3.1

If V is a family of finite subsets of a ground set V , then the collection ofpartial transversals of V is the set of independent sets of a matroidM = (V,V) on V .

This means that the transversals of V are the bases of matroid M .

Therefore, all maximal partial transversals of V have the samecardinality!

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F9/40 (pg.17/162)

Page 18: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversals and Bipartite Matchings

Transversals correspond exactly to matchings in bipartite graphs.

Given a set system (V,V), with V = (Vi : i ∈ I), we can define abipartite graph G = (V, I, E) associated with V that has edge set{(v, i) : v ∈ V, i ∈ I, v ∈ Vi}.A matching in this graph is a set of edges no two of which that have acommon endpoint.

In fact, we easily have:

Lemma 8.3.2

A subset T ⊆ V is a partial transversal of V iff there is a matching in(V, I, E) in which every edge has one endpoint in T (T matched into I).

V I

1

2

3

4

V I

1

2

3

4

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F10/40 (pg.18/162)

Page 19: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversals and Bipartite Matchings

Transversals correspond exactly to matchings in bipartite graphs.Given a set system (V,V), with V = (Vi : i ∈ I), we can define abipartite graph G = (V, I, E) associated with V that has edge set{(v, i) : v ∈ V, i ∈ I, v ∈ Vi}.

A matching in this graph is a set of edges no two of which that have acommon endpoint.

In fact, we easily have:

Lemma 8.3.2

A subset T ⊆ V is a partial transversal of V iff there is a matching in(V, I, E) in which every edge has one endpoint in T (T matched into I).

V I

1

2

3

4

V I

1

2

3

4

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F10/40 (pg.19/162)

Page 20: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversals and Bipartite Matchings

Transversals correspond exactly to matchings in bipartite graphs.Given a set system (V,V), with V = (Vi : i ∈ I), we can define abipartite graph G = (V, I, E) associated with V that has edge set{(v, i) : v ∈ V, i ∈ I, v ∈ Vi}.A matching in this graph is a set of edges no two of which that have acommon endpoint.

In fact, we easily have:

Lemma 8.3.2

A subset T ⊆ V is a partial transversal of V iff there is a matching in(V, I, E) in which every edge has one endpoint in T (T matched into I).

V I

1

2

3

4

V I

1

2

3

4

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F10/40 (pg.20/162)

Page 21: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversals and Bipartite Matchings

Transversals correspond exactly to matchings in bipartite graphs.Given a set system (V,V), with V = (Vi : i ∈ I), we can define abipartite graph G = (V, I, E) associated with V that has edge set{(v, i) : v ∈ V, i ∈ I, v ∈ Vi}.A matching in this graph is a set of edges no two of which that have acommon endpoint. In fact, we easily have:

Lemma 8.3.2

A subset T ⊆ V is a partial transversal of V iff there is a matching in(V, I, E) in which every edge has one endpoint in T (T matched into I).

V I

1

2

3

4

V I

1

2

3

4

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F10/40 (pg.21/162)

Page 22: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversals and Bipartite Matchings

Transversals correspond exactly to matchings in bipartite graphs.Given a set system (V,V), with V = (Vi : i ∈ I), we can define abipartite graph G = (V, I, E) associated with V that has edge set{(v, i) : v ∈ V, i ∈ I, v ∈ Vi}.A matching in this graph is a set of edges no two of which that have acommon endpoint. In fact, we easily have:

Lemma 8.3.2

A subset T ⊆ V is a partial transversal of V iff there is a matching in(V, I, E) in which every edge has one endpoint in T (T matched into I).

V I

1

2

3

4

V I

1

2

3

4

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F10/40 (pg.22/162)

Page 23: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Arbitrary Matchings and Matroids?

Are arbitrary matchings matroids?

Consider the following graph (left), and two max-matchings (two rightinstances)

A B

CD

A B

CD

A B

CD

{AC} is a maximum matching, as is {AD,BC}, but they are not thesame size.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F11/40 (pg.23/162)

Page 24: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Arbitrary Matchings and Matroids?

Are arbitrary matchings matroids?

Consider the following graph (left), and two max-matchings (two rightinstances)

A B

CD

A B

CD

A B

CD

{AC} is a maximum matching, as is {AD,BC}, but they are not thesame size.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F11/40 (pg.24/162)

Page 25: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Arbitrary Matchings and Matroids?

Are arbitrary matchings matroids?

Consider the following graph (left), and two max-matchings (two rightinstances)

A B

CD

A B

CD

A B

CD

{AC} is a maximum matching, as is {AD,BC}, but they are not thesame size.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F11/40 (pg.25/162)

Page 26: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Partition Matroid, rank as matching

Example where ` = 5,(k1, k2, k3, k4, k5) =(2, 2, 1, 1, 3).

I1

I2

I3

I4

I5

V1

V2

V3

V4

V5

V IRecall, Γ : 2V → R as the neighborfunction in a bipartite graph, theneighbors of X is defined as Γ(X) ={v ∈ V (G) \X : E(X, {v}) 6= ∅}, andrecall that |Γ(X)| is submodular.

Here, for X ⊆ V , we have Γ(X) ={i ∈ I : (v, i) ∈ E(G) and v ∈ X}.For such a constructed bipartite graph,the rank function of a partition matroidis r(X) =

∑`i=1 min(|X ∩ Vi|, ki) = the

maximum matching involving X.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F12/40 (pg.26/162)

Page 27: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Morphing Partition Matroid Rank

Recall the partition matroid rank function. Note, ki = |Ii| in the bipartitegraph representation, and since a matroid, w.l.o.g., |Vi| ≥ ki (also, recall,V (J) = ∪j∈JVj).

Start with partition matroid rank function in the subsequent equations.

r(A) =∑

i∈{1,...,`}

min(|A ∩ Vi|, ki) (8.1)

=∑̀i=1

min(|A ∩ V (Ii)|, |Ii|) (8.2)

=∑

i∈{1,...,`}

minJi∈{∅,Ii}

({|A ∩ V (Ii)| if Ji 6= ∅

0 if Ji = ∅

}+ |Ii \ Ji|

)(8.3)

=∑

i∈{1,...,`}

minJi⊆Ii

({|A ∩ V (Ii)| if Ji 6= ∅

0 if Ji = ∅

}+ |Ii \ Ji|

)(8.4)

=∑

i∈{1,...,`}

minJi⊆Ii

(|V (Ji) ∩A|+ |Ii \ Ji|) (8.5)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F13/40 (pg.27/162)

Page 28: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Morphing Partition Matroid Rank

Recall the partition matroid rank function. Note, ki = |Ii| in the bipartitegraph representation, and since a matroid, w.l.o.g., |Vi| ≥ ki (also, recall,V (J) = ∪j∈JVj).Start with partition matroid rank function in the subsequent equations.

r(A) =∑

i∈{1,...,`}

min(|A ∩ Vi|, ki) (8.1)

=∑̀i=1

min(|A ∩ V (Ii)|, |Ii|) (8.2)

=∑

i∈{1,...,`}

minJi∈{∅,Ii}

({|A ∩ V (Ii)| if Ji 6= ∅

0 if Ji = ∅

}+ |Ii \ Ji|

)(8.3)

=∑

i∈{1,...,`}

minJi⊆Ii

({|A ∩ V (Ii)| if Ji 6= ∅

0 if Ji = ∅

}+ |Ii \ Ji|

)(8.4)

=∑

i∈{1,...,`}

minJi⊆Ii

(|V (Ji) ∩A|+ |Ii \ Ji|) (8.5)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F13/40 (pg.28/162)

Page 29: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Morphing Partition Matroid Rank

Recall the partition matroid rank function. Note, ki = |Ii| in the bipartitegraph representation, and since a matroid, w.l.o.g., |Vi| ≥ ki (also, recall,V (J) = ∪j∈JVj).Start with partition matroid rank function in the subsequent equations.

r(A) =∑

i∈{1,...,`}

min(|A ∩ Vi|, ki) (8.1)

=∑̀i=1

min(|A ∩ V (Ii)|, |Ii|) (8.2)

=∑

i∈{1,...,`}

minJi∈{∅,Ii}

({|A ∩ V (Ii)| if Ji 6= ∅

0 if Ji = ∅

}+ |Ii \ Ji|

)(8.3)

=∑

i∈{1,...,`}

minJi⊆Ii

({|A ∩ V (Ii)| if Ji 6= ∅

0 if Ji = ∅

}+ |Ii \ Ji|

)(8.4)

=∑

i∈{1,...,`}

minJi⊆Ii

(|V (Ji) ∩A|+ |Ii \ Ji|) (8.5)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F13/40 (pg.29/162)

Page 30: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Morphing Partition Matroid Rank

Recall the partition matroid rank function. Note, ki = |Ii| in the bipartitegraph representation, and since a matroid, w.l.o.g., |Vi| ≥ ki (also, recall,V (J) = ∪j∈JVj).Start with partition matroid rank function in the subsequent equations.

r(A) =∑

i∈{1,...,`}

min(|A ∩ Vi|, ki) (8.1)

=∑̀i=1

min(|A ∩ V (Ii)|, |Ii|) (8.2)

=∑

i∈{1,...,`}

minJi∈{∅,Ii}

({|A ∩ V (Ii)| if Ji 6= ∅

0 if Ji = ∅

}+ |Ii \ Ji|

)(8.3)

=∑

i∈{1,...,`}

minJi⊆Ii

({|A ∩ V (Ii)| if Ji 6= ∅

0 if Ji = ∅

}+ |Ii \ Ji|

)(8.4)

=∑

i∈{1,...,`}

minJi⊆Ii

(|V (Ji) ∩A|+ |Ii \ Ji|) (8.5)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F13/40 (pg.30/162)

Page 31: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Morphing Partition Matroid Rank

Recall the partition matroid rank function. Note, ki = |Ii| in the bipartitegraph representation, and since a matroid, w.l.o.g., |Vi| ≥ ki (also, recall,V (J) = ∪j∈JVj).Start with partition matroid rank function in the subsequent equations.

r(A) =∑

i∈{1,...,`}

min(|A ∩ Vi|, ki) (8.1)

=∑̀i=1

min(|A ∩ V (Ii)|, |Ii|) (8.2)

=∑

i∈{1,...,`}

minJi∈{∅,Ii}

({|A ∩ V (Ii)| if Ji 6= ∅

0 if Ji = ∅

}+ |Ii \ Ji|

)(8.3)

=∑

i∈{1,...,`}

minJi⊆Ii

({|A ∩ V (Ii)| if Ji 6= ∅

0 if Ji = ∅

}+ |Ii \ Ji|

)(8.4)

=∑

i∈{1,...,`}

minJi⊆Ii

(|V (Ji) ∩A|+ |Ii \ Ji|) (8.5)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F13/40 (pg.31/162)

Page 32: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Morphing Partition Matroid Rank

Recall the partition matroid rank function. Note, ki = |Ii| in the bipartitegraph representation, and since a matroid, w.l.o.g., |Vi| ≥ ki (also, recall,V (J) = ∪j∈JVj).Start with partition matroid rank function in the subsequent equations.

r(A) =∑

i∈{1,...,`}

min(|A ∩ Vi|, ki) (8.1)

=∑̀i=1

min(|A ∩ V (Ii)|, |Ii|) (8.2)

=∑

i∈{1,...,`}

minJi∈{∅,Ii}

({|A ∩ V (Ii)| if Ji 6= ∅

0 if Ji = ∅

}+ |Ii \ Ji|

)(8.3)

=∑

i∈{1,...,`}

minJi⊆Ii

({|A ∩ V (Ii)| if Ji 6= ∅

0 if Ji = ∅

}+ |Ii \ Ji|

)(8.4)

=∑

i∈{1,...,`}

minJi⊆Ii

(|V (Ji) ∩A|+ |Ii \ Ji|) (8.5)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F13/40 (pg.32/162)

Page 33: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

... Morphing Partition Matroid Rank

Continuing,

r(A) =∑̀i=1

minJi⊆Ii

(|V (Ji) ∩ V (Ii) ∩A| − |Ii ∩ Ji|+ |Ii|) (8.6)

= minJ⊆I

(∑̀i=1

|V (J) ∩ V (Ii) ∩A| − |Ii ∩ J |+ |Ii|)

(8.7)

= minJ⊆I

(|V (J) ∩ V (I) ∩A| − |J |+ |I|) (8.8)

= minJ⊆I

(|V (J) ∩A| − |J |+ |I|) (8.9)

In fact, this bottom (more general) expression is the expression for therank of a transversal matroid.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F14/40 (pg.33/162)

Page 34: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

... Morphing Partition Matroid Rank

Continuing,

r(A) =∑̀i=1

minJi⊆Ii

(|V (Ji) ∩ V (Ii) ∩A| − |Ii ∩ Ji|+ |Ii|) (8.6)

= minJ⊆I

(∑̀i=1

|V (J) ∩ V (Ii) ∩A| − |Ii ∩ J |+ |Ii|)

(8.7)

= minJ⊆I

(|V (J) ∩ V (I) ∩A| − |J |+ |I|) (8.8)

= minJ⊆I

(|V (J) ∩A| − |J |+ |I|) (8.9)

In fact, this bottom (more general) expression is the expression for therank of a transversal matroid.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F14/40 (pg.34/162)

Page 35: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

... Morphing Partition Matroid Rank

Continuing,

r(A) =∑̀i=1

minJi⊆Ii

(|V (Ji) ∩ V (Ii) ∩A| − |Ii ∩ Ji|+ |Ii|) (8.6)

= minJ⊆I

(∑̀i=1

|V (J) ∩ V (Ii) ∩A| − |Ii ∩ J |+ |Ii|)

(8.7)

= minJ⊆I

(|V (J) ∩ V (I) ∩A| − |J |+ |I|) (8.8)

= minJ⊆I

(|V (J) ∩A| − |J |+ |I|) (8.9)

In fact, this bottom (more general) expression is the expression for therank of a transversal matroid.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F14/40 (pg.35/162)

Page 36: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

... Morphing Partition Matroid Rank

Continuing,

r(A) =∑̀i=1

minJi⊆Ii

(|V (Ji) ∩ V (Ii) ∩A| − |Ii ∩ Ji|+ |Ii|) (8.6)

= minJ⊆I

(∑̀i=1

|V (J) ∩ V (Ii) ∩A| − |Ii ∩ J |+ |Ii|)

(8.7)

= minJ⊆I

(|V (J) ∩ V (I) ∩A| − |J |+ |I|) (8.8)

= minJ⊆I

(|V (J) ∩A| − |J |+ |I|) (8.9)

In fact, this bottom (more general) expression is the expression for therank of a transversal matroid.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F14/40 (pg.36/162)

Page 37: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

... Morphing Partition Matroid Rank

Continuing,

r(A) =∑̀i=1

minJi⊆Ii

(|V (Ji) ∩ V (Ii) ∩A| − |Ii ∩ Ji|+ |Ii|) (8.6)

= minJ⊆I

(∑̀i=1

|V (J) ∩ V (Ii) ∩A| − |Ii ∩ J |+ |Ii|)

(8.7)

= minJ⊆I

(|V (J) ∩ V (I) ∩A| − |J |+ |I|) (8.8)

= minJ⊆I

(|V (J) ∩A| − |J |+ |I|) (8.9)

In fact, this bottom (more general) expression is the expression for therank of a transversal matroid.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F14/40 (pg.37/162)

Page 38: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Partial Transversals Are Independent Sets in a Matroid

In fact, we have

Theorem 8.3.3

Let (V,V) where V = (V1, V2, . . . , V`) be a subset system. LetI = {1, . . . , `}. Let I be the set of partial transversals of V. Then (V, I) isa matroid.

Proof.

We note that ∅ ∈ I since the empty set is a transversal of the emptysubfamily of V, thus (I1’) holds.

We already saw that if T is a partial transversal of V, and if T ′ ⊆ T ,then T ′ is also a partial transversal. So (I2’) holds.

Suppose that T1 and T2 are partial transversals of V such that|T1| < |T2|. Exercise: show that (I3’) holds.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F15/40 (pg.38/162)

Page 39: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Partial Transversals Are Independent Sets in a Matroid

In fact, we have

Theorem 8.3.3

Let (V,V) where V = (V1, V2, . . . , V`) be a subset system. LetI = {1, . . . , `}. Let I be the set of partial transversals of V. Then (V, I) isa matroid.

Proof.

We note that ∅ ∈ I since the empty set is a transversal of the emptysubfamily of V, thus (I1’) holds.

We already saw that if T is a partial transversal of V, and if T ′ ⊆ T ,then T ′ is also a partial transversal. So (I2’) holds.

Suppose that T1 and T2 are partial transversals of V such that|T1| < |T2|. Exercise: show that (I3’) holds.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F15/40 (pg.39/162)

Page 40: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Partial Transversals Are Independent Sets in a Matroid

In fact, we have

Theorem 8.3.3

Let (V,V) where V = (V1, V2, . . . , V`) be a subset system. LetI = {1, . . . , `}. Let I be the set of partial transversals of V. Then (V, I) isa matroid.

Proof.

We note that ∅ ∈ I since the empty set is a transversal of the emptysubfamily of V, thus (I1’) holds.

We already saw that if T is a partial transversal of V, and if T ′ ⊆ T ,then T ′ is also a partial transversal. So (I2’) holds.

Suppose that T1 and T2 are partial transversals of V such that|T1| < |T2|. Exercise: show that (I3’) holds.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F15/40 (pg.40/162)

Page 41: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Partial Transversals Are Independent Sets in a Matroid

In fact, we have

Theorem 8.3.3

Let (V,V) where V = (V1, V2, . . . , V`) be a subset system. LetI = {1, . . . , `}. Let I be the set of partial transversals of V. Then (V, I) isa matroid.

Proof.

We note that ∅ ∈ I since the empty set is a transversal of the emptysubfamily of V, thus (I1’) holds.

We already saw that if T is a partial transversal of V, and if T ′ ⊆ T ,then T ′ is also a partial transversal. So (I2’) holds.

Suppose that T1 and T2 are partial transversals of V such that|T1| < |T2|. Exercise: show that (I3’) holds.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F15/40 (pg.41/162)

Page 42: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversal Matroid Rank

Transversal matroid has rank

r(A) = minJ⊆I

(|V (J) ∩A| − |J |+ |I|) (8.10)

Therefore, this function is submodular.

Note that it is a minimum over a set of modular functions. Is this truein general?

Exercise:

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F16/40 (pg.42/162)

Page 43: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversal Matroid Rank

Transversal matroid has rank

r(A) = minJ⊆I

(|V (J) ∩A| − |J |+ |I|) (8.10)

Therefore, this function is submodular.

Note that it is a minimum over a set of modular functions. Is this truein general? Exercise:

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F16/40 (pg.43/162)

Page 44: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversal Matroid Rank

Transversal matroid has rank

r(A) = minJ⊆I

(|V (J) ∩A| − |J |+ |I|) (8.10)

Therefore, this function is submodular.

Note that it is a minimum over a set of modular functions. Is this truein general? Exercise:

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F16/40 (pg.44/162)

Page 45: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid loops

A circuit in a matroids is well defined, a subset A ⊆ E is circuit if it isan inclusionwise minimally dependent set (i.e., if r(A) < |A| and forany a ∈ A, r(A \ {a}) = |A| − 1).

There is no reason in a matroid such an A could not consist of a singleelement.

Such an {a} is called a loop.

In a matric (i.e., linear) matroid, the only such loop is the value 0, asall non-zero vectors have rank 1. The 0 can appear > 1 time withdifferent indices, as can a self loop in a graph appear on differentnodes.

Note, we also say that two elements s, t are said to be parallel if {s, t}is a circuit.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F17/40 (pg.45/162)

Page 46: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid loops

A circuit in a matroids is well defined, a subset A ⊆ E is circuit if it isan inclusionwise minimally dependent set (i.e., if r(A) < |A| and forany a ∈ A, r(A \ {a}) = |A| − 1).

There is no reason in a matroid such an A could not consist of a singleelement.

Such an {a} is called a loop.

In a matric (i.e., linear) matroid, the only such loop is the value 0, asall non-zero vectors have rank 1. The 0 can appear > 1 time withdifferent indices, as can a self loop in a graph appear on differentnodes.

Note, we also say that two elements s, t are said to be parallel if {s, t}is a circuit.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F17/40 (pg.46/162)

Page 47: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid loops

A circuit in a matroids is well defined, a subset A ⊆ E is circuit if it isan inclusionwise minimally dependent set (i.e., if r(A) < |A| and forany a ∈ A, r(A \ {a}) = |A| − 1).

There is no reason in a matroid such an A could not consist of a singleelement.

Such an {a} is called a loop.

In a matric (i.e., linear) matroid, the only such loop is the value 0, asall non-zero vectors have rank 1. The 0 can appear > 1 time withdifferent indices, as can a self loop in a graph appear on differentnodes.

Note, we also say that two elements s, t are said to be parallel if {s, t}is a circuit.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F17/40 (pg.47/162)

Page 48: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid loops

A circuit in a matroids is well defined, a subset A ⊆ E is circuit if it isan inclusionwise minimally dependent set (i.e., if r(A) < |A| and forany a ∈ A, r(A \ {a}) = |A| − 1).

There is no reason in a matroid such an A could not consist of a singleelement.

Such an {a} is called a loop.

In a matric (i.e., linear) matroid, the only such loop is the value 0, asall non-zero vectors have rank 1. The 0 can appear > 1 time withdifferent indices, as can a self loop in a graph appear on differentnodes.

Note, we also say that two elements s, t are said to be parallel if {s, t}is a circuit.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F17/40 (pg.48/162)

Page 49: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid loops

A circuit in a matroids is well defined, a subset A ⊆ E is circuit if it isan inclusionwise minimally dependent set (i.e., if r(A) < |A| and forany a ∈ A, r(A \ {a}) = |A| − 1).

There is no reason in a matroid such an A could not consist of a singleelement.

Such an {a} is called a loop.

In a matric (i.e., linear) matroid, the only such loop is the value 0, asall non-zero vectors have rank 1. The 0 can appear > 1 time withdifferent indices, as can a self loop in a graph appear on differentnodes.

Note, we also say that two elements s, t are said to be parallel if {s, t}is a circuit.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F17/40 (pg.49/162)

Page 50: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Representable

Definition 8.4.1 (Matroid isomorphism)

Two matroids M1 and M2 respectively on ground sets V1 and V2 areisomorphic if there is a bijection π : V1 → V2 which preserves independence(equivalently, rank, circuits, and so on).

Let F be any field (such as R, Q, or some finite field F, such as aGalois field GF(p) where p is prime (such as GF(2)), but not Z.Succinctly: A field is a set with +, ∗, closure, associativity,commutativity, and additive and multiplictaive identities and inverses.

We can more generally define matroids on a field.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F18/40 (pg.50/162)

Page 51: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Representable

Definition 8.4.1 (Matroid isomorphism)

Two matroids M1 and M2 respectively on ground sets V1 and V2 areisomorphic if there is a bijection π : V1 → V2 which preserves independence(equivalently, rank, circuits, and so on).

Let F be any field (such as R, Q, or some finite field F, such as aGalois field GF(p) where p is prime (such as GF(2)), but not Z.Succinctly: A field is a set with +, ∗, closure, associativity,commutativity, and additive and multiplictaive identities and inverses.

We can more generally define matroids on a field.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F18/40 (pg.51/162)

Page 52: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Representable

Definition 8.4.1 (Matroid isomorphism)

Two matroids M1 and M2 respectively on ground sets V1 and V2 areisomorphic if there is a bijection π : V1 → V2 which preserves independence(equivalently, rank, circuits, and so on).

Let F be any field (such as R, Q, or some finite field F, such as aGalois field GF(p) where p is prime (such as GF(2)), but not Z.Succinctly: A field is a set with +, ∗, closure, associativity,commutativity, and additive and multiplictaive identities and inverses.

We can more generally define matroids on a field.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F18/40 (pg.52/162)

Page 53: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Representable

Definition 8.4.1 (Matroid isomorphism)

Two matroids M1 and M2 respectively on ground sets V1 and V2 areisomorphic if there is a bijection π : V1 → V2 which preserves independence(equivalently, rank, circuits, and so on).

Let F be any field (such as R, Q, or some finite field F, such as aGalois field GF(p) where p is prime (such as GF(2)), but not Z.Succinctly: A field is a set with +, ∗, closure, associativity,commutativity, and additive and multiplictaive identities and inverses.

We can more generally define matroids on a field.

Definition 8.4.2 (linear matroids on a field)

Let X be an n×m matrix and E = {1, . . . ,m}, where Xij ∈ F for somefield, and let I be the set of subsets of E such that the columns of X arelinearly independent over F.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F18/40 (pg.53/162)

Page 54: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Representable

Definition 8.4.1 (Matroid isomorphism)

Two matroids M1 and M2 respectively on ground sets V1 and V2 areisomorphic if there is a bijection π : V1 → V2 which preserves independence(equivalently, rank, circuits, and so on).

Let F be any field (such as R, Q, or some finite field F, such as aGalois field GF(p) where p is prime (such as GF(2)), but not Z.Succinctly: A field is a set with +, ∗, closure, associativity,commutativity, and additive and multiplictaive identities and inverses.

We can more generally define matroids on a field.

Definition 8.4.3 (representable (as a linear matroid))

Any matroid isomorphic to a linear matroid on a field is called representableover F

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F18/40 (pg.54/162)

Page 55: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Representability of Transversal Matroids

Piff and Welsh in 1970, and Adkin in 1972 proved an importanttheorem about representability of transversal matroids.

In particular:

Theorem 8.4.4

Transversal matroids are representable over all finite fields of sufficientlylarge cardinality, and are representable over any infinite field.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F19/40 (pg.55/162)

Page 56: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Representability of Transversal Matroids

Piff and Welsh in 1970, and Adkin in 1972 proved an importanttheorem about representability of transversal matroids.

In particular:

Theorem 8.4.4

Transversal matroids are representable over all finite fields of sufficientlylarge cardinality, and are representable over any infinite field.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F19/40 (pg.56/162)

Page 57: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Converse: Representability of Transversal Matroids

The converse is not true, however.

Example 8.4.5

Let V = {1, 2, 3, 4, 5, 6} be a ground set and let M = (V, I) be a setsystem where I is all subsets of V of cardinality ≤ 2 except for the pairs{1, 2}, {3, 4}, {5, 6}.

It can be shown that this is a matroid and is representable.

However, this matroid is not isomorphic to any transversal matroid.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F20/40 (pg.57/162)

Page 58: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Converse: Representability of Transversal Matroids

The converse is not true, however.

Example 8.4.5

Let V = {1, 2, 3, 4, 5, 6} be a ground set and let M = (V, I) be a setsystem where I is all subsets of V of cardinality ≤ 2 except for the pairs{1, 2}, {3, 4}, {5, 6}.

It can be shown that this is a matroid and is representable.

However, this matroid is not isomorphic to any transversal matroid.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F20/40 (pg.58/162)

Page 59: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Converse: Representability of Transversal Matroids

The converse is not true, however.

Example 8.4.5

Let V = {1, 2, 3, 4, 5, 6} be a ground set and let M = (V, I) be a setsystem where I is all subsets of V of cardinality ≤ 2 except for the pairs{1, 2}, {3, 4}, {5, 6}.

It can be shown that this is a matroid and is representable.

However, this matroid is not isomorphic to any transversal matroid.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F20/40 (pg.59/162)

Page 60: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Review from Lecture 6

The next frame comes from lecture 6.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F21/40 (pg.60/162)

Page 61: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroids, other definitions using matroid rank r : 2V → Z+

Definition 8.5.3 (closed/flat/subspace)

A subset A ⊆ E is closed (equivalently, a flat or a subspace) of matroid Mif for all x ∈ E \A, r(A ∪ {x}) = r(A) + 1.

Definition: A hyperplane is a flat of rank r(M)− 1.

Definition 8.5.4 (closure)

Given A ⊆ E, the closure (or span) of A, is defined byspan(A) = {b ∈ E : r(A ∪ {b}) = r(A)}.

Therefore, a closed set A has span(A) = A.

Definition 8.5.5 (circuit)

A subset A ⊆ E is circuit or a cycle if it is an inclusionwise-minimaldependent set (i.e., if r(A) < |A| and for any a ∈ A, r(A \ {a}) = |A| − 1).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F22/40 (pg.61/162)

Page 62: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Spanning Sets

We have the following definitions:

Definition 8.5.1 (spanning set of a set)

Given a matroid M = (V, I), and a set Y ⊆ V , then any set X ⊆ Y suchthat r(X) = r(Y ) is called a spanning set of Y .

Definition 8.5.2 (spanning set of a matroid)

Given a matroid M = (V, I), any set A ⊆ V such that r(A) = r(V ) iscalled a spanning set of the matroid.

A base of a matroid is a minimal spanning set (and it is independent)but supersets of a base are also spanning.

V is always trivially spanning.

Consider the terminology: “spanning tree in a graph”, comes fromspanning in a matroid sense.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F23/40 (pg.62/162)

Page 63: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Spanning Sets

We have the following definitions:

Definition 8.5.1 (spanning set of a set)

Given a matroid M = (V, I), and a set Y ⊆ V , then any set X ⊆ Y suchthat r(X) = r(Y ) is called a spanning set of Y .

Definition 8.5.2 (spanning set of a matroid)

Given a matroid M = (V, I), any set A ⊆ V such that r(A) = r(V ) iscalled a spanning set of the matroid.

A base of a matroid is a minimal spanning set (and it is independent)but supersets of a base are also spanning.

V is always trivially spanning.

Consider the terminology: “spanning tree in a graph”, comes fromspanning in a matroid sense.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F23/40 (pg.63/162)

Page 64: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Spanning Sets

We have the following definitions:

Definition 8.5.1 (spanning set of a set)

Given a matroid M = (V, I), and a set Y ⊆ V , then any set X ⊆ Y suchthat r(X) = r(Y ) is called a spanning set of Y .

Definition 8.5.2 (spanning set of a matroid)

Given a matroid M = (V, I), any set A ⊆ V such that r(A) = r(V ) iscalled a spanning set of the matroid.

A base of a matroid is a minimal spanning set (and it is independent)but supersets of a base are also spanning.

V is always trivially spanning.

Consider the terminology: “spanning tree in a graph”, comes fromspanning in a matroid sense.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F23/40 (pg.64/162)

Page 65: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Spanning Sets

We have the following definitions:

Definition 8.5.1 (spanning set of a set)

Given a matroid M = (V, I), and a set Y ⊆ V , then any set X ⊆ Y suchthat r(X) = r(Y ) is called a spanning set of Y .

Definition 8.5.2 (spanning set of a matroid)

Given a matroid M = (V, I), any set A ⊆ V such that r(A) = r(V ) iscalled a spanning set of the matroid.

A base of a matroid is a minimal spanning set (and it is independent)but supersets of a base are also spanning.

V is always trivially spanning.

Consider the terminology: “spanning tree in a graph”, comes fromspanning in a matroid sense.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F23/40 (pg.65/162)

Page 66: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Spanning Sets

We have the following definitions:

Definition 8.5.1 (spanning set of a set)

Given a matroid M = (V, I), and a set Y ⊆ V , then any set X ⊆ Y suchthat r(X) = r(Y ) is called a spanning set of Y .

Definition 8.5.2 (spanning set of a matroid)

Given a matroid M = (V, I), any set A ⊆ V such that r(A) = r(V ) iscalled a spanning set of the matroid.

A base of a matroid is a minimal spanning set (and it is independent)but supersets of a base are also spanning.

V is always trivially spanning.

Consider the terminology: “spanning tree in a graph”, comes fromspanning in a matroid sense.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F23/40 (pg.66/162)

Page 67: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Spanning Sets

We have the following definitions:

Definition 8.5.1 (spanning set of a set)

Given a matroid M = (V, I), and a set Y ⊆ V , then any set X ⊆ Y suchthat r(X) = r(Y ) is called a spanning set of Y .

Definition 8.5.2 (spanning set of a matroid)

Given a matroid M = (V, I), any set A ⊆ V such that r(A) = r(V ) iscalled a spanning set of the matroid.

A base of a matroid is a minimal spanning set (and it is independent)but supersets of a base are also spanning.

V is always trivially spanning.

Consider the terminology: “spanning tree in a graph”, comes fromspanning in a matroid sense.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F23/40 (pg.67/162)

Page 68: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual of a Matroid

Given a matroid M = (V, I), a dual matroid M∗ = (V, I∗) can bedefined on the same ground set V , but using a very different set ofindependent sets I∗.

We define the set of sets I∗ for M∗ as follows:

I∗ = {A ⊆ V : V \A is a spanning set of M} (8.11)

= {V \ S : S ⊆ V is a spanning set of M} (8.12)

i.e., I∗ are complements of spanning sets of M .

That is, a set A is independent in the dual matroid M∗ if removal of Afrom V does not decrease the rank in M :

I∗ = {A ⊆ V : rankM (V \A) = rankM (V )} (8.13)

In other words, a set A ⊆ V is independent in the dual M∗ (i.e.,A ∈ I∗) if its complement is spanning in M (residual V \A mustcontain a base in M).

Dual of the dual: Note, we have that (M∗)∗ = M .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F24/40 (pg.68/162)

Page 69: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual of a Matroid

Given a matroid M = (V, I), a dual matroid M∗ = (V, I∗) can bedefined on the same ground set V , but using a very different set ofindependent sets I∗.We define the set of sets I∗ for M∗ as follows:

I∗ = {A ⊆ V : V \A is a spanning set of M} (8.11)

= {V \ S : S ⊆ V is a spanning set of M} (8.12)

i.e., I∗ are complements of spanning sets of M .

That is, a set A is independent in the dual matroid M∗ if removal of Afrom V does not decrease the rank in M :

I∗ = {A ⊆ V : rankM (V \A) = rankM (V )} (8.13)

In other words, a set A ⊆ V is independent in the dual M∗ (i.e.,A ∈ I∗) if its complement is spanning in M (residual V \A mustcontain a base in M).

Dual of the dual: Note, we have that (M∗)∗ = M .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F24/40 (pg.69/162)

Page 70: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual of a Matroid

Given a matroid M = (V, I), a dual matroid M∗ = (V, I∗) can bedefined on the same ground set V , but using a very different set ofindependent sets I∗.We define the set of sets I∗ for M∗ as follows:

I∗ = {A ⊆ V : V \A is a spanning set of M} (8.11)

= {V \ S : S ⊆ V is a spanning set of M} (8.12)

i.e., I∗ are complements of spanning sets of M .

That is, a set A is independent in the dual matroid M∗ if removal of Afrom V does not decrease the rank in M :

I∗ = {A ⊆ V : rankM (V \A) = rankM (V )} (8.13)

In other words, a set A ⊆ V is independent in the dual M∗ (i.e.,A ∈ I∗) if its complement is spanning in M (residual V \A mustcontain a base in M).

Dual of the dual: Note, we have that (M∗)∗ = M .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F24/40 (pg.70/162)

Page 71: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual of a Matroid

Given a matroid M = (V, I), a dual matroid M∗ = (V, I∗) can bedefined on the same ground set V , but using a very different set ofindependent sets I∗.We define the set of sets I∗ for M∗ as follows:

I∗ = {A ⊆ V : V \A is a spanning set of M} (8.11)

= {V \ S : S ⊆ V is a spanning set of M} (8.12)

i.e., I∗ are complements of spanning sets of M .

That is, a set A is independent in the dual matroid M∗ if removal of Afrom V does not decrease the rank in M :

I∗ = {A ⊆ V : rankM (V \A) = rankM (V )} (8.13)

In other words, a set A ⊆ V is independent in the dual M∗ (i.e.,A ∈ I∗) if its complement is spanning in M (residual V \A mustcontain a base in M).

Dual of the dual: Note, we have that (M∗)∗ = M .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F24/40 (pg.71/162)

Page 72: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual of a Matroid

Given a matroid M = (V, I), a dual matroid M∗ = (V, I∗) can bedefined on the same ground set V , but using a very different set ofindependent sets I∗.We define the set of sets I∗ for M∗ as follows:

I∗ = {A ⊆ V : V \A is a spanning set of M} (8.11)

= {V \ S : S ⊆ V is a spanning set of M} (8.12)

i.e., I∗ are complements of spanning sets of M .

That is, a set A is independent in the dual matroid M∗ if removal of Afrom V does not decrease the rank in M :

I∗ = {A ⊆ V : rankM (V \A) = rankM (V )} (8.13)

In other words, a set A ⊆ V is independent in the dual M∗ (i.e.,A ∈ I∗) if its complement is spanning in M (residual V \A mustcontain a base in M).

Dual of the dual: Note, we have that (M∗)∗ = M .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F24/40 (pg.72/162)

Page 73: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual of a Matroid: Bases

The smallest spanning sets are bases.

Hence, a base B of M (whereB = V \B∗ is as small as possible while still spanning) is thecomplement of a base B∗ of M∗ (where B∗ = V \B is as large aspossible while still being independent).

In fact, we have that

Theorem 8.5.3 (Dual matroid bases)

Let M = (V, I) be a matroid and B(M) be the set of bases of M . Thendefine

B∗(M) = {V \B : B ∈ B(M)}. (8.14)

Then B∗(M) is the set of basis of M∗ (that is, B∗(M) = B(M∗).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F25/40 (pg.73/162)

Page 74: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual of a Matroid: Bases

The smallest spanning sets are bases. Hence, a base B of M (whereB = V \B∗ is as small as possible while still spanning) is thecomplement of a base B∗ of M∗ (where B∗ = V \B is as large aspossible while still being independent).

In fact, we have that

Theorem 8.5.3 (Dual matroid bases)

Let M = (V, I) be a matroid and B(M) be the set of bases of M . Thendefine

B∗(M) = {V \B : B ∈ B(M)}. (8.14)

Then B∗(M) is the set of basis of M∗ (that is, B∗(M) = B(M∗).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F25/40 (pg.74/162)

Page 75: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual of a Matroid: Bases

The smallest spanning sets are bases. Hence, a base B of M (whereB = V \B∗ is as small as possible while still spanning) is thecomplement of a base B∗ of M∗ (where B∗ = V \B is as large aspossible while still being independent).

In fact, we have that

Theorem 8.5.3 (Dual matroid bases)

Let M = (V, I) be a matroid and B(M) be the set of bases of M . Thendefine

B∗(M) = {V \B : B ∈ B(M)}. (8.14)

Then B∗(M) is the set of basis of M∗ (that is, B∗(M) = B(M∗).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F25/40 (pg.75/162)

Page 76: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual of a Matroid: Bases

The smallest spanning sets are bases. Hence, a base B of M (whereB = V \B∗ is as small as possible while still spanning) is thecomplement of a base B∗ of M∗ (where B∗ = V \B is as large aspossible while still being independent).

In fact, we have that

Theorem 8.5.3 (Dual matroid bases)

Let M = (V, I) be a matroid and B(M) be the set of bases of M . Thendefine

B∗(M) = {V \B : B ∈ B(M)}. (8.14)

Then B∗(M) is the set of basis of M∗ (that is, B∗(M) = B(M∗).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F25/40 (pg.76/162)

Page 77: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B∗(M), the bases of M∗, are called cobases of M .

The circuits of M∗ are called cocircuits of M .

The hyperplanes of M∗ are called cohyperplanes of M .

The independent sets of M∗ are called coindependent sets of M .

The spanning sets of M∗ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ⊆ V . Then

1 X is independent in M iff V \X is cospanning in M (spanning in M∗).

2 X is spanning in M iff V \X is coindependent in M (independent inM∗).

3 X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M∗).

4 X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M∗).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.77/162)

Page 78: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B∗(M), the bases of M∗, are called cobases of M .

The circuits of M∗ are called cocircuits of M .

The hyperplanes of M∗ are called cohyperplanes of M .

The independent sets of M∗ are called coindependent sets of M .

The spanning sets of M∗ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ⊆ V . Then

1 X is independent in M iff V \X is cospanning in M (spanning in M∗).

2 X is spanning in M iff V \X is coindependent in M (independent inM∗).

3 X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M∗).

4 X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M∗).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.78/162)

Page 79: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B∗(M), the bases of M∗, are called cobases of M .

The circuits of M∗ are called cocircuits of M .

The hyperplanes of M∗ are called cohyperplanes of M .

The independent sets of M∗ are called coindependent sets of M .

The spanning sets of M∗ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ⊆ V . Then

1 X is independent in M iff V \X is cospanning in M (spanning in M∗).

2 X is spanning in M iff V \X is coindependent in M (independent inM∗).

3 X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M∗).

4 X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M∗).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.79/162)

Page 80: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B∗(M), the bases of M∗, are called cobases of M .

The circuits of M∗ are called cocircuits of M .

The hyperplanes of M∗ are called cohyperplanes of M .

The independent sets of M∗ are called coindependent sets of M .

The spanning sets of M∗ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ⊆ V . Then

1 X is independent in M iff V \X is cospanning in M (spanning in M∗).

2 X is spanning in M iff V \X is coindependent in M (independent inM∗).

3 X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M∗).

4 X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M∗).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.80/162)

Page 81: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B∗(M), the bases of M∗, are called cobases of M .

The circuits of M∗ are called cocircuits of M .

The hyperplanes of M∗ are called cohyperplanes of M .

The independent sets of M∗ are called coindependent sets of M .

The spanning sets of M∗ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ⊆ V . Then

1 X is independent in M iff V \X is cospanning in M (spanning in M∗).

2 X is spanning in M iff V \X is coindependent in M (independent inM∗).

3 X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M∗).

4 X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M∗).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.81/162)

Page 82: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B∗(M), the bases of M∗, are called cobases of M .

The circuits of M∗ are called cocircuits of M .

The hyperplanes of M∗ are called cohyperplanes of M .

The independent sets of M∗ are called coindependent sets of M .

The spanning sets of M∗ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ⊆ V . Then

1 X is independent in M iff V \X is cospanning in M (spanning in M∗).

2 X is spanning in M iff V \X is coindependent in M (independent inM∗).

3 X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M∗).

4 X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M∗).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.82/162)

Page 83: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B∗(M), the bases of M∗, are called cobases of M .

The circuits of M∗ are called cocircuits of M .

The hyperplanes of M∗ are called cohyperplanes of M .

The independent sets of M∗ are called coindependent sets of M .

The spanning sets of M∗ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ⊆ V . Then

1 X is independent in M iff V \X is cospanning in M (spanning in M∗).

2 X is spanning in M iff V \X is coindependent in M (independent inM∗).

3 X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M∗).

4 X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M∗).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.83/162)

Page 84: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B∗(M), the bases of M∗, are called cobases of M .

The circuits of M∗ are called cocircuits of M .

The hyperplanes of M∗ are called cohyperplanes of M .

The independent sets of M∗ are called coindependent sets of M .

The spanning sets of M∗ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ⊆ V . Then

1 X is independent in M iff V \X is cospanning in M (spanning in M∗).

2 X is spanning in M iff V \X is coindependent in M (independent inM∗).

3 X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M∗).

4 X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M∗).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.84/162)

Page 85: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B∗(M), the bases of M∗, are called cobases of M .

The circuits of M∗ are called cocircuits of M .

The hyperplanes of M∗ are called cohyperplanes of M .

The independent sets of M∗ are called coindependent sets of M .

The spanning sets of M∗ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ⊆ V . Then

1 X is independent in M iff V \X is cospanning in M (spanning in M∗).

2 X is spanning in M iff V \X is coindependent in M (independent inM∗).

3 X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M∗).

4 X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M∗).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.85/162)

Page 86: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B∗(M), the bases of M∗, are called cobases of M .

The circuits of M∗ are called cocircuits of M .

The hyperplanes of M∗ are called cohyperplanes of M .

The independent sets of M∗ are called coindependent sets of M .

The spanning sets of M∗ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ⊆ V . Then

1 X is independent in M iff V \X is cospanning in M (spanning in M∗).

2 X is spanning in M iff V \X is coindependent in M (independent inM∗).

3 X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M∗).

4 X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M∗).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.86/162)

Page 87: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example duality: graphic matroid

Using a graphic/cycle matroid, we can already see how dual matroidconcepts demonstrates the extraordinary flexibility and power that amatroid can have.

Recall, in cycle matroid, a spanning set of G is any set of edges that areincident to all nodes (i.e., any superset of a spanning forest), a minimalspanning set is a spanning tree (or forest), and a circuit has a nice visualinterpretation (a cycle in the graph).A cut in a graph G is a set of edges, the removal of which increases thenumber of connected components. I.e., X ⊆ E(G) is a cut in G ifk(G) < k(G \X).A minimal cut in G is a cut X ⊆ E(G) such that X \ {x} is not a cutfor any x ∈ X.A cocycle (cocircuit) in a graphic matroid is a minimal graph cut.A mincut is a circuit in the dual “cocycle” (or “cut”) matroid.All dependent sets in a cocycle matroid are cuts (i.e., a dependent set isa minimal cut or contains one).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F27/40 (pg.87/162)

Page 88: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example duality: graphic matroid

Using a graphic/cycle matroid, we can already see how dual matroidconcepts demonstrates the extraordinary flexibility and power that amatroid can have.Recall, in cycle matroid, a spanning set of G is any set of edges that areincident to all nodes (i.e., any superset of a spanning forest), a minimalspanning set is a spanning tree (or forest), and a circuit has a nice visualinterpretation (a cycle in the graph).

A cut in a graph G is a set of edges, the removal of which increases thenumber of connected components. I.e., X ⊆ E(G) is a cut in G ifk(G) < k(G \X).A minimal cut in G is a cut X ⊆ E(G) such that X \ {x} is not a cutfor any x ∈ X.A cocycle (cocircuit) in a graphic matroid is a minimal graph cut.A mincut is a circuit in the dual “cocycle” (or “cut”) matroid.All dependent sets in a cocycle matroid are cuts (i.e., a dependent set isa minimal cut or contains one).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F27/40 (pg.88/162)

Page 89: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example duality: graphic matroid

Using a graphic/cycle matroid, we can already see how dual matroidconcepts demonstrates the extraordinary flexibility and power that amatroid can have.Recall, in cycle matroid, a spanning set of G is any set of edges that areincident to all nodes (i.e., any superset of a spanning forest), a minimalspanning set is a spanning tree (or forest), and a circuit has a nice visualinterpretation (a cycle in the graph).A cut in a graph G is a set of edges, the removal of which increases thenumber of connected components. I.e., X ⊆ E(G) is a cut in G ifk(G) < k(G \X).

A minimal cut in G is a cut X ⊆ E(G) such that X \ {x} is not a cutfor any x ∈ X.A cocycle (cocircuit) in a graphic matroid is a minimal graph cut.A mincut is a circuit in the dual “cocycle” (or “cut”) matroid.All dependent sets in a cocycle matroid are cuts (i.e., a dependent set isa minimal cut or contains one).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F27/40 (pg.89/162)

Page 90: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example duality: graphic matroid

Using a graphic/cycle matroid, we can already see how dual matroidconcepts demonstrates the extraordinary flexibility and power that amatroid can have.Recall, in cycle matroid, a spanning set of G is any set of edges that areincident to all nodes (i.e., any superset of a spanning forest), a minimalspanning set is a spanning tree (or forest), and a circuit has a nice visualinterpretation (a cycle in the graph).A cut in a graph G is a set of edges, the removal of which increases thenumber of connected components. I.e., X ⊆ E(G) is a cut in G ifk(G) < k(G \X).A minimal cut in G is a cut X ⊆ E(G) such that X \ {x} is not a cutfor any x ∈ X.

A cocycle (cocircuit) in a graphic matroid is a minimal graph cut.A mincut is a circuit in the dual “cocycle” (or “cut”) matroid.All dependent sets in a cocycle matroid are cuts (i.e., a dependent set isa minimal cut or contains one).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F27/40 (pg.90/162)

Page 91: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example duality: graphic matroid

Using a graphic/cycle matroid, we can already see how dual matroidconcepts demonstrates the extraordinary flexibility and power that amatroid can have.Recall, in cycle matroid, a spanning set of G is any set of edges that areincident to all nodes (i.e., any superset of a spanning forest), a minimalspanning set is a spanning tree (or forest), and a circuit has a nice visualinterpretation (a cycle in the graph).A cut in a graph G is a set of edges, the removal of which increases thenumber of connected components. I.e., X ⊆ E(G) is a cut in G ifk(G) < k(G \X).A minimal cut in G is a cut X ⊆ E(G) such that X \ {x} is not a cutfor any x ∈ X.A cocycle (cocircuit) in a graphic matroid is a minimal graph cut.

A mincut is a circuit in the dual “cocycle” (or “cut”) matroid.All dependent sets in a cocycle matroid are cuts (i.e., a dependent set isa minimal cut or contains one).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F27/40 (pg.91/162)

Page 92: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example duality: graphic matroid

Using a graphic/cycle matroid, we can already see how dual matroidconcepts demonstrates the extraordinary flexibility and power that amatroid can have.Recall, in cycle matroid, a spanning set of G is any set of edges that areincident to all nodes (i.e., any superset of a spanning forest), a minimalspanning set is a spanning tree (or forest), and a circuit has a nice visualinterpretation (a cycle in the graph).A cut in a graph G is a set of edges, the removal of which increases thenumber of connected components. I.e., X ⊆ E(G) is a cut in G ifk(G) < k(G \X).A minimal cut in G is a cut X ⊆ E(G) such that X \ {x} is not a cutfor any x ∈ X.A cocycle (cocircuit) in a graphic matroid is a minimal graph cut.A mincut is a circuit in the dual “cocycle” (or “cut”) matroid.

All dependent sets in a cocycle matroid are cuts (i.e., a dependent set isa minimal cut or contains one).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F27/40 (pg.92/162)

Page 93: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example duality: graphic matroid

Using a graphic/cycle matroid, we can already see how dual matroidconcepts demonstrates the extraordinary flexibility and power that amatroid can have.Recall, in cycle matroid, a spanning set of G is any set of edges that areincident to all nodes (i.e., any superset of a spanning forest), a minimalspanning set is a spanning tree (or forest), and a circuit has a nice visualinterpretation (a cycle in the graph).A cut in a graph G is a set of edges, the removal of which increases thenumber of connected components. I.e., X ⊆ E(G) is a cut in G ifk(G) < k(G \X).A minimal cut in G is a cut X ⊆ E(G) such that X \ {x} is not a cutfor any x ∈ X.A cocycle (cocircuit) in a graphic matroid is a minimal graph cut.A mincut is a circuit in the dual “cocycle” (or “cut”) matroid.All dependent sets in a cocycle matroid are cuts (i.e., a dependent set isa minimal cut or contains one).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F27/40 (pg.93/162)

Page 94: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example: cocycle matroid (sometimes “cut matroid”)

The dual of the cycle matroid is called the cocycle matroid. Recall,I∗ = {A ⊆ V : V \A is a spanning set of M}

It consists of all sets of edges the complement of which contains aspanning tree — i.e., an independent set can’t consist of edges that, ifremoved, would render the graph non-spanning.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F28/40 (pg.94/162)

Page 95: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example: cocycle matroid (sometimes “cut matroid”)

The dual of the cycle matroid is called the cocycle matroid. Recall,I∗ = {A ⊆ V : V \A is a spanning set of M}It consists of all sets of edges the complement of which contains aspanning tree — i.e., an independent set can’t consist of edges that, ifremoved, would render the graph non-spanning.

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

A graph G

Minimally spanning in M (and thusa base (maximally independent) in M)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F28/40 (pg.95/162)

Page 96: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example: cocycle matroid (sometimes “cut matroid”)

The dual of the cycle matroid is called the cocycle matroid. Recall,I∗ = {A ⊆ V : V \A is a spanning set of M}It consists of all sets of edges the complement of which contains aspanning tree — i.e., an independent set can’t consist of edges that, ifremoved, would render the graph non-spanning.

Minimally spanning in M (and thusa base (maximally independent) in M)

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Maximally independent in M* (thusa base, minimally spanning, in M*)

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F28/40 (pg.96/162)

Page 97: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example: cocycle matroid (sometimes “cut matroid”)

The dual of the cycle matroid is called the cocycle matroid. Recall,I∗ = {A ⊆ V : V \A is a spanning set of M}It consists of all sets of edges the complement of which contains aspanning tree — i.e., an independent set can’t consist of edges that, ifremoved, would render the graph non-spanning.

Spanning in M, but not a base, andnot independent (has cycles)

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Minimally spanning in M (and thusa base (maximally independent) in M)

Maximally independent in M* (thusa base, minimally spanning, in M*)

Independent in M* (does not contain a cut)

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F28/40 (pg.97/162)

Page 98: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example: cocycle matroid (sometimes “cut matroid”)

The dual of the cycle matroid is called the cocycle matroid. Recall,I∗ = {A ⊆ V : V \A is a spanning set of M}It consists of all sets of edges the complement of which contains aspanning tree — i.e., an independent set can’t consist of edges that, ifremoved, would render the graph non-spanning.

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Independent but not spanning in M, and not closed in M.

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Dependent in M* (contains a cocycle, is a nonminimal cut)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F28/40 (pg.98/162)

Page 99: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example: cocycle matroid (sometimes “cut matroid”)

The dual of the cycle matroid is called the cocycle matroid. Recall,I∗ = {A ⊆ V : V \A is a spanning set of M}It consists of all sets of edges the complement of which contains aspanning tree — i.e., an independent set can’t consist of edges that, ifremoved, would render the graph non-spanning.

Spanning in M, but not a base, andnot independent (has cycles)

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Independent in M* (does not contain a cut)

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F28/40 (pg.99/162)

Page 100: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example: cocycle matroid (sometimes “cut matroid”)

The dual of the cycle matroid is called the cocycle matroid. Recall,I∗ = {A ⊆ V : V \A is a spanning set of M}It consists of all sets of edges the complement of which contains aspanning tree — i.e., an independent set can’t consist of edges that, ifremoved, would render the graph non-spanning.

Independent but not spanning in M, and not closed in M.

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Dependent in M* (contains a cocycle, is a nonminimal cut)

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F28/40 (pg.100/162)

Page 101: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example: cocycle matroid (sometimes “cut matroid”)

The dual of the cycle matroid is called the cocycle matroid. Recall,I∗ = {A ⊆ V : V \A is a spanning set of M}It consists of all sets of edges the complement of which contains aspanning tree — i.e., an independent set can’t consist of edges that, ifremoved, would render the graph non-spanning.

A hyperplane in M, dependent but not spanning in M

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

A cycle in M* (minimally dependentin M*, a cocycle, or a minimal cut)

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F28/40 (pg.101/162)

Page 102: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example: cocycle matroid (sometimes “cut matroid”)

The dual of the cycle matroid is called the cocycle matroid. Recall,I∗ = {A ⊆ V : V \A is a spanning set of M}It consists of all sets of edges the complement of which contains aspanning tree — i.e., an independent set can’t consist of edges that, ifremoved, would render the graph non-spanning.

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

A hyperplane in M, dependentbut not spanning in M

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

A cycle in M* (minimally dependentin M*, a cocycle, or a minimal cut)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F28/40 (pg.102/162)

Page 103: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

The dual of a matroid is (indeed) a matroid

Theorem 8.5.5

Given matroid M = (V, I), let M∗ = (V, I∗) be as previously defined.Then M∗ is a matroid.

Proof.

Clearly ∅ ∈ I∗, so (I1’) holds.

Also, if I ⊆ J ∈ I∗, then clearly also I ∈ I∗ since if V \ J is spanningin M , so must V \ I. Therefore, (I2’) holds.

. . .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F29/40 (pg.103/162)

Page 104: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

The dual of a matroid is (indeed) a matroid

Theorem 8.5.5

Given matroid M = (V, I), let M∗ = (V, I∗) be as previously defined.Then M∗ is a matroid.

Proof.

Clearly ∅ ∈ I∗, so (I1’) holds.

Also, if I ⊆ J ∈ I∗, then clearly also I ∈ I∗ since if V \ J is spanningin M , so must V \ I. Therefore, (I2’) holds.

. . .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F29/40 (pg.104/162)

Page 105: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

The dual of a matroid is (indeed) a matroid

Theorem 8.5.5

Given matroid M = (V, I), let M∗ = (V, I∗) be as previously defined.Then M∗ is a matroid.

Proof.

Consider I, J ∈ I∗ with |I| < |J |. We need to show that there is somemember v ∈ J \ I such that I + v is independent in M∗, which meansthat V \ (I + v) = (V \ I) \ v is still spanning in M . That is, removingv from V \ I doesn’t make (V \ I) \ v not spanning in M .

Since V \ J is spanning in M , V \ J contains some base (sayB ⊆ V \ J) of M . Also, V \ I contains a base of M , say B′ ⊆ V \ I.

Since B \ I ⊆ V \ I, and B \ I is independent in M , we can choosethe base B′ of M s.t. B \ I ⊆ B′ ⊆ V \ I.

Since B and J are disjoint, we have both: 1) B \ I and J \ I aredisjoint; and 2) B ∩ I ⊆ I \ J . Also note, B′ and I are disjoint.

. . .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F29/40 (pg.105/162)

Page 106: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

The dual of a matroid is (indeed) a matroid

Theorem 8.5.5

Given matroid M = (V, I), let M∗ = (V, I∗) be as previously defined.Then M∗ is a matroid.

Proof.

Consider I, J ∈ I∗ with |I| < |J |. We need to show that there is somemember v ∈ J \ I such that I + v is independent in M∗, which meansthat V \ (I + v) = (V \ I) \ v is still spanning in M . That is, removingv from V \ I doesn’t make (V \ I) \ v not spanning in M .

Since V \ J is spanning in M , V \ J contains some base (sayB ⊆ V \ J) of M . Also, V \ I contains a base of M , say B′ ⊆ V \ I.

Since B \ I ⊆ V \ I, and B \ I is independent in M , we can choosethe base B′ of M s.t. B \ I ⊆ B′ ⊆ V \ I.

Since B and J are disjoint, we have both: 1) B \ I and J \ I aredisjoint; and 2) B ∩ I ⊆ I \ J . Also note, B′ and I are disjoint.

. . .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F29/40 (pg.106/162)

Page 107: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

The dual of a matroid is (indeed) a matroid

Theorem 8.5.5

Given matroid M = (V, I), let M∗ = (V, I∗) be as previously defined.Then M∗ is a matroid.

Proof.

Consider I, J ∈ I∗ with |I| < |J |. We need to show that there is somemember v ∈ J \ I such that I + v is independent in M∗, which meansthat V \ (I + v) = (V \ I) \ v is still spanning in M . That is, removingv from V \ I doesn’t make (V \ I) \ v not spanning in M .

Since V \ J is spanning in M , V \ J contains some base (sayB ⊆ V \ J) of M . Also, V \ I contains a base of M , say B′ ⊆ V \ I.

Since B \ I ⊆ V \ I, and B \ I is independent in M , we can choosethe base B′ of M s.t. B \ I ⊆ B′ ⊆ V \ I.

Since B and J are disjoint, we have both: 1) B \ I and J \ I aredisjoint; and 2) B ∩ I ⊆ I \ J . Also note, B′ and I are disjoint.

. . .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F29/40 (pg.107/162)

Page 108: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

The dual of a matroid is (indeed) a matroid

Theorem 8.5.5

Given matroid M = (V, I), let M∗ = (V, I∗) be as previously defined.Then M∗ is a matroid.

Proof.

Consider I, J ∈ I∗ with |I| < |J |. We need to show that there is somemember v ∈ J \ I such that I + v is independent in M∗, which meansthat V \ (I + v) = (V \ I) \ v is still spanning in M . That is, removingv from V \ I doesn’t make (V \ I) \ v not spanning in M .

Since V \ J is spanning in M , V \ J contains some base (sayB ⊆ V \ J) of M . Also, V \ I contains a base of M , say B′ ⊆ V \ I.

Since B \ I ⊆ V \ I, and B \ I is independent in M , we can choosethe base B′ of M s.t. B \ I ⊆ B′ ⊆ V \ I.

Since B and J are disjoint, we have both: 1) B \ I and J \ I aredisjoint; and 2) B ∩ I ⊆ I \ J . Also note, B′ and I are disjoint. . . .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F29/40 (pg.108/162)

Page 109: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

The dual of a matroid is (indeed) a matroid

Theorem 8.5.5

Given matroid M = (V, I), let M∗ = (V, I∗) be as previously defined.Then M∗ is a matroid.

Proof.

Now J \ I 6⊆ B′, since otherwise (i.e., assuming J \ I ⊆ B′):

|B| = |B ∩ I|+ |B \ I| (8.15)

≤ |I \ J |+ |B \ I| (8.16)

< |J \ I|+ |B \ I| ≤ |B′| (8.17)

which is a contradiction. The last inequality on the right follows sinceJ \ I ⊆ B′ (by assumption) and B \ I ⊆ B′ implies that (J \ I)∪ (B \ I) ⊆ B′, butsince J and B are disjoint, we have that |J \ I|+ |B \ I| ≤ |B′|.

Therefore, J \ I 6⊆ B′, and there is a v ∈ J \ I s.t. v /∈ B′.So B′ is disjoint with I ∪ {v}, means B′ ⊆ V \ (I ∪ {v}), orV \ (I ∪ {v}) is spanning in M , and therefore I ∪ {v} ∈ I∗.

. . .Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F29/40 (pg.109/162)

Page 110: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

The dual of a matroid is (indeed) a matroid

Theorem 8.5.5

Given matroid M = (V, I), let M∗ = (V, I∗) be as previously defined.Then M∗ is a matroid.

Proof.

Now J \ I 6⊆ B′, since otherwise (i.e., assuming J \ I ⊆ B′):

|B| = |B ∩ I|+ |B \ I| (8.15)

≤ |I \ J |+ |B \ I| (8.16)

< |J \ I|+ |B \ I| ≤ |B′| (8.17)

which is a contradiction.

Therefore, J \ I 6⊆ B′, and there is a v ∈ J \ I s.t. v /∈ B′.

So B′ is disjoint with I ∪ {v}, means B′ ⊆ V \ (I ∪ {v}), orV \ (I ∪ {v}) is spanning in M , and therefore I ∪ {v} ∈ I∗.

. . .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F29/40 (pg.110/162)

Page 111: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

The dual of a matroid is (indeed) a matroid

Theorem 8.5.5

Given matroid M = (V, I), let M∗ = (V, I∗) be as previously defined.Then M∗ is a matroid.

Proof.

Now J \ I 6⊆ B′, since otherwise (i.e., assuming J \ I ⊆ B′):

|B| = |B ∩ I|+ |B \ I| (8.15)

≤ |I \ J |+ |B \ I| (8.16)

< |J \ I|+ |B \ I| ≤ |B′| (8.17)

which is a contradiction.

Therefore, J \ I 6⊆ B′, and there is a v ∈ J \ I s.t. v /∈ B′.So B′ is disjoint with I ∪ {v}, means B′ ⊆ V \ (I ∪ {v}), orV \ (I ∪ {v}) is spanning in M , and therefore I ∪ {v} ∈ I∗.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F29/40 (pg.111/162)

Page 112: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid Duals and Representability

Theorem 8.5.6

Let M be an F-representable matroid (i.e., one that can be represented bya finite sized matrix over field F). Then M∗ is also F-representable.

Hence, for matroids as general as matric matroids, duality does not extendthe space of matroids that can be used.

Theorem 8.5.7

Let M be a graphic matroid (i.e., one that can be represented by a graphG = (V,E)). Then M∗ is not necessarily also graphic.

Hence, for graphic matroids, duality can increase the space and power ofmatroids, and since they are based on a graph, they are relatively easy touse: 1) all cuts are dependent sets; 2) minimal cuts are cycles; 3) bases areone edge less than minimal cuts; and 4) independent sets are edges that arenot cuts.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F30/40 (pg.112/162)

Page 113: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid Duals and Representability

Theorem 8.5.6

Let M be an F-representable matroid (i.e., one that can be represented bya finite sized matrix over field F). Then M∗ is also F-representable.

Hence, for matroids as general as matric matroids, duality does not extendthe space of matroids that can be used.

Theorem 8.5.7

Let M be a graphic matroid (i.e., one that can be represented by a graphG = (V,E)). Then M∗ is not necessarily also graphic.

Hence, for graphic matroids, duality can increase the space and power ofmatroids, and since they are based on a graph, they are relatively easy touse: 1) all cuts are dependent sets; 2) minimal cuts are cycles; 3) bases areone edge less than minimal cuts; and 4) independent sets are edges that arenot cuts.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F30/40 (pg.113/162)

Page 114: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual Matroid Rank

Theorem 8.5.8

The rank function rM∗ of the dual matroid M∗ may be specified in termsof the rank rM in matroid M as follows. For X ⊆ V :

rM∗(X) = |X|+ rM (V \X)− rM (V ) (8.18)

Note, we again immediately see that this is submodular by theproperties of submodular functions we saw in lectures 1 and 2. I.e., |X|is modular, complement f(V \X) is submodular if f is submodular, rM (V ) is aconstant, and summing submodular functions and a constant preservessubmodularity.

Non-negativity integral follows since|X|+ rM (V \X) ≥ rM (X) + rM (V \X) ≥ rM (V ).

Monotone non-decreasing follows since, as X increases by one, |X|always increases by 1, while rM (V \X) decreases by one or zero.

Therefore, rM∗ is the rank function of a matroid. That it is the dualmatroid rank function is shown in the next proof.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F31/40 (pg.114/162)

Page 115: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual Matroid Rank

Theorem 8.5.8

The rank function rM∗ of the dual matroid M∗ may be specified in termsof the rank rM in matroid M as follows. For X ⊆ V :

rM∗(X) = |X|+ rM (V \X)− rM (V ) (8.18)

Note, we again immediately see that this is submodular by theproperties of submodular functions we saw in lectures 1 and 2.

Non-negativity integral follows since|X|+ rM (V \X) ≥ rM (X) + rM (V \X) ≥ rM (V ). The right inequalityfollows since rM is submodular.

Monotone non-decreasing follows since, as X increases by one, |X|always increases by 1, while rM (V \X) decreases by one or zero.

Therefore, rM∗ is the rank function of a matroid. That it is the dualmatroid rank function is shown in the next proof.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F31/40 (pg.115/162)

Page 116: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual Matroid Rank

Theorem 8.5.8

The rank function rM∗ of the dual matroid M∗ may be specified in termsof the rank rM in matroid M as follows. For X ⊆ V :

rM∗(X) = |X|+ rM (V \X)− rM (V ) (8.18)

Note, we again immediately see that this is submodular by theproperties of submodular functions we saw in lectures 1 and 2.

Non-negativity integral follows since|X|+ rM (V \X) ≥ rM (X) + rM (V \X) ≥ rM (V ).

Monotone non-decreasing follows since, as X increases by one, |X|always increases by 1, while rM (V \X) decreases by one or zero.

Therefore, rM∗ is the rank function of a matroid. That it is the dualmatroid rank function is shown in the next proof.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F31/40 (pg.116/162)

Page 117: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual Matroid Rank

Theorem 8.5.8

The rank function rM∗ of the dual matroid M∗ may be specified in termsof the rank rM in matroid M as follows. For X ⊆ V :

rM∗(X) = |X|+ rM (V \X)− rM (V ) (8.18)

Note, we again immediately see that this is submodular by theproperties of submodular functions we saw in lectures 1 and 2.

Non-negativity integral follows since|X|+ rM (V \X) ≥ rM (X) + rM (V \X) ≥ rM (V ).

Monotone non-decreasing follows since, as X increases by one, |X|always increases by 1, while rM (V \X) decreases by one or zero.

Therefore, rM∗ is the rank function of a matroid. That it is the dualmatroid rank function is shown in the next proof.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F31/40 (pg.117/162)

Page 118: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual Matroid Rank

Theorem 8.5.8

The rank function rM∗ of the dual matroid M∗ may be specified in termsof the rank rM in matroid M as follows. For X ⊆ V :

rM∗(X) = |X|+ rM (V \X)− rM (V ) (8.18)

Proof.

A set X is independent in (V, rM∗) if and only if

rM∗(X) = |X|+ rM (V \X)− rM (V ) = |X| (8.19)

or

rM (V \X) = rM (V ) (8.20)

But a subset X is independent in M∗ only if V \X is spanning in M (bythe definition of the dual matroid).

. . .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F31/40 (pg.118/162)

Page 119: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual Matroid Rank

Theorem 8.5.8

The rank function rM∗ of the dual matroid M∗ may be specified in termsof the rank rM in matroid M as follows. For X ⊆ V :

rM∗(X) = |X|+ rM (V \X)− rM (V ) (8.18)

Proof.

A set X is independent in (V, rM∗) if and only if

rM∗(X) = |X|+ rM (V \X)− rM (V ) = |X| (8.19)

or

rM (V \X) = rM (V ) (8.20)

But a subset X is independent in M∗ only if V \X is spanning in M (bythe definition of the dual matroid).

. . .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F31/40 (pg.119/162)

Page 120: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual Matroid Rank

Theorem 8.5.8

The rank function rM∗ of the dual matroid M∗ may be specified in termsof the rank rM in matroid M as follows. For X ⊆ V :

rM∗(X) = |X|+ rM (V \X)− rM (V ) (8.18)

Proof.

A set X is independent in (V, rM∗) if and only if

rM∗(X) = |X|+ rM (V \X)− rM (V ) = |X| (8.19)

or

rM (V \X) = rM (V ) (8.20)

But a subset X is independent in M∗ only if V \X is spanning in M (bythe definition of the dual matroid).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F31/40 (pg.120/162)

Page 121: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid restriction/deletion

Let M = (V, I) be a matroid and let Y ⊆ V , then

IY = {Z : Z ⊆ Y, Z ∈ I} (8.21)

is such that MY = (Y, IY ) is a matroid with rank r(MY ) = r(Y ).

This is called the restriction of M to Y , and is often written M |Y .

If Y = V \X, then we have that M |Y has the form:

IY = {Z : Z ∩X = ∅, Z ∈ I} (8.22)

is considered a deletion of X from M , and is often written M \X.

Hence, M |Y = M \ (V \ Y ), and M |(V \X) = M \X.

The rank function is of the same form. I.e., rY : 2Y → Z+, whererY (Z) = r(Z) for Z ⊆ Y .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F32/40 (pg.121/162)

Page 122: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid restriction/deletion

Let M = (V, I) be a matroid and let Y ⊆ V , then

IY = {Z : Z ⊆ Y, Z ∈ I} (8.21)

is such that MY = (Y, IY ) is a matroid with rank r(MY ) = r(Y ).

This is called the restriction of M to Y , and is often written M |Y .

If Y = V \X, then we have that M |Y has the form:

IY = {Z : Z ∩X = ∅, Z ∈ I} (8.22)

is considered a deletion of X from M , and is often written M \X.

Hence, M |Y = M \ (V \ Y ), and M |(V \X) = M \X.

The rank function is of the same form. I.e., rY : 2Y → Z+, whererY (Z) = r(Z) for Z ⊆ Y .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F32/40 (pg.122/162)

Page 123: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid restriction/deletion

Let M = (V, I) be a matroid and let Y ⊆ V , then

IY = {Z : Z ⊆ Y, Z ∈ I} (8.21)

is such that MY = (Y, IY ) is a matroid with rank r(MY ) = r(Y ).

This is called the restriction of M to Y , and is often written M |Y .

If Y = V \X, then we have that M |Y has the form:

IY = {Z : Z ∩X = ∅, Z ∈ I} (8.22)

is considered a deletion of X from M , and is often written M \X.

Hence, M |Y = M \ (V \ Y ), and M |(V \X) = M \X.

The rank function is of the same form. I.e., rY : 2Y → Z+, whererY (Z) = r(Z) for Z ⊆ Y .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F32/40 (pg.123/162)

Page 124: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid restriction/deletion

Let M = (V, I) be a matroid and let Y ⊆ V , then

IY = {Z : Z ⊆ Y, Z ∈ I} (8.21)

is such that MY = (Y, IY ) is a matroid with rank r(MY ) = r(Y ).

This is called the restriction of M to Y , and is often written M |Y .

If Y = V \X, then we have that M |Y has the form:

IY = {Z : Z ∩X = ∅, Z ∈ I} (8.22)

is considered a deletion of X from M , and is often written M \X.

Hence, M |Y = M \ (V \ Y ), and M |(V \X) = M \X.

The rank function is of the same form. I.e., rY : 2Y → Z+, whererY (Z) = r(Z) for Z ⊆ Y .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F32/40 (pg.124/162)

Page 125: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid restriction/deletion

Let M = (V, I) be a matroid and let Y ⊆ V , then

IY = {Z : Z ⊆ Y, Z ∈ I} (8.21)

is such that MY = (Y, IY ) is a matroid with rank r(MY ) = r(Y ).

This is called the restriction of M to Y , and is often written M |Y .

If Y = V \X, then we have that M |Y has the form:

IY = {Z : Z ∩X = ∅, Z ∈ I} (8.22)

is considered a deletion of X from M , and is often written M \X.

Hence, M |Y = M \ (V \ Y ), and M |(V \X) = M \X.

The rank function is of the same form. I.e., rY : 2Y → Z+, whererY (Z) = r(Z) for Z ⊆ Y .

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F32/40 (pg.125/162)

Page 126: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid contraction M/Z

Contraction is dual to deletion, and is like a forced inclusion ofcontained base, but with a similar ground set removal. Contracting Zis written M/Z.

Let Z ⊆ V and let X be a base of Z. Then a subset I of V \ Z isindependent in M/Z iff I ∪X is independent in M .

The rank function takes the form

rM/Z(Y ) = r(Y ∪ Z)− r(Z) = r(Y |Z) (8.23)

So given I ⊆ V \Z and X is a base of Z, rM/Z(I) = |I| is identical tor(I ∪ Z) = |I|+ r(Z) = |I|+ |X| but r(I ∪ Z) = r(I ∪X). Thisimplies r(I ∪X) = |I|+ |X|, or I ∪X is independent in M .

A minor of a matroid is any matroid obtained via a series of deletionsand contractions of some matroid.

In fact, it is the case M/Z = (M∗ \ Z)∗ (Exercise: show why).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F33/40 (pg.126/162)

Page 127: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid contraction M/Z

Contraction is dual to deletion, and is like a forced inclusion ofcontained base, but with a similar ground set removal. Contracting Zis written M/Z.

Let Z ⊆ V and let X be a base of Z. Then a subset I of V \ Z isindependent in M/Z iff I ∪X is independent in M .

The rank function takes the form

rM/Z(Y ) = r(Y ∪ Z)− r(Z) = r(Y |Z) (8.23)

So given I ⊆ V \Z and X is a base of Z, rM/Z(I) = |I| is identical tor(I ∪ Z) = |I|+ r(Z) = |I|+ |X| but r(I ∪ Z) = r(I ∪X). Thisimplies r(I ∪X) = |I|+ |X|, or I ∪X is independent in M .

A minor of a matroid is any matroid obtained via a series of deletionsand contractions of some matroid.

In fact, it is the case M/Z = (M∗ \ Z)∗ (Exercise: show why).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F33/40 (pg.127/162)

Page 128: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid contraction M/Z

Contraction is dual to deletion, and is like a forced inclusion ofcontained base, but with a similar ground set removal. Contracting Zis written M/Z.

Let Z ⊆ V and let X be a base of Z. Then a subset I of V \ Z isindependent in M/Z iff I ∪X is independent in M .

The rank function takes the form

rM/Z(Y ) = r(Y ∪ Z)− r(Z) = r(Y |Z) (8.23)

So given I ⊆ V \Z and X is a base of Z, rM/Z(I) = |I| is identical tor(I ∪ Z) = |I|+ r(Z) = |I|+ |X| but r(I ∪ Z) = r(I ∪X). Thisimplies r(I ∪X) = |I|+ |X|, or I ∪X is independent in M .

A minor of a matroid is any matroid obtained via a series of deletionsand contractions of some matroid.

In fact, it is the case M/Z = (M∗ \ Z)∗ (Exercise: show why).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F33/40 (pg.128/162)

Page 129: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid contraction M/Z

Contraction is dual to deletion, and is like a forced inclusion ofcontained base, but with a similar ground set removal. Contracting Zis written M/Z.

Let Z ⊆ V and let X be a base of Z. Then a subset I of V \ Z isindependent in M/Z iff I ∪X is independent in M .

The rank function takes the form

rM/Z(Y ) = r(Y ∪ Z)− r(Z) = r(Y |Z) (8.23)

So given I ⊆ V \Z and X is a base of Z, rM/Z(I) = |I| is identical tor(I ∪ Z) = |I|+ r(Z) = |I|+ |X| but r(I ∪ Z) = r(I ∪X). Thisimplies r(I ∪X) = |I|+ |X|, or I ∪X is independent in M .

A minor of a matroid is any matroid obtained via a series of deletionsand contractions of some matroid.

In fact, it is the case M/Z = (M∗ \ Z)∗ (Exercise: show why).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F33/40 (pg.129/162)

Page 130: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid contraction M/Z

Contraction is dual to deletion, and is like a forced inclusion ofcontained base, but with a similar ground set removal. Contracting Zis written M/Z.

Let Z ⊆ V and let X be a base of Z. Then a subset I of V \ Z isindependent in M/Z iff I ∪X is independent in M .

The rank function takes the form

rM/Z(Y ) = r(Y ∪ Z)− r(Z) = r(Y |Z) (8.23)

So given I ⊆ V \Z and X is a base of Z, rM/Z(I) = |I| is identical tor(I ∪ Z) = |I|+ r(Z) = |I|+ |X| but r(I ∪ Z) = r(I ∪X). Thisimplies r(I ∪X) = |I|+ |X|, or I ∪X is independent in M .

A minor of a matroid is any matroid obtained via a series of deletionsand contractions of some matroid.

In fact, it is the case M/Z = (M∗ \ Z)∗ (Exercise: show why).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F33/40 (pg.130/162)

Page 131: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid contraction M/Z

Contraction is dual to deletion, and is like a forced inclusion ofcontained base, but with a similar ground set removal. Contracting Zis written M/Z.

Let Z ⊆ V and let X be a base of Z. Then a subset I of V \ Z isindependent in M/Z iff I ∪X is independent in M .

The rank function takes the form

rM/Z(Y ) = r(Y ∪ Z)− r(Z) = r(Y |Z) (8.23)

So given I ⊆ V \Z and X is a base of Z, rM/Z(I) = |I| is identical tor(I ∪ Z) = |I|+ r(Z) = |I|+ |X| but r(I ∪ Z) = r(I ∪X). Thisimplies r(I ∪X) = |I|+ |X|, or I ∪X is independent in M .

A minor of a matroid is any matroid obtained via a series of deletionsand contractions of some matroid.

In fact, it is the case M/Z = (M∗ \ Z)∗ (Exercise: show why).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F33/40 (pg.131/162)

Page 132: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid Intersection

Let M1 = (V, I1) and M2 = (V, I2) be two matroids. Consider theircommon independent sets I1 ∩ I2.

While (V, I1 ∩ I2) is typically not a matroid (Exercise: show graphicalexample.), we might be interested in finding the maximum sizecommon independent set. That is, find max |X| such that bothX ∈ I1 and X ∈ I2.

Theorem 8.6.1

Let M1 and M2 be given as above, with rank functions r1 and r2. Then thesize of the maximum size set in I1 ∩ I2 is given by

(r1 ∗ r2)(V ) , minX⊆V

(r1(X) + r2(V \X)

)(8.24)

This is an instance of the convolution of two submodular functions, f1and f2 that, evaluated at Y ⊆ V , is written as:

(f1 ∗ f2)(Y ) = minX⊆Y

(f1(X) + f2(Y \X)

)(8.25)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F34/40 (pg.132/162)

Page 133: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid Intersection

Let M1 = (V, I1) and M2 = (V, I2) be two matroids. Consider theircommon independent sets I1 ∩ I2.While (V, I1 ∩ I2) is typically not a matroid (Exercise: show graphicalexample.), we might be interested in finding the maximum sizecommon independent set. That is, find max |X| such that bothX ∈ I1 and X ∈ I2.

Theorem 8.6.1

Let M1 and M2 be given as above, with rank functions r1 and r2. Then thesize of the maximum size set in I1 ∩ I2 is given by

(r1 ∗ r2)(V ) , minX⊆V

(r1(X) + r2(V \X)

)(8.24)

This is an instance of the convolution of two submodular functions, f1and f2 that, evaluated at Y ⊆ V , is written as:

(f1 ∗ f2)(Y ) = minX⊆Y

(f1(X) + f2(Y \X)

)(8.25)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F34/40 (pg.133/162)

Page 134: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid Intersection

Let M1 = (V, I1) and M2 = (V, I2) be two matroids. Consider theircommon independent sets I1 ∩ I2.While (V, I1 ∩ I2) is typically not a matroid (Exercise: show graphicalexample.), we might be interested in finding the maximum sizecommon independent set. That is, find max |X| such that bothX ∈ I1 and X ∈ I2.

Theorem 8.6.1

Let M1 and M2 be given as above, with rank functions r1 and r2. Then thesize of the maximum size set in I1 ∩ I2 is given by

(r1 ∗ r2)(V ) , minX⊆V

(r1(X) + r2(V \X)

)(8.24)

This is an instance of the convolution of two submodular functions, f1and f2 that, evaluated at Y ⊆ V , is written as:

(f1 ∗ f2)(Y ) = minX⊆Y

(f1(X) + f2(Y \X)

)(8.25)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F34/40 (pg.134/162)

Page 135: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid Intersection

Let M1 = (V, I1) and M2 = (V, I2) be two matroids. Consider theircommon independent sets I1 ∩ I2.While (V, I1 ∩ I2) is typically not a matroid (Exercise: show graphicalexample.), we might be interested in finding the maximum sizecommon independent set. That is, find max |X| such that bothX ∈ I1 and X ∈ I2.

Theorem 8.6.1

Let M1 and M2 be given as above, with rank functions r1 and r2. Then thesize of the maximum size set in I1 ∩ I2 is given by

(r1 ∗ r2)(V ) , minX⊆V

(r1(X) + r2(V \X)

)(8.24)

This is an instance of the convolution of two submodular functions, f1and f2 that, evaluated at Y ⊆ V , is written as:

(f1 ∗ f2)(Y ) = minX⊆Y

(f1(X) + f2(Y \X)

)(8.25)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F34/40 (pg.135/162)

Page 136: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Convolution and Hall’s Theorem

Recall Hall’s theorem, that a transversal exists iff for all X ⊆ V , wehave |Γ(X)| ≥ |X|.

⇔ |Γ(X)| − |X| ≥ 0, ∀X⇔ minX |Γ(X)| − |X| ≥ 0

⇔ minX |Γ(X)|+ |V | − |X| ≥ |V |⇔ minX

(|Γ(X)|+ |V \X|

)≥ |V |

⇔ [Γ(·) ∗ | · |](V ) ≥ |V |So Hall’s theorem can be expressed as convolution. Exercise: defineg(A) = [Γ(·) ∗ | · |](A), prove that g is submodular.

Note, in general, convolution of two submodular functions does notpreserve submodularity (but in certain special cases it does).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F35/40 (pg.136/162)

Page 137: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Convolution and Hall’s Theorem

Recall Hall’s theorem, that a transversal exists iff for all X ⊆ V , wehave |Γ(X)| ≥ |X|.⇔ |Γ(X)| − |X| ≥ 0, ∀X

⇔ minX |Γ(X)| − |X| ≥ 0

⇔ minX |Γ(X)|+ |V | − |X| ≥ |V |⇔ minX

(|Γ(X)|+ |V \X|

)≥ |V |

⇔ [Γ(·) ∗ | · |](V ) ≥ |V |So Hall’s theorem can be expressed as convolution. Exercise: defineg(A) = [Γ(·) ∗ | · |](A), prove that g is submodular.

Note, in general, convolution of two submodular functions does notpreserve submodularity (but in certain special cases it does).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F35/40 (pg.137/162)

Page 138: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Convolution and Hall’s Theorem

Recall Hall’s theorem, that a transversal exists iff for all X ⊆ V , wehave |Γ(X)| ≥ |X|.⇔ |Γ(X)| − |X| ≥ 0, ∀X⇔ minX |Γ(X)| − |X| ≥ 0

⇔ minX |Γ(X)|+ |V | − |X| ≥ |V |⇔ minX

(|Γ(X)|+ |V \X|

)≥ |V |

⇔ [Γ(·) ∗ | · |](V ) ≥ |V |So Hall’s theorem can be expressed as convolution. Exercise: defineg(A) = [Γ(·) ∗ | · |](A), prove that g is submodular.

Note, in general, convolution of two submodular functions does notpreserve submodularity (but in certain special cases it does).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F35/40 (pg.138/162)

Page 139: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Convolution and Hall’s Theorem

Recall Hall’s theorem, that a transversal exists iff for all X ⊆ V , wehave |Γ(X)| ≥ |X|.⇔ |Γ(X)| − |X| ≥ 0, ∀X⇔ minX |Γ(X)| − |X| ≥ 0

⇔ minX |Γ(X)|+ |V | − |X| ≥ |V |

⇔ minX

(|Γ(X)|+ |V \X|

)≥ |V |

⇔ [Γ(·) ∗ | · |](V ) ≥ |V |So Hall’s theorem can be expressed as convolution. Exercise: defineg(A) = [Γ(·) ∗ | · |](A), prove that g is submodular.

Note, in general, convolution of two submodular functions does notpreserve submodularity (but in certain special cases it does).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F35/40 (pg.139/162)

Page 140: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Convolution and Hall’s Theorem

Recall Hall’s theorem, that a transversal exists iff for all X ⊆ V , wehave |Γ(X)| ≥ |X|.⇔ |Γ(X)| − |X| ≥ 0, ∀X⇔ minX |Γ(X)| − |X| ≥ 0

⇔ minX |Γ(X)|+ |V | − |X| ≥ |V |⇔ minX

(|Γ(X)|+ |V \X|

)≥ |V |

⇔ [Γ(·) ∗ | · |](V ) ≥ |V |So Hall’s theorem can be expressed as convolution. Exercise: defineg(A) = [Γ(·) ∗ | · |](A), prove that g is submodular.

Note, in general, convolution of two submodular functions does notpreserve submodularity (but in certain special cases it does).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F35/40 (pg.140/162)

Page 141: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Convolution and Hall’s Theorem

Recall Hall’s theorem, that a transversal exists iff for all X ⊆ V , wehave |Γ(X)| ≥ |X|.⇔ |Γ(X)| − |X| ≥ 0, ∀X⇔ minX |Γ(X)| − |X| ≥ 0

⇔ minX |Γ(X)|+ |V | − |X| ≥ |V |⇔ minX

(|Γ(X)|+ |V \X|

)≥ |V |

⇔ [Γ(·) ∗ | · |](V ) ≥ |V |

So Hall’s theorem can be expressed as convolution. Exercise: defineg(A) = [Γ(·) ∗ | · |](A), prove that g is submodular.

Note, in general, convolution of two submodular functions does notpreserve submodularity (but in certain special cases it does).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F35/40 (pg.141/162)

Page 142: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Convolution and Hall’s Theorem

Recall Hall’s theorem, that a transversal exists iff for all X ⊆ V , wehave |Γ(X)| ≥ |X|.⇔ |Γ(X)| − |X| ≥ 0, ∀X⇔ minX |Γ(X)| − |X| ≥ 0

⇔ minX |Γ(X)|+ |V | − |X| ≥ |V |⇔ minX

(|Γ(X)|+ |V \X|

)≥ |V |

⇔ [Γ(·) ∗ | · |](V ) ≥ |V |So Hall’s theorem can be expressed as convolution. Exercise: defineg(A) = [Γ(·) ∗ | · |](A), prove that g is submodular.

Note, in general, convolution of two submodular functions does notpreserve submodularity (but in certain special cases it does).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F35/40 (pg.142/162)

Page 143: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Convolution and Hall’s Theorem

Recall Hall’s theorem, that a transversal exists iff for all X ⊆ V , wehave |Γ(X)| ≥ |X|.⇔ |Γ(X)| − |X| ≥ 0, ∀X⇔ minX |Γ(X)| − |X| ≥ 0

⇔ minX |Γ(X)|+ |V | − |X| ≥ |V |⇔ minX

(|Γ(X)|+ |V \X|

)≥ |V |

⇔ [Γ(·) ∗ | · |](V ) ≥ |V |So Hall’s theorem can be expressed as convolution. Exercise: defineg(A) = [Γ(·) ∗ | · |](A), prove that g is submodular.

Note, in general, convolution of two submodular functions does notpreserve submodularity (but in certain special cases it does).

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F35/40 (pg.143/162)

Page 144: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid UnionDefinition 8.6.2

Let M1 = (V1, I1), M2 = (V2, I2), . . . , Mk = (Vk, Ik) be matroids. Wedefine the union of matroids asM1 ∨M2 ∨ · · · ∨Mk = (V1 ] V2 ] · · · ] Vk, I1 ∨ I2 ∨ · · · ∨ Ik), where

I1 ∨ I2 ∨ · · · ∨ Ik = {I1 ] I2 ] · · · ] Ik|I1 ∈ I1, . . . , Ik ∈ Ik} (8.26)

Note A ]B designates the disjoint union of A and B.

Theorem 8.6.3

Let M1 = (V1, I1), M2 = (V2, I2), . . . , Mk = (Vk, Ik) be matroids, withrank functions r1, . . . , rk. Then the union of these matroids is still amatroid, having rank function

r(Y ) = minX⊆Y

(|Y \X|+ r1(X ∩ V1) + · · ·+ rk(X ∩ Vk)

)(8.27)

for any Y ⊆ V1 ∪ . . . Vk.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F36/40 (pg.144/162)

Page 145: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid UnionDefinition 8.6.2

Let M1 = (V1, I1), M2 = (V2, I2), . . . , Mk = (Vk, Ik) be matroids. Wedefine the union of matroids asM1 ∨M2 ∨ · · · ∨Mk = (V1 ] V2 ] · · · ] Vk, I1 ∨ I2 ∨ · · · ∨ Ik), where

I1 ∨ I2 ∨ · · · ∨ Ik = {I1 ] I2 ] · · · ] Ik|I1 ∈ I1, . . . , Ik ∈ Ik} (8.26)

Note A ]B designates the disjoint union of A and B.

Theorem 8.6.3

Let M1 = (V1, I1), M2 = (V2, I2), . . . , Mk = (Vk, Ik) be matroids, withrank functions r1, . . . , rk. Then the union of these matroids is still amatroid, having rank function

r(Y ) = minX⊆Y

(|Y \X|+ r1(X ∩ V1) + · · ·+ rk(X ∩ Vk)

)(8.27)

for any Y ⊆ V1 ∪ . . . Vk.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F36/40 (pg.145/162)

Page 146: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Exercise: Matroid Union, and Matroid duality

Exercise: Describe M ∨M∗.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F37/40 (pg.146/162)

Page 147: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroids of three or fewer elements are graphic

All matroids up to and including three elements (edges) are graphic.

(a) The onlymatroid with zeroelements.

(b) The twoone-elementmatroids.

(c) The fourtwo-elementmatroids.

(d) The eightthree-elementmatroids.

This is a nice way to show matroids with low ground set sizes. Whatabout matroids that are low rank but with many elements?

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F38/40 (pg.147/162)

Page 148: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroids of three or fewer elements are graphic

All matroids up to and including three elements (edges) are graphic.

(a) The onlymatroid with zeroelements.

(b) The twoone-elementmatroids.

(c) The fourtwo-elementmatroids.

(d) The eightthree-elementmatroids.

This is a nice way to show matroids with low ground set sizes. Whatabout matroids that are low rank but with many elements?

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F38/40 (pg.148/162)

Page 149: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroids of three or fewer elements are graphic

All matroids up to and including three elements (edges) are graphic.

(a) The onlymatroid with zeroelements.

(b) The twoone-elementmatroids.

(c) The fourtwo-elementmatroids.

(d) The eightthree-elementmatroids.

This is a nice way to show matroids with low ground set sizes. Whatabout matroids that are low rank but with many elements?

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F38/40 (pg.149/162)

Page 150: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Affine Matroids

Given an n×m matrix with entries over some field F, we say that asubset S ⊆ {1, . . . ,m} of indices (with corresponding column vectors{vi : i ∈ S}, with |S| = k) is affinely dependent if m ≥ 1 and thereexists elements {a1, . . . , ak} ∈ F, not all zero with

∑ki=1 ai = 0, such

that∑k

i=1 aivi = 0.

Otherwise, the set is called affinely independent.Concisely: points {v1, v2, . . . , vk} are affinely independent ifv2 − v1, v3 − v1, . . . , vk − v1 are linearly independent.Example: in 2D, three collinear points are affinely dependent, threenon-collear points are affinely independent, and ≥ 4 non-collinearpoints are affinely dependent.

Proposition 8.7.1 (affine matroid)

Let ground set E = {1, . . . ,m} index column vectors of a matrix, and let Ibe the set of subsets X of E such that X indices affinely independentvectors. Then (E, I) is a matroid.

Exercise: prove this.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F39/40 (pg.150/162)

Page 151: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Affine Matroids

Given an n×m matrix with entries over some field F, we say that asubset S ⊆ {1, . . . ,m} of indices (with corresponding column vectors{vi : i ∈ S}, with |S| = k) is affinely dependent if m ≥ 1 and thereexists elements {a1, . . . , ak} ∈ F, not all zero with

∑ki=1 ai = 0, such

that∑k

i=1 aivi = 0.Otherwise, the set is called affinely independent.

Concisely: points {v1, v2, . . . , vk} are affinely independent ifv2 − v1, v3 − v1, . . . , vk − v1 are linearly independent.Example: in 2D, three collinear points are affinely dependent, threenon-collear points are affinely independent, and ≥ 4 non-collinearpoints are affinely dependent.

Proposition 8.7.1 (affine matroid)

Let ground set E = {1, . . . ,m} index column vectors of a matrix, and let Ibe the set of subsets X of E such that X indices affinely independentvectors. Then (E, I) is a matroid.

Exercise: prove this.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F39/40 (pg.151/162)

Page 152: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Affine Matroids

Given an n×m matrix with entries over some field F, we say that asubset S ⊆ {1, . . . ,m} of indices (with corresponding column vectors{vi : i ∈ S}, with |S| = k) is affinely dependent if m ≥ 1 and thereexists elements {a1, . . . , ak} ∈ F, not all zero with

∑ki=1 ai = 0, such

that∑k

i=1 aivi = 0.Otherwise, the set is called affinely independent.Concisely: points {v1, v2, . . . , vk} are affinely independent ifv2 − v1, v3 − v1, . . . , vk − v1 are linearly independent.

Example: in 2D, three collinear points are affinely dependent, threenon-collear points are affinely independent, and ≥ 4 non-collinearpoints are affinely dependent.

Proposition 8.7.1 (affine matroid)

Let ground set E = {1, . . . ,m} index column vectors of a matrix, and let Ibe the set of subsets X of E such that X indices affinely independentvectors. Then (E, I) is a matroid.

Exercise: prove this.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F39/40 (pg.152/162)

Page 153: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Affine Matroids

Given an n×m matrix with entries over some field F, we say that asubset S ⊆ {1, . . . ,m} of indices (with corresponding column vectors{vi : i ∈ S}, with |S| = k) is affinely dependent if m ≥ 1 and thereexists elements {a1, . . . , ak} ∈ F, not all zero with

∑ki=1 ai = 0, such

that∑k

i=1 aivi = 0.Otherwise, the set is called affinely independent.Concisely: points {v1, v2, . . . , vk} are affinely independent ifv2 − v1, v3 − v1, . . . , vk − v1 are linearly independent.Example: in 2D, three collinear points are affinely dependent, threenon-collear points are affinely independent, and ≥ 4 non-collinearpoints are affinely dependent.

Proposition 8.7.1 (affine matroid)

Let ground set E = {1, . . . ,m} index column vectors of a matrix, and let Ibe the set of subsets X of E such that X indices affinely independentvectors. Then (E, I) is a matroid.

Exercise: prove this.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F39/40 (pg.153/162)

Page 154: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Affine Matroids

Given an n×m matrix with entries over some field F, we say that asubset S ⊆ {1, . . . ,m} of indices (with corresponding column vectors{vi : i ∈ S}, with |S| = k) is affinely dependent if m ≥ 1 and thereexists elements {a1, . . . , ak} ∈ F, not all zero with

∑ki=1 ai = 0, such

that∑k

i=1 aivi = 0.Otherwise, the set is called affinely independent.Concisely: points {v1, v2, . . . , vk} are affinely independent ifv2 − v1, v3 − v1, . . . , vk − v1 are linearly independent.Example: in 2D, three collinear points are affinely dependent, threenon-collear points are affinely independent, and ≥ 4 non-collinearpoints are affinely dependent.

Proposition 8.7.1 (affine matroid)

Let ground set E = {1, . . . ,m} index column vectors of a matrix, and let Ibe the set of subsets X of E such that X indices affinely independentvectors. Then (E, I) is a matroid.

Exercise: prove this.

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F39/40 (pg.154/162)

Page 155: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Affine Matroids

Given an n×m matrix with entries over some field F, we say that asubset S ⊆ {1, . . . ,m} of indices (with corresponding column vectors{vi : i ∈ S}, with |S| = k) is affinely dependent if m ≥ 1 and thereexists elements {a1, . . . , ak} ∈ F, not all zero with

∑ki=1 ai = 0, such

that∑k

i=1 aivi = 0.Otherwise, the set is called affinely independent.Concisely: points {v1, v2, . . . , vk} are affinely independent ifv2 − v1, v3 − v1, . . . , vk − v1 are linearly independent.Example: in 2D, three collinear points are affinely dependent, threenon-collear points are affinely independent, and ≥ 4 non-collinearpoints are affinely dependent.

Proposition 8.7.1 (affine matroid)

Let ground set E = {1, . . . ,m} index column vectors of a matrix, and let Ibe the set of subsets X of E such that X indices affinely independentvectors. Then (E, I) is a matroid.

Exercise: prove this.Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F39/40 (pg.155/162)

Page 156: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Euclidean Representation of Low-rank Matroids

Consider the affine matroid with n×m = 2× 6 matrix on the fieldF = R, and let the elements be {(0, 0), (1, 0), (2, 0), (0, 1), (0, 2), (1, 1)}.

We can plot the points in R2 as on the right:

Points have rank 1, lines have rank 2, planes haverank 3.

Flats (points, lines, planes, etc.) have rank equalto one more than their geometric dimension.

Any two points constitute a line, but lines withonly two points are not drawn.

Lines indicate collinear sets with ≥ 3 points, whileany two points have rank 2.

Dependent sets consist of all subsets with ≥ 4elements (rank 3), or 3 collinear elements (rank2). Any two points have rank 2.

x

y

(0,1) (0,2)

(1,1)(1,0)

(2,0)

(0,0)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F40/40 (pg.156/162)

Page 157: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Euclidean Representation of Low-rank Matroids

Consider the affine matroid with n×m = 2× 6 matrix on the fieldF = R, and let the elements be {(0, 0), (1, 0), (2, 0), (0, 1), (0, 2), (1, 1)}.

We can plot the points in R2 as on the right:

Points have rank 1, lines have rank 2, planes haverank 3.

Flats (points, lines, planes, etc.) have rank equalto one more than their geometric dimension.

Any two points constitute a line, but lines withonly two points are not drawn.

Lines indicate collinear sets with ≥ 3 points, whileany two points have rank 2.

Dependent sets consist of all subsets with ≥ 4elements (rank 3), or 3 collinear elements (rank2). Any two points have rank 2.

x

y

(0,1) (0,2)

(1,1)(1,0)

(2,0)

(0,0)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F40/40 (pg.157/162)

Page 158: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Euclidean Representation of Low-rank Matroids

Consider the affine matroid with n×m = 2× 6 matrix on the fieldF = R, and let the elements be {(0, 0), (1, 0), (2, 0), (0, 1), (0, 2), (1, 1)}.

We can plot the points in R2 as on the right:

Points have rank 1, lines have rank 2, planes haverank 3.

Flats (points, lines, planes, etc.) have rank equalto one more than their geometric dimension.

Any two points constitute a line, but lines withonly two points are not drawn.

Lines indicate collinear sets with ≥ 3 points, whileany two points have rank 2.

Dependent sets consist of all subsets with ≥ 4elements (rank 3), or 3 collinear elements (rank2). Any two points have rank 2.

x

y

(0,1) (0,2)

(1,1)(1,0)

(2,0)

(0,0)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F40/40 (pg.158/162)

Page 159: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Euclidean Representation of Low-rank Matroids

Consider the affine matroid with n×m = 2× 6 matrix on the fieldF = R, and let the elements be {(0, 0), (1, 0), (2, 0), (0, 1), (0, 2), (1, 1)}.

We can plot the points in R2 as on the right:

Points have rank 1, lines have rank 2, planes haverank 3.

Flats (points, lines, planes, etc.) have rank equalto one more than their geometric dimension.

Any two points constitute a line, but lines withonly two points are not drawn.

Lines indicate collinear sets with ≥ 3 points, whileany two points have rank 2.

Dependent sets consist of all subsets with ≥ 4elements (rank 3), or 3 collinear elements (rank2). Any two points have rank 2.

x

y

(0,1) (0,2)

(1,1)(1,0)

(2,0)

(0,0)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F40/40 (pg.159/162)

Page 160: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Euclidean Representation of Low-rank Matroids

Consider the affine matroid with n×m = 2× 6 matrix on the fieldF = R, and let the elements be {(0, 0), (1, 0), (2, 0), (0, 1), (0, 2), (1, 1)}.

We can plot the points in R2 as on the right:

Points have rank 1, lines have rank 2, planes haverank 3.

Flats (points, lines, planes, etc.) have rank equalto one more than their geometric dimension.

Any two points constitute a line, but lines withonly two points are not drawn.

Lines indicate collinear sets with ≥ 3 points, whileany two points have rank 2.

Dependent sets consist of all subsets with ≥ 4elements (rank 3), or 3 collinear elements (rank2). Any two points have rank 2.

x

y

(0,1) (0,2)

(1,1)(1,0)

(2,0)

(0,0)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F40/40 (pg.160/162)

Page 161: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Euclidean Representation of Low-rank Matroids

Consider the affine matroid with n×m = 2× 6 matrix on the fieldF = R, and let the elements be {(0, 0), (1, 0), (2, 0), (0, 1), (0, 2), (1, 1)}.

We can plot the points in R2 as on the right:

Points have rank 1, lines have rank 2, planes haverank 3.

Flats (points, lines, planes, etc.) have rank equalto one more than their geometric dimension.

Any two points constitute a line, but lines withonly two points are not drawn.

Lines indicate collinear sets with ≥ 3 points, whileany two points have rank 2.

Dependent sets consist of all subsets with ≥ 4elements (rank 3), or 3 collinear elements (rank2). Any two points have rank 2.

x

y

(0,1) (0,2)

(1,1)(1,0)

(2,0)

(0,0)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F40/40 (pg.161/162)

Page 162: Submodular Functions, Optimization, and Applications to ...€¦ · L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Euclidean Representation of Low-rank Matroids

Consider the affine matroid with n×m = 2× 6 matrix on the fieldF = R, and let the elements be {(0, 0), (1, 0), (2, 0), (0, 1), (0, 2), (1, 1)}.

We can plot the points in R2 as on the right:

Points have rank 1, lines have rank 2, planes haverank 3.

Flats (points, lines, planes, etc.) have rank equalto one more than their geometric dimension.

Any two points constitute a line, but lines withonly two points are not drawn.

Lines indicate collinear sets with ≥ 3 points, whileany two points have rank 2.

Dependent sets consist of all subsets with ≥ 4elements (rank 3), or 3 collinear elements (rank2). Any two points have rank 2.

x

y

(0,1) (0,2)

(1,1)(1,0)

(2,0)

(0,0)

Prof. Jeff Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F40/40 (pg.162/162)


Recommended