+ All Categories
Home > Documents > Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps...

Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps...

Date post: 07-Jul-2020
Category:
Upload: others
View: 6 times
Download: 0 times
Share this document with a friend
162
Submodular Functions, Optimization, and Applications to Machine Learning — Spring Quarter, Lecture 8 — http://www.ee.washington.edu/people/faculty/bilmes/classes/ee596b_spring_2016/ Prof. JeBilmes University of Washington, Seattle Department of Electrical Engineering http://melodi.ee.washington.edu/ ~ bilmes Apr 25th, 2016 + f (A)+ f (B) f (A [ B) =f (Ar ) + f ( C ) + f ( B r ) =f (A\B) f (A \ B) =f (Ar ) +2 f ( C ) + f ( B r ) Prof. JeBilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F1/40 (pg.1/162)
Transcript
Page 1: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Submodular Functions, Optimization,and Applications to Machine Learning

— Spring Quarter, Lecture 8 —http://www.ee.washington.edu/people/faculty/bilmes/classes/ee596b_spring_2016/

Prof. Je↵ Bilmes

University of Washington, SeattleDepartment of Electrical Engineering

http://melodi.ee.washington.edu/

~

bilmes

Apr 25th, 2016

+f (A) + f (B) f (A [ B)

= f (Ar ) +f (C ) + f (Br )

�= f (A \ B)

f (A \ B)

= f (Ar ) + 2f (C ) + f (Br )

Clockwise from top left:vLásló Lovász

Jack EdmondsSatoru Fujishige

George NemhauserLaurence Wolsey

András FrankLloyd ShapleyH. NarayananRobert Bixby

William CunninghamWilliam TutteRichard Rado

Alexander SchrijverGarrett BirkhoffHassler Whitney

Richard Dedekind

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F1/40 (pg.1/162)

Page 2: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Logistics Review

Cumulative Outstanding Reading

Read chapters 2 and 3 from Fujishige’s book.

Read chapter 1 from Fujishige’s book.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F2/40 (pg.2/162)

Page 3: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Logistics Review

Announcements, Assignments, and Reminders

Homework 3, available at our assignment dropbox(https://canvas.uw.edu/courses/1039754/assignments), due(electronically) Monday (5/2) at 11:55pm.

Homework 2, available at our assignment dropbox(https://canvas.uw.edu/courses/1039754/assignments), due(electronically) Monday (4/18) at 11:55pm.

Homework 1, available at our assignment dropbox(https://canvas.uw.edu/courses/1039754/assignments), due(electronically) Friday (4/8) at 11:55pm.

Weekly O�ce Hours: Mondays, 3:30-4:30, or by skype or googlehangout (set up meeting via our our discussion board (https://canvas.uw.edu/courses/1039754/discussion_topics)).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F3/40 (pg.3/162)

Page 4: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Logistics Review

Class Road Map - IT-IL1(3/28): Motivation, Applications, &Basic Definitions

L2(3/30): Machine Learning Apps(diversity, complexity, parameter, learningtarget, surrogate).

L3(4/4): Info theory exs, more apps,definitions, graph/combinatorial examples,matrix rank example, visualization

L4(4/6): Graph and CombinatorialExamples, matrix rank, Venn diagrams,examples of proofs of submodularity, someuseful properties

L5(4/11): Examples & Properties, OtherDefs., Independence

L6(4/13): Independence, Matroids,Matroid Examples, matroid rank issubmodular

L7(4/18): Matroid Rank, More onPartition Matroid, System of DistinctReps, Transversals, Transversal Matroid,

L8(4/20): Transversals, Matroid andrepresentation, Dual Matroids, Geometries

L9(4/25):

L10(4/27):

L11(5/2):

L12(5/4):

L13(5/9):

L14(5/11):

L15(5/16):

L16(5/18):

L17(5/23):

L18(5/25):

L19(6/1):

L20(6/6): Final Presentationsmaximization.

Finals Week: June 6th-10th, 2016.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F4/40 (pg.4/162)

Page 5: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Logistics Review

System of Distinct Representatives

Let (V,V) be a set system (i.e., V = (V

k

: i 2 I) where V

i

✓ V for alli), and I is an index set. Hence, |I| = |V|.A family (v

i

: i 2 I) with v

i

2 V is said to be a system of distinctrepresentatives of V if 9 a bijection ⇡ : I $ I such that v

i

2 V

⇡(i) andv

i

6= v

j

for all i 6= j.

In a system of distinct representatives, there is a requirement for therepresentatives to be distinct. We can re-state (and rename) this as a:

Definition 8.2.2 (transversal)

Given a set system (V,V) and index set I for V as defined above, a setT ✓ V is a transversal of V if there is a bijection ⇡ : T $ I such that

x 2 V

⇡(x) for all x 2 T (8.19)

Note that due to ⇡ : T $ I being a bijection, all of I and T are“covered” (so this makes things distinct automatically).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F5/40 (pg.5/162)

Page 6: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Logistics Review

When do transversals exist?As we saw, a transversal might not always exist. How to tell?Given a set system (V,V) with V = (V

i

: i 2 I), and V

i

✓ V for all i.Then, for any J ✓ I, let

V (J) = [j2JVj

(8.19)

so |V (J)| : 2I ! Z+ is the set cover func. (we know is submodular).We have

Theorem 8.2.2 (Hall’s theorem)

Given a set system (V,V), the family of subsets V = (V

i

: i 2 I) has a

transversal (v

i

: i 2 I) iff for all J ✓ I

|V (J)| � |J | (8.20)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F6/40 (pg.6/162)

Page 7: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Logistics Review

When do transversals exist?As we saw, a transversal might not always exist. How to tell?Given a set system (V,V) with V = (V

i

: i 2 I), and V

i

✓ V for all i.Then, for any J ✓ I, let

V (J) = [j2JVj

(8.19)

so |V (J)| : 2I ! Z+ is the set cover func. (we know is submodular).Hall’s theorem (8J ✓ I, |V (J)| � |J |) as a bipartite graph.

V I

1

2

3

4

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F6/40 (pg.7/162)

Page 8: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Logistics Review

When do transversals exist?As we saw, a transversal might not always exist. How to tell?Given a set system (V,V) with V = (V

i

: i 2 I), and V

i

✓ V for all i.Then, for any J ✓ I, let

V (J) = [j2JVj

(8.19)

so |V (J)| : 2I ! Z+ is the set cover func. (we know is submodular).Hall’s theorem (8J ✓ I, |V (J)| � |J |) as a bipartite graph.

V I

1

2

3

4

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F6/40 (pg.8/162)

Page 9: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Logistics Review

When do transversals exist?As we saw, a transversal might not always exist. How to tell?Given a set system (V,V) with V = (V

i

: i 2 I), and V

i

✓ V for all i.Then, for any J ✓ I, let

V (J) = [j2JVj

(8.19)

so |V (J)| : 2I ! Z+ is the set cover func. (we know is submodular).Moreover, we have

Theorem 8.2.3 (Rado’s theorem (1942))

If M = (V, r) is a matroid on V with rank function r, then the family of

subsets (V

i

: i 2 I) of V has a transversal (v

i

: i 2 I) that is independent in

M iff for all J ✓ I

r(V (J)) � |J | (8.21)

Note, a transversal T independent in M means that r(T ) = |T |.Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F6/40 (pg.9/162)

Page 10: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Logistics Review

Application’s of Hall’s theorem

Consider a set of jobs I and a set of applicants V to the jobs. If anapplicant v 2 V is qualified for job i 2 I, we add edge (v, i) to thebipartite graph G = (V, I, E).

We wish all jobs to be filled, and hence Hall’s condition(8J ✓ I, |V (J)| � |J |) is a necessary and su�cient condition for thisto be possible.

Note if |V | = |I|, then Hall’s theorem is the Marriage Theorem(Frobenious 1917), where an edge (v, i) in the graph indicatecompatibility between two individuals v 2 V and i 2 I coming fromtwo separate groups V and I.

If 8J ✓ I, |V (J)| � |J |, then all individuals in each group can bematched with a compatible mate.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F7/40 (pg.10/162)

Page 11: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Logistics Review

Application’s of Hall’s theorem

Consider a set of jobs I and a set of applicants V to the jobs. If anapplicant v 2 V is qualified for job i 2 I, we add edge (v, i) to thebipartite graph G = (V, I, E).

We wish all jobs to be filled, and hence Hall’s condition(8J ✓ I, |V (J)| � |J |) is a necessary and su�cient condition for thisto be possible.

Note if |V | = |I|, then Hall’s theorem is the Marriage Theorem(Frobenious 1917), where an edge (v, i) in the graph indicatecompatibility between two individuals v 2 V and i 2 I coming fromtwo separate groups V and I.

If 8J ✓ I, |V (J)| � |J |, then all individuals in each group can bematched with a compatible mate.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F7/40 (pg.11/162)

Page 12: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Logistics Review

Application’s of Hall’s theorem

Consider a set of jobs I and a set of applicants V to the jobs. If anapplicant v 2 V is qualified for job i 2 I, we add edge (v, i) to thebipartite graph G = (V, I, E).

We wish all jobs to be filled, and hence Hall’s condition(8J ✓ I, |V (J)| � |J |) is a necessary and su�cient condition for thisto be possible.

Note if |V | = |I|, then Hall’s theorem is the Marriage Theorem(Frobenious 1917), where an edge (v, i) in the graph indicatecompatibility between two individuals v 2 V and i 2 I coming fromtwo separate groups V and I.

If 8J ✓ I, |V (J)| � |J |, then all individuals in each group can bematched with a compatible mate.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F7/40 (pg.12/162)

Page 13: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Logistics Review

Application’s of Hall’s theorem

Consider a set of jobs I and a set of applicants V to the jobs. If anapplicant v 2 V is qualified for job i 2 I, we add edge (v, i) to thebipartite graph G = (V, I, E).

We wish all jobs to be filled, and hence Hall’s condition(8J ✓ I, |V (J)| � |J |) is a necessary and su�cient condition for thisto be possible.

Note if |V | = |I|, then Hall’s theorem is the Marriage Theorem(Frobenious 1917), where an edge (v, i) in the graph indicatecompatibility between two individuals v 2 V and i 2 I coming fromtwo separate groups V and I.

If 8J ✓ I, |V (J)| � |J |, then all individuals in each group can bematched with a compatible mate.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F7/40 (pg.13/162)

Page 14: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Logistics Review

More general conditions for existence of transversals

Theorem 8.2.2 (Polymatroid transversal theorem)

If V = (V

i

: i 2 I) is a finite family of non-empty subsets of V , and

f : 2

V ! Z+ is a non-negative, integral, monotone non-decreasing, and

submodular function, then V has a system of representatives (v

i

: i 2 I)

such that

f([i2J{vi}) � |J | for all J ✓ I (8.19)

if and only if

f(V (J)) � |J | for all J ✓ I (8.20)

Given Theorem ??, we immediately get Theorem 8.2.2 by takingf(S) = |S| for S ✓ V .We get Theorem ?? by taking f(S) = r(S) for S ✓ V , the rankfunction of the matroid.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F8/40 (pg.14/162)

Page 15: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversal Matroid

Transversals, themselves, define a matroid.

Theorem 8.3.1

If V is a family of finite subsets of a ground set V , then the collection of

partial transversals of V is the set of independent sets of a matroid

M = (V,V) on V .

This means that the transversals of V are the bases of matroid M .

Therefore, all maximal partial transversals of V have the samecardinality!

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F9/40 (pg.15/162)

Page 16: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversal Matroid

Transversals, themselves, define a matroid.

Theorem 8.3.1

If V is a family of finite subsets of a ground set V , then the collection of

partial transversals of V is the set of independent sets of a matroid

M = (V,V) on V .

This means that the transversals of V are the bases of matroid M .

Therefore, all maximal partial transversals of V have the samecardinality!

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F9/40 (pg.16/162)

Page 17: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversal Matroid

Transversals, themselves, define a matroid.

Theorem 8.3.1

If V is a family of finite subsets of a ground set V , then the collection of

partial transversals of V is the set of independent sets of a matroid

M = (V,V) on V .

This means that the transversals of V are the bases of matroid M .

Therefore, all maximal partial transversals of V have the samecardinality!

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F9/40 (pg.17/162)

Page 18: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversals and Bipartite Matchings

Transversals correspond exactly to matchings in bipartite graphs.

Given a set system (V,V), with V = (V

i

: i 2 I), we can define abipartite graph G = (V, I, E) associated with V that has edge set{(v, i) : v 2 V, i 2 I, v 2 V

i

}.A matching in this graph is a set of edges no two of which that have acommon endpoint.

In fact, we easily have:

Lemma 8.3.2

A subset T ✓ V is a partial transversal of V iff there is a matching in

(V, I, E) in which every edge has one endpoint in T (T matched into I).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F10/40 (pg.18/162)

Page 19: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversals and Bipartite Matchings

Transversals correspond exactly to matchings in bipartite graphs.Given a set system (V,V), with V = (V

i

: i 2 I), we can define abipartite graph G = (V, I, E) associated with V that has edge set{(v, i) : v 2 V, i 2 I, v 2 V

i

}.

A matching in this graph is a set of edges no two of which that have acommon endpoint.

In fact, we easily have:

Lemma 8.3.2

A subset T ✓ V is a partial transversal of V iff there is a matching in

(V, I, E) in which every edge has one endpoint in T (T matched into I).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F10/40 (pg.19/162)

w

Page 20: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversals and Bipartite Matchings

Transversals correspond exactly to matchings in bipartite graphs.Given a set system (V,V), with V = (V

i

: i 2 I), we can define abipartite graph G = (V, I, E) associated with V that has edge set{(v, i) : v 2 V, i 2 I, v 2 V

i

}.A matching in this graph is a set of edges no two of which that have acommon endpoint.

In fact, we easily have:

Lemma 8.3.2

A subset T ✓ V is a partial transversal of V iff there is a matching in

(V, I, E) in which every edge has one endpoint in T (T matched into I).

V I

1

2

3

4

V I

1

2

3

4

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F10/40 (pg.20/162)

#

Page 21: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversals and Bipartite Matchings

Transversals correspond exactly to matchings in bipartite graphs.Given a set system (V,V), with V = (V

i

: i 2 I), we can define abipartite graph G = (V, I, E) associated with V that has edge set{(v, i) : v 2 V, i 2 I, v 2 V

i

}.A matching in this graph is a set of edges no two of which that have acommon endpoint. In fact, we easily have:

Lemma 8.3.2

A subset T ✓ V is a partial transversal of V iff there is a matching in

(V, I, E) in which every edge has one endpoint in T (T matched into I).

V I

1

2

3

4

V I

1

2

3

4

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F10/40 (pg.21/162)

Page 22: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversals and Bipartite Matchings

Transversals correspond exactly to matchings in bipartite graphs.Given a set system (V,V), with V = (V

i

: i 2 I), we can define abipartite graph G = (V, I, E) associated with V that has edge set{(v, i) : v 2 V, i 2 I, v 2 V

i

}.A matching in this graph is a set of edges no two of which that have acommon endpoint. In fact, we easily have:

Lemma 8.3.2

A subset T ✓ V is a partial transversal of V iff there is a matching in

(V, I, E) in which every edge has one endpoint in T (T matched into I).

V I

1

2

3

4

V I

1

2

3

4

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F10/40 (pg.22/162)

0

Page 23: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Arbitrary Matchings and Matroids?

Are arbitrary matchings matroids?

Consider the following graph (left), and two max-matchings (two rightinstances)

{AC} is a maximum matching, as is {AD,BC}, but they are not thesame size.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F11/40 (pg.23/162)

Page 24: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Arbitrary Matchings and Matroids?

Are arbitrary matchings matroids?

Consider the following graph (left), and two max-matchings (two rightinstances)

A B

CD

A B

CD

A B

CD

{AC} is a maximum matching, as is {AD,BC}, but they are not thesame size.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F11/40 (pg.24/162)

1 2

Page 25: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Arbitrary Matchings and Matroids?

Are arbitrary matchings matroids?

Consider the following graph (left), and two max-matchings (two rightinstances)

A B

CD

A B

CD

A B

CD

{AC} is a maximum matching, as is {AD,BC}, but they are not thesame size.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F11/40 (pg.25/162)

• Th i , setot match .hr is graph G= ( V, E) ⇒ ( E,

'm ) is

a set system . II hold, IZ ( subelusive property hold

,

I } Aunt hold . Q : what is biggest subatomic .ms't . ( E

,'M

')satisfies Is ?

Page 26: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Partition Matroid, rank as matching

Example where ` = 5,(k1, k2, k3, k4, k5) =

(2, 2, 1, 1, 3).

I1

I2

I3

I4

I5

V1

V2

V3

V4

V5

V IRecall, � : 2

V ! R as the neighborfunction in a bipartite graph, theneighbors of X is defined as �(X) =

{v 2 V (G) \X : E(X, {v}) 6= ;}, andrecall that |�(X)| is submodular.

Here, for X ✓ V , we have �(X) =

{i 2 I : (v, i) 2 E(G) and v 2 X}.For such a constructed bipartite graph,the rank function of a partition matroidis r(X) =

P`

i=1min(|X \ V

i

|, ki

) = themaximum matching involving X.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F12/40 (pg.26/162)

Page 27: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Morphing Partition Matroid RankRecall the partition matroid rank function. Note, k

i

= |Ii

| in the bipartitegraph representation, and since a matroid, w.l.o.g., |V

i

| � k

i

(also, recall,V (J) = [

j2JVj

).

Start with partition matroid rank function in the subsequent equations.

r(A) =

X

i2{1,...,`}

min(|A \ V

i

|, ki

) (8.1)

=

`X

i=1

min(|A \ V (I

i

)|, |Ii

|) (8.2)

=

X

i2{1,...,`}

min

Ji2{;,Ii}

✓⇢|A \ V (I

i

)| if Ji

6= ;0 if J

i

= ;

�+ |I

i

\ Ji

|◆

(8.3)

=

X

i2{1,...,`}

min

Ji✓Ii

✓⇢|A \ V (I

i

)| if Ji

6= ;0 if J

i

= ;

�+ |I

i

\ Ji

|◆

(8.4)

=

X

i2{1,...,`}

min

Ji✓Ii

(|V (J

i

) \A|+ |Ii

\ Ji

|) (8.5)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F13/40 (pg.27/162)

Page 28: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Morphing Partition Matroid RankRecall the partition matroid rank function. Note, k

i

= |Ii

| in the bipartitegraph representation, and since a matroid, w.l.o.g., |V

i

| � k

i

(also, recall,V (J) = [

j2JVj

).Start with partition matroid rank function in the subsequent equations.

r(A) =

X

i2{1,...,`}

min(|A \ V

i

|, ki

) (8.1)

=

`X

i=1

min(|A \ V (I

i

)|, |Ii

|) (8.2)

=

X

i2{1,...,`}

min

Ji2{;,Ii}

✓⇢|A \ V (I

i

)| if Ji

6= ;0 if J

i

= ;

�+ |I

i

\ Ji

|◆

(8.3)

=

X

i2{1,...,`}

min

Ji✓Ii

✓⇢|A \ V (I

i

)| if Ji

6= ;0 if J

i

= ;

�+ |I

i

\ Ji

|◆

(8.4)

=

X

i2{1,...,`}

min

Ji✓Ii

(|V (J

i

) \A|+ |Ii

\ Ji

|) (8.5)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F13/40 (pg.28/162)

Page 29: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Morphing Partition Matroid RankRecall the partition matroid rank function. Note, k

i

= |Ii

| in the bipartitegraph representation, and since a matroid, w.l.o.g., |V

i

| � k

i

(also, recall,V (J) = [

j2JVj

).Start with partition matroid rank function in the subsequent equations.

r(A) =

X

i2{1,...,`}

min(|A \ V

i

|, ki

) (8.1)

=

`X

i=1

min(|A \ V (I

i

)|, |Ii

|) (8.2)

=

X

i2{1,...,`}

min

Ji2{;,Ii}

✓⇢|A \ V (I

i

)| if Ji

6= ;0 if J

i

= ;

�+ |I

i

\ Ji

|◆

(8.3)

=

X

i2{1,...,`}

min

Ji✓Ii

✓⇢|A \ V (I

i

)| if Ji

6= ;0 if J

i

= ;

�+ |I

i

\ Ji

|◆

(8.4)

=

X

i2{1,...,`}

min

Ji✓Ii

(|V (J

i

) \A|+ |Ii

\ Ji

|) (8.5)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F13/40 (pg.29/162)

Page 30: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Morphing Partition Matroid RankRecall the partition matroid rank function. Note, k

i

= |Ii

| in the bipartitegraph representation, and since a matroid, w.l.o.g., |V

i

| � k

i

(also, recall,V (J) = [

j2JVj

).Start with partition matroid rank function in the subsequent equations.

r(A) =

X

i2{1,...,`}

min(|A \ V

i

|, ki

) (8.1)

=

`X

i=1

min(|A \ V (I

i

)|, |Ii

|) (8.2)

=

X

i2{1,...,`}

min

Ji2{;,Ii}

✓⇢|A \ V (I

i

)| if Ji

6= ;0 if J

i

= ;

�+ |I

i

\ Ji

|◆

(8.3)

=

X

i2{1,...,`}

min

Ji✓Ii

✓⇢|A \ V (I

i

)| if Ji

6= ;0 if J

i

= ;

�+ |I

i

\ Ji

|◆

(8.4)

=

X

i2{1,...,`}

min

Ji✓Ii

(|V (J

i

) \A|+ |Ii

\ Ji

|) (8.5)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F13/40 (pg.30/162)

Page 31: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Morphing Partition Matroid RankRecall the partition matroid rank function. Note, k

i

= |Ii

| in the bipartitegraph representation, and since a matroid, w.l.o.g., |V

i

| � k

i

(also, recall,V (J) = [

j2JVj

).Start with partition matroid rank function in the subsequent equations.

r(A) =

X

i2{1,...,`}

min(|A \ V

i

|, ki

) (8.1)

=

`X

i=1

min(|A \ V (I

i

)|, |Ii

|) (8.2)

=

X

i2{1,...,`}

min

Ji2{;,Ii}

✓⇢|A \ V (I

i

)| if Ji

6= ;0 if J

i

= ;

�+ |I

i

\ Ji

|◆

(8.3)

=

X

i2{1,...,`}

min

Ji✓Ii

✓⇢|A \ V (I

i

)| if Ji

6= ;0 if J

i

= ;

�+ |I

i

\ Ji

|◆

(8.4)

=

X

i2{1,...,`}

min

Ji✓Ii

(|V (J

i

) \A|+ |Ii

\ Ji

|) (8.5)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F13/40 (pg.31/162)

Page 32: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Morphing Partition Matroid RankRecall the partition matroid rank function. Note, k

i

= |Ii

| in the bipartitegraph representation, and since a matroid, w.l.o.g., |V

i

| � k

i

(also, recall,V (J) = [

j2JVj

).Start with partition matroid rank function in the subsequent equations.

r(A) =

X

i2{1,...,`}

min(|A \ V

i

|, ki

) (8.1)

=

`X

i=1

min(|A \ V (I

i

)|, |Ii

|) (8.2)

=

X

i2{1,...,`}

min

Ji2{;,Ii}

✓⇢|A \ V (I

i

)| if Ji

6= ;0 if J

i

= ;

�+ |I

i

\ Ji

|◆

(8.3)

=

X

i2{1,...,`}

min

Ji✓Ii

✓⇢|A \ V (I

i

)| if Ji

6= ;0 if J

i

= ;

�+ |I

i

\ Ji

|◆

(8.4)

=

X

i2{1,...,`}

min

Ji✓Ii

(|V (J

i

) \A|+ |Ii

\ Ji

|) (8.5)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F13/40 (pg.32/162)

VHDEVCF)

Page 33: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

... Morphing Partition Matroid Rank

Continuing,

r(A) =

`X

i=1

min

Ji✓Ii

(|V (J

i

) \ V (I

i

) \A|� |Ii

\ J

i

|+ |Ii

|) (8.6)

= min

J✓I

`X

i=1

|V (J) \ V (I

i

) \A|� |Ii

\ J |+ |Ii

|!

(8.7)

= min

J✓I

(|V (J) \ V (I) \A|� |J |+ |I|) (8.8)

= min

J✓I

(|V (J) \A|� |J |+ |I|) (8.9)

In fact, this bottom (more general) expression is the expression for therank of a transversal matroid.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F14/40 (pg.33/162)

e.

Page 34: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

... Morphing Partition Matroid Rank

Continuing,

r(A) =

`X

i=1

min

Ji✓Ii

(|V (J

i

) \ V (I

i

) \A|� |Ii

\ J

i

|+ |Ii

|) (8.6)

= min

J✓I

`X

i=1

|V (J) \ V (I

i

) \A|� |Ii

\ J |+ |Ii

|!

(8.7)

= min

J✓I

(|V (J) \ V (I) \A|� |J |+ |I|) (8.8)

= min

J✓I

(|V (J) \A|� |J |+ |I|) (8.9)

In fact, this bottom (more general) expression is the expression for therank of a transversal matroid.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F14/40 (pg.34/162)

Page 35: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

... Morphing Partition Matroid Rank

Continuing,

r(A) =

`X

i=1

min

Ji✓Ii

(|V (J

i

) \ V (I

i

) \A|� |Ii

\ J

i

|+ |Ii

|) (8.6)

= min

J✓I

`X

i=1

|V (J) \ V (I

i

) \A|� |Ii

\ J |+ |Ii

|!

(8.7)

= min

J✓I

(|V (J) \ V (I) \A|� |J |+ |I|) (8.8)

= min

J✓I

(|V (J) \A|� |J |+ |I|) (8.9)

In fact, this bottom (more general) expression is the expression for therank of a transversal matroid.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F14/40 (pg.35/162)

Page 36: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

... Morphing Partition Matroid Rank

Continuing,

r(A) =

`X

i=1

min

Ji✓Ii

(|V (J

i

) \ V (I

i

) \A|� |Ii

\ J

i

|+ |Ii

|) (8.6)

= min

J✓I

`X

i=1

|V (J) \ V (I

i

) \A|� |Ii

\ J |+ |Ii

|!

(8.7)

= min

J✓I

(|V (J) \ V (I) \A|� |J |+ |I|) (8.8)

= min

J✓I

(|V (J) \A|� |J |+ |I|) (8.9)

In fact, this bottom (more general) expression is the expression for therank of a transversal matroid.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F14/40 (pg.36/162)

Page 37: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

... Morphing Partition Matroid Rank

Continuing,

r(A) =

`X

i=1

min

Ji✓Ii

(|V (J

i

) \ V (I

i

) \A|� |Ii

\ J

i

|+ |Ii

|) (8.6)

= min

J✓I

`X

i=1

|V (J) \ V (I

i

) \A|� |Ii

\ J |+ |Ii

|!

(8.7)

= min

J✓I

(|V (J) \ V (I) \A|� |J |+ |I|) (8.8)

= min

J✓I

(|V (J) \A|� |J |+ |I|) (8.9)

In fact, this bottom (more general) expression is the expression for therank of a transversal matroid.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F14/40 (pg.37/162)

W

Page 38: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Partial Transversals Are Independent Sets in a Matroid

In fact, we have

Theorem 8.3.3

Let (V,V) where V = (V1, V2, . . . , V`

) be a subset system. Let

I = {1, . . . , `}. Let I be the set of partial transversals of V. Then (V, I) isa matroid.

Proof.

We note that ; 2 I since the empty set is a transversal of the emptysubfamily of V, thus (I1’) holds.We already saw that if T is a partial transversal of V, and if T 0 ✓ T ,then T

0 is also a partial transversal. So (I2’) holds.

Suppose that T1 and T2 are partial transversals of V such that|T1| < |T2|. Exercise: show that (I3’) holds.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F15/40 (pg.38/162)

Page 39: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Partial Transversals Are Independent Sets in a Matroid

In fact, we have

Theorem 8.3.3

Let (V,V) where V = (V1, V2, . . . , V`

) be a subset system. Let

I = {1, . . . , `}. Let I be the set of partial transversals of V. Then (V, I) isa matroid.

Proof.

We note that ; 2 I since the empty set is a transversal of the emptysubfamily of V, thus (I1’) holds.

We already saw that if T is a partial transversal of V, and if T 0 ✓ T ,then T

0 is also a partial transversal. So (I2’) holds.

Suppose that T1 and T2 are partial transversals of V such that|T1| < |T2|. Exercise: show that (I3’) holds.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F15/40 (pg.39/162)

Page 40: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Partial Transversals Are Independent Sets in a Matroid

In fact, we have

Theorem 8.3.3

Let (V,V) where V = (V1, V2, . . . , V`

) be a subset system. Let

I = {1, . . . , `}. Let I be the set of partial transversals of V. Then (V, I) isa matroid.

Proof.

We note that ; 2 I since the empty set is a transversal of the emptysubfamily of V, thus (I1’) holds.We already saw that if T is a partial transversal of V, and if T 0 ✓ T ,then T

0 is also a partial transversal. So (I2’) holds.

Suppose that T1 and T2 are partial transversals of V such that|T1| < |T2|. Exercise: show that (I3’) holds.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F15/40 (pg.40/162)

Page 41: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Partial Transversals Are Independent Sets in a Matroid

In fact, we have

Theorem 8.3.3

Let (V,V) where V = (V1, V2, . . . , V`

) be a subset system. Let

I = {1, . . . , `}. Let I be the set of partial transversals of V. Then (V, I) isa matroid.

Proof.

We note that ; 2 I since the empty set is a transversal of the emptysubfamily of V, thus (I1’) holds.We already saw that if T is a partial transversal of V, and if T 0 ✓ T ,then T

0 is also a partial transversal. So (I2’) holds.

Suppose that T1 and T2 are partial transversals of V such that|T1| < |T2|. Exercise: show that (I3’) holds.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F15/40 (pg.41/162)

Page 42: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversal Matroid Rank

Transversal matroid has rank

r(A) = min

J✓I

(|V (J) \A|� |J |+ |I|) (8.10)

Therefore, this function is submodular.

Note that it is a minimum over a set of modular functions. Is this truein general?

Exercise:

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F16/40 (pg.42/162)

= Filament

myth Ivana ) -HI so

Page 43: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversal Matroid Rank

Transversal matroid has rank

r(A) = min

J✓I

(|V (J) \A|� |J |+ |I|) (8.10)

Therefore, this function is submodular.

Note that it is a minimum over a set of modular functions. Is this truein general? Exercise:

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F16/40 (pg.43/162)

Page 44: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Transversal Matroid Rank

Transversal matroid has rank

r(A) = min

J✓I

(|V (J) \A|� |J |+ |I|) (8.10)

Therefore, this function is submodular.

Note that it is a minimum over a set of modular functions. Is this truein general? Exercise:

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F16/40 (pg.44/162)

• whit an most gwud properties ot

a Srt of modular function ) ?

Page 45: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid loops

A circuit in a matroids is well defined, a subset A ✓ E is circuit if it isan inclusionwise minimally dependent set (i.e., if r(A) < |A| and forany a 2 A, r(A \ {a}) = |A|� 1).

There is no reason in a matroid such an A could not consist of a singleelement.

Such an {a} is called a loop.

In a matric (i.e., linear) matroid, the only such loop is the value 0, asall non-zero vectors have rank 1. The 0 can appear > 1 time withdi↵erent indices, as can a self loop in a graph appear on di↵erentnodes.

Note, we also say that two elements s, t are said to be parallel if {s, t}is a circuit.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F17/40 (pg.45/162)

Page 46: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid loops

A circuit in a matroids is well defined, a subset A ✓ E is circuit if it isan inclusionwise minimally dependent set (i.e., if r(A) < |A| and forany a 2 A, r(A \ {a}) = |A|� 1).

There is no reason in a matroid such an A could not consist of a singleelement.

Such an {a} is called a loop.

In a matric (i.e., linear) matroid, the only such loop is the value 0, asall non-zero vectors have rank 1. The 0 can appear > 1 time withdi↵erent indices, as can a self loop in a graph appear on di↵erentnodes.

Note, we also say that two elements s, t are said to be parallel if {s, t}is a circuit.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F17/40 (pg.46/162)

( 4£ ) C u,

't4 ( UE )

( E ,FD

Page 47: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid loops

A circuit in a matroids is well defined, a subset A ✓ E is circuit if it isan inclusionwise minimally dependent set (i.e., if r(A) < |A| and forany a 2 A, r(A \ {a}) = |A|� 1).

There is no reason in a matroid such an A could not consist of a singleelement.

Such an {a} is called a loop.

In a matric (i.e., linear) matroid, the only such loop is the value 0, asall non-zero vectors have rank 1. The 0 can appear > 1 time withdi↵erent indices, as can a self loop in a graph appear on di↵erentnodes.

Note, we also say that two elements s, t are said to be parallel if {s, t}is a circuit.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F17/40 (pg.47/162)

At Ei } rfi )=0=,

i =) { it -1

f. . QQ.fi#*¥AH

Page 48: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid loops

A circuit in a matroids is well defined, a subset A ✓ E is circuit if it isan inclusionwise minimally dependent set (i.e., if r(A) < |A| and forany a 2 A, r(A \ {a}) = |A|� 1).

There is no reason in a matroid such an A could not consist of a singleelement.

Such an {a} is called a loop.

In a matric (i.e., linear) matroid, the only such loop is the value 0, asall non-zero vectors have rank 1. The 0 can appear > 1 time withdi↵erent indices, as can a self loop in a graph appear on di↵erentnodes.

Note, we also say that two elements s, t are said to be parallel if {s, t}is a circuit.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F17/40 (pg.48/162)

.#o

Page 49: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid loops

A circuit in a matroids is well defined, a subset A ✓ E is circuit if it isan inclusionwise minimally dependent set (i.e., if r(A) < |A| and forany a 2 A, r(A \ {a}) = |A|� 1).

There is no reason in a matroid such an A could not consist of a singleelement.

Such an {a} is called a loop.

In a matric (i.e., linear) matroid, the only such loop is the value 0, asall non-zero vectors have rank 1. The 0 can appear > 1 time withdi↵erent indices, as can a self loop in a graph appear on di↵erentnodes.

Note, we also say that two elements s, t are said to be parallel if {s, t}is a circuit.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F17/40 (pg.49/162)

IN

Page 50: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Representable

Definition 8.4.1 (Matroid isomorphism)

Two matroids M1 and M2 respectively on ground sets V1 and V2 areisomorphic if there is a bijection ⇡ : V1 ! V2 which preserves independence(equivalently, rank, circuits, and so on).

Let F be any field (such as R, Q, or some finite field F, such as aGalois field GF(p) where p is prime (such as GF(2)), but not Z.Succinctly: A field is a set with +, ⇤, closure, associativity,commutativity, and additive and multiplictaive identities and inverses.

We can more generally define matroids on a field.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F18/40 (pg.50/162)

Page 51: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Representable

Definition 8.4.1 (Matroid isomorphism)

Two matroids M1 and M2 respectively on ground sets V1 and V2 areisomorphic if there is a bijection ⇡ : V1 ! V2 which preserves independence(equivalently, rank, circuits, and so on).

Let F be any field (such as R, Q, or some finite field F, such as aGalois field GF(p) where p is prime (such as GF(2)), but not Z.Succinctly: A field is a set with +, ⇤, closure, associativity,commutativity, and additive and multiplictaive identities and inverses.

We can more generally define matroids on a field.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F18/40 (pg.51/162)

;@

Page 52: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Representable

Definition 8.4.1 (Matroid isomorphism)

Two matroids M1 and M2 respectively on ground sets V1 and V2 areisomorphic if there is a bijection ⇡ : V1 ! V2 which preserves independence(equivalently, rank, circuits, and so on).

Let F be any field (such as R, Q, or some finite field F, such as aGalois field GF(p) where p is prime (such as GF(2)), but not Z.Succinctly: A field is a set with +, ⇤, closure, associativity,commutativity, and additive and multiplictaive identities and inverses.

We can more generally define matroids on a field.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F18/40 (pg.52/162)

Page 53: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Representable

Definition 8.4.1 (Matroid isomorphism)

Two matroids M1 and M2 respectively on ground sets V1 and V2 areisomorphic if there is a bijection ⇡ : V1 ! V2 which preserves independence(equivalently, rank, circuits, and so on).

Let F be any field (such as R, Q, or some finite field F, such as aGalois field GF(p) where p is prime (such as GF(2)), but not Z.Succinctly: A field is a set with +, ⇤, closure, associativity,commutativity, and additive and multiplictaive identities and inverses.

We can more generally define matroids on a field.

Definition 8.4.2 (linear matroids on a field)

Let X be an n⇥m matrix and E = {1, . . . ,m}, where Xij

2 F for somefield, and let I be the set of subsets of E such that the columns of X arelinearly independent over F.Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F18/40 (pg.53/162)

Page 54: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Representable

Definition 8.4.1 (Matroid isomorphism)

Two matroids M1 and M2 respectively on ground sets V1 and V2 areisomorphic if there is a bijection ⇡ : V1 ! V2 which preserves independence(equivalently, rank, circuits, and so on).

Let F be any field (such as R, Q, or some finite field F, such as aGalois field GF(p) where p is prime (such as GF(2)), but not Z.Succinctly: A field is a set with +, ⇤, closure, associativity,commutativity, and additive and multiplictaive identities and inverses.

We can more generally define matroids on a field.

Definition 8.4.3 (representable (as a linear matroid))

Any matroid isomorphic to a linear matroid on a field is called representableover F

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F18/40 (pg.54/162)

.

Page 55: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Representability of Transversal Matroids

Pi↵ and Welsh in 1970, and Adkin in 1972 proved an importanttheorem about representability of transversal matroids.

In particular:

Theorem 8.4.4

Transversal matroids are representable over all finite fields of su�ciently

large cardinality, and are representable over any infinite field.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F19/40 (pg.55/162)

Page 56: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Representability of Transversal Matroids

Pi↵ and Welsh in 1970, and Adkin in 1972 proved an importanttheorem about representability of transversal matroids.

In particular:

Theorem 8.4.4

Transversal matroids are representable over all finite fields of su�ciently

large cardinality, and are representable over any infinite field.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F19/40 (pg.56/162)

Page 57: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Converse: Representability of Transversal Matroids

The converse is not true, however.

Example 8.4.5

Let V = {1, 2, 3, 4, 5, 6} be a ground set and let M = (V, I) be a setsystem where I is all subsets of V of cardinality 2 except for the pairs{1, 2}, {3, 4}, {5, 6}.

It can be shown that this is a matroid and is representable.

However, this matroid is not isomorphic to any transversal matroid.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F20/40 (pg.57/162)

Page 58: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Converse: Representability of Transversal Matroids

The converse is not true, however.

Example 8.4.5

Let V = {1, 2, 3, 4, 5, 6} be a ground set and let M = (V, I) be a setsystem where I is all subsets of V of cardinality 2 except for the pairs{1, 2}, {3, 4}, {5, 6}.

It can be shown that this is a matroid and is representable.

However, this matroid is not isomorphic to any transversal matroid.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F20/40 (pg.58/162)

Page 59: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Converse: Representability of Transversal Matroids

The converse is not true, however.

Example 8.4.5

Let V = {1, 2, 3, 4, 5, 6} be a ground set and let M = (V, I) be a setsystem where I is all subsets of V of cardinality 2 except for the pairs{1, 2}, {3, 4}, {5, 6}.

It can be shown that this is a matroid and is representable.

However, this matroid is not isomorphic to any transversal matroid.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F20/40 (pg.59/162)

Page 60: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Review from Lecture 6

The next frame comes from lecture 6.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F21/40 (pg.60/162)

Page 61: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroids, other definitions using matroid rank r : 2V ! Z+

Definition 8.5.3 (closed/flat/subspace)

A subset A ✓ E is closed (equivalently, a flat or a subspace) of matroid M

if for all x 2 E \A, r(A [ {x}) = r(A) + 1.

Definition: A hyperplane is a flat of rank r(M)� 1.

Definition 8.5.4 (closure)

Given A ✓ E, the closure (or span) of A, is defined byspan(A) = {b 2 E : r(A [ {b}) = r(A)}.

Therefore, a closed set A has span(A) = A.

Definition 8.5.5 (circuit)

A subset A ✓ E is circuit or a cycle if it is an inclusionwise-minimaldependent set (i.e., if r(A) < |A| and for any a 2 A, r(A \ {a}) = |A|� 1).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F22/40 (pg.61/162)

Page 62: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Spanning Sets

We have the following definitions:

Definition 8.5.1 (spanning set of a set)

Given a matroid M = (V, I), and a set Y ✓ V , then any set X ✓ Y suchthat r(X) = r(Y ) is called a spanning set of Y .

Definition 8.5.2 (spanning set of a matroid)

Given a matroid M = (V, I), any set A ✓ V such that r(A) = r(V ) iscalled a spanning set of the matroid.

A base of a matroid is a minimal spanning set (and it is independent)but supersets of a base are also spanning.

V is always trivially spanning.

Consider the terminology: “spanning tree in a graph”, comes fromspanning in a matroid sense.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F23/40 (pg.62/162)

Page 63: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Spanning Sets

We have the following definitions:

Definition 8.5.1 (spanning set of a set)

Given a matroid M = (V, I), and a set Y ✓ V , then any set X ✓ Y suchthat r(X) = r(Y ) is called a spanning set of Y .

Definition 8.5.2 (spanning set of a matroid)

Given a matroid M = (V, I), any set A ✓ V such that r(A) = r(V ) iscalled a spanning set of the matroid.

A base of a matroid is a minimal spanning set (and it is independent)but supersets of a base are also spanning.

V is always trivially spanning.

Consider the terminology: “spanning tree in a graph”, comes fromspanning in a matroid sense.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F23/40 (pg.63/162)

VFV ,H7=r1✓)

Page 64: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Spanning Sets

We have the following definitions:

Definition 8.5.1 (spanning set of a set)

Given a matroid M = (V, I), and a set Y ✓ V , then any set X ✓ Y suchthat r(X) = r(Y ) is called a spanning set of Y .

Definition 8.5.2 (spanning set of a matroid)

Given a matroid M = (V, I), any set A ✓ V such that r(A) = r(V ) iscalled a spanning set of the matroid.

A base of a matroid is a minimal spanning set (and it is independent)but supersets of a base are also spanning.

V is always trivially spanning.

Consider the terminology: “spanning tree in a graph”, comes fromspanning in a matroid sense.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F23/40 (pg.64/162)

Page 65: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Spanning Sets

We have the following definitions:

Definition 8.5.1 (spanning set of a set)

Given a matroid M = (V, I), and a set Y ✓ V , then any set X ✓ Y suchthat r(X) = r(Y ) is called a spanning set of Y .

Definition 8.5.2 (spanning set of a matroid)

Given a matroid M = (V, I), any set A ✓ V such that r(A) = r(V ) iscalled a spanning set of the matroid.

A base of a matroid is a minimal spanning set (and it is independent)but supersets of a base are also spanning.

V is always trivially spanning.

Consider the terminology: “spanning tree in a graph”, comes fromspanning in a matroid sense.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F23/40 (pg.65/162)

insmf, €0.925maximally independent & minimally spanning .

Page 66: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Spanning Sets

We have the following definitions:

Definition 8.5.1 (spanning set of a set)

Given a matroid M = (V, I), and a set Y ✓ V , then any set X ✓ Y suchthat r(X) = r(Y ) is called a spanning set of Y .

Definition 8.5.2 (spanning set of a matroid)

Given a matroid M = (V, I), any set A ✓ V such that r(A) = r(V ) iscalled a spanning set of the matroid.

A base of a matroid is a minimal spanning set (and it is independent)but supersets of a base are also spanning.

V is always trivially spanning.

Consider the terminology: “spanning tree in a graph”, comes fromspanning in a matroid sense.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F23/40 (pg.66/162)

Page 67: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Spanning Sets

We have the following definitions:

Definition 8.5.1 (spanning set of a set)

Given a matroid M = (V, I), and a set Y ✓ V , then any set X ✓ Y suchthat r(X) = r(Y ) is called a spanning set of Y .

Definition 8.5.2 (spanning set of a matroid)

Given a matroid M = (V, I), any set A ✓ V such that r(A) = r(V ) iscalled a spanning set of the matroid.

A base of a matroid is a minimal spanning set (and it is independent)but supersets of a base are also spanning.

V is always trivially spanning.

Consider the terminology: “spanning tree in a graph”, comes fromspanning in a matroid sense.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F23/40 (pg.67/162)

Page 68: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual of a Matroid

Given a matroid M = (V, I), a dual matroid M

⇤= (V, I⇤

) can bedefined on the same ground set V , but using a very di↵erent set ofindependent sets I⇤.

We define the set of sets I⇤ for M⇤ as follows:

I⇤= {A ✓ V : V \A is a spanning set of M} (8.11)

= {V \ S : S ✓ V is a spanning set of M} (8.12)

i.e., I⇤ are complements of spanning sets of M .That is, a set A is independent in the dual matroid M

⇤ if removal of Afrom V does not decrease the rank in M :

I⇤= {A ✓ V : rank

M

(V \A) = rankM

(V )} (8.13)

In other words, a set A ✓ V is independent in the dual M⇤ (i.e.,A 2 I⇤) if its complement is spanning in M (residual V \A mustcontain a base in M).Dual of the dual: Note, we have that (M⇤

)

⇤= M .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F24/40 (pg.68/162)

Page 69: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual of a Matroid

Given a matroid M = (V, I), a dual matroid M

⇤= (V, I⇤

) can bedefined on the same ground set V , but using a very di↵erent set ofindependent sets I⇤.We define the set of sets I⇤ for M⇤ as follows:

I⇤= {A ✓ V : V \A is a spanning set of M} (8.11)

= {V \ S : S ✓ V is a spanning set of M} (8.12)

i.e., I⇤ are complements of spanning sets of M .

That is, a set A is independent in the dual matroid M

⇤ if removal of Afrom V does not decrease the rank in M :

I⇤= {A ✓ V : rank

M

(V \A) = rankM

(V )} (8.13)

In other words, a set A ✓ V is independent in the dual M⇤ (i.e.,A 2 I⇤) if its complement is spanning in M (residual V \A mustcontain a base in M).Dual of the dual: Note, we have that (M⇤

)

⇤= M .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F24/40 (pg.69/162)

Page 70: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual of a Matroid

Given a matroid M = (V, I), a dual matroid M

⇤= (V, I⇤

) can bedefined on the same ground set V , but using a very di↵erent set ofindependent sets I⇤.We define the set of sets I⇤ for M⇤ as follows:

I⇤= {A ✓ V : V \A is a spanning set of M} (8.11)

= {V \ S : S ✓ V is a spanning set of M} (8.12)

i.e., I⇤ are complements of spanning sets of M .That is, a set A is independent in the dual matroid M

⇤ if removal of Afrom V does not decrease the rank in M :

I⇤= {A ✓ V : rank

M

(V \A) = rankM

(V )} (8.13)

In other words, a set A ✓ V is independent in the dual M⇤ (i.e.,A 2 I⇤) if its complement is spanning in M (residual V \A mustcontain a base in M).Dual of the dual: Note, we have that (M⇤

)

⇤= M .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F24/40 (pg.70/162)

Page 71: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual of a Matroid

Given a matroid M = (V, I), a dual matroid M

⇤= (V, I⇤

) can bedefined on the same ground set V , but using a very di↵erent set ofindependent sets I⇤.We define the set of sets I⇤ for M⇤ as follows:

I⇤= {A ✓ V : V \A is a spanning set of M} (8.11)

= {V \ S : S ✓ V is a spanning set of M} (8.12)

i.e., I⇤ are complements of spanning sets of M .That is, a set A is independent in the dual matroid M

⇤ if removal of Afrom V does not decrease the rank in M :

I⇤= {A ✓ V : rank

M

(V \A) = rankM

(V )} (8.13)

In other words, a set A ✓ V is independent in the dual M⇤ (i.e.,A 2 I⇤) if its complement is spanning in M (residual V \A mustcontain a base in M).

Dual of the dual: Note, we have that (M⇤)

⇤= M .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F24/40 (pg.71/162)

Page 72: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual of a Matroid

Given a matroid M = (V, I), a dual matroid M

⇤= (V, I⇤

) can bedefined on the same ground set V , but using a very di↵erent set ofindependent sets I⇤.We define the set of sets I⇤ for M⇤ as follows:

I⇤= {A ✓ V : V \A is a spanning set of M} (8.11)

= {V \ S : S ✓ V is a spanning set of M} (8.12)

i.e., I⇤ are complements of spanning sets of M .That is, a set A is independent in the dual matroid M

⇤ if removal of Afrom V does not decrease the rank in M :

I⇤= {A ✓ V : rank

M

(V \A) = rankM

(V )} (8.13)

In other words, a set A ✓ V is independent in the dual M⇤ (i.e.,A 2 I⇤) if its complement is spanning in M (residual V \A mustcontain a base in M).Dual of the dual: Note, we have that (M⇤

)

⇤= M .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F24/40 (pg.72/162)

Page 73: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual of a Matroid: Bases

The smallest spanning sets are bases.

Hence, a base B of M (whereB = V \B⇤ is as small as possible while still spanning) is thecomplement of a base B

⇤ of M⇤ (where B

⇤= V \B is as large as

possible while still being independent).

In fact, we have that

Theorem 8.5.3 (Dual matroid bases)

Let M = (V, I) be a matroid and B(M) be the set of bases of M . Then

define

B⇤(M) = {V \B : B 2 B(M)}. (8.14)

Then B⇤(M) is the set of basis of M

⇤(that is, B⇤

(M) = B(M⇤).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F25/40 (pg.73/162)

Page 74: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual of a Matroid: Bases

The smallest spanning sets are bases. Hence, a base B of M (whereB = V \B⇤ is as small as possible while still spanning) is thecomplement of a base B

⇤ of M⇤ (where B

⇤= V \B is as large as

possible while still being independent).

In fact, we have that

Theorem 8.5.3 (Dual matroid bases)

Let M = (V, I) be a matroid and B(M) be the set of bases of M . Then

define

B⇤(M) = {V \B : B 2 B(M)}. (8.14)

Then B⇤(M) is the set of basis of M

⇤(that is, B⇤

(M) = B(M⇤).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F25/40 (pg.74/162)

±

Page 75: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual of a Matroid: Bases

The smallest spanning sets are bases. Hence, a base B of M (whereB = V \B⇤ is as small as possible while still spanning) is thecomplement of a base B

⇤ of M⇤ (where B

⇤= V \B is as large as

possible while still being independent).

In fact, we have that

Theorem 8.5.3 (Dual matroid bases)

Let M = (V, I) be a matroid and B(M) be the set of bases of M . Then

define

B⇤(M) = {V \B : B 2 B(M)}. (8.14)

Then B⇤(M) is the set of basis of M

⇤(that is, B⇤

(M) = B(M⇤).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F25/40 (pg.75/162)

Page 76: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual of a Matroid: Bases

The smallest spanning sets are bases. Hence, a base B of M (whereB = V \B⇤ is as small as possible while still spanning) is thecomplement of a base B

⇤ of M⇤ (where B

⇤= V \B is as large as

possible while still being independent).

In fact, we have that

Theorem 8.5.3 (Dual matroid bases)

Let M = (V, I) be a matroid and B(M) be the set of bases of M . Then

define

B⇤(M) = {V \B : B 2 B(M)}. (8.14)

Then B⇤(M) is the set of basis of M

⇤(that is, B⇤

(M) = B(M⇤).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F25/40 (pg.76/162)

Page 77: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B⇤(M), the bases of M⇤, are called cobases of M .

The circuits of M⇤ are called cocircuits of M .

The hyperplanes of M⇤ are called cohyperplanes of M .

The independent sets of M⇤ are called coindependent sets of M .

The spanning sets of M⇤ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ✓ V . Then

1X is independent in M iff V \X is cospanning in M (spanning in M

⇤).

2X is spanning in M iff V \X is coindependent in M (independent in

M

⇤).

3X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M

⇤).

4X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M

⇤).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.77/162)

Page 78: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B⇤(M), the bases of M⇤, are called cobases of M .

The circuits of M⇤ are called cocircuits of M .

The hyperplanes of M⇤ are called cohyperplanes of M .

The independent sets of M⇤ are called coindependent sets of M .

The spanning sets of M⇤ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ✓ V . Then

1X is independent in M iff V \X is cospanning in M (spanning in M

⇤).

2X is spanning in M iff V \X is coindependent in M (independent in

M

⇤).

3X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M

⇤).

4X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M

⇤).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.78/162)

Page 79: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B⇤(M), the bases of M⇤, are called cobases of M .

The circuits of M⇤ are called cocircuits of M .

The hyperplanes of M⇤ are called cohyperplanes of M .

The independent sets of M⇤ are called coindependent sets of M .

The spanning sets of M⇤ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ✓ V . Then

1X is independent in M iff V \X is cospanning in M (spanning in M

⇤).

2X is spanning in M iff V \X is coindependent in M (independent in

M

⇤).

3X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M

⇤).

4X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M

⇤).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.79/162)

Page 80: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B⇤(M), the bases of M⇤, are called cobases of M .

The circuits of M⇤ are called cocircuits of M .

The hyperplanes of M⇤ are called cohyperplanes of M .

The independent sets of M⇤ are called coindependent sets of M .

The spanning sets of M⇤ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ✓ V . Then

1X is independent in M iff V \X is cospanning in M (spanning in M

⇤).

2X is spanning in M iff V \X is coindependent in M (independent in

M

⇤).

3X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M

⇤).

4X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M

⇤).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.80/162)

Page 81: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B⇤(M), the bases of M⇤, are called cobases of M .

The circuits of M⇤ are called cocircuits of M .

The hyperplanes of M⇤ are called cohyperplanes of M .

The independent sets of M⇤ are called coindependent sets of M .

The spanning sets of M⇤ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ✓ V . Then

1X is independent in M iff V \X is cospanning in M (spanning in M

⇤).

2X is spanning in M iff V \X is coindependent in M (independent in

M

⇤).

3X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M

⇤).

4X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M

⇤).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.81/162)

Page 82: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B⇤(M), the bases of M⇤, are called cobases of M .

The circuits of M⇤ are called cocircuits of M .

The hyperplanes of M⇤ are called cohyperplanes of M .

The independent sets of M⇤ are called coindependent sets of M .

The spanning sets of M⇤ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ✓ V . Then

1X is independent in M iff V \X is cospanning in M (spanning in M

⇤).

2X is spanning in M iff V \X is coindependent in M (independent in

M

⇤).

3X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M

⇤).

4X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M

⇤).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.82/162)

Page 83: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B⇤(M), the bases of M⇤, are called cobases of M .

The circuits of M⇤ are called cocircuits of M .

The hyperplanes of M⇤ are called cohyperplanes of M .

The independent sets of M⇤ are called coindependent sets of M .

The spanning sets of M⇤ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ✓ V . Then

1X is independent in M iff V \X is cospanning in M (spanning in M

⇤).

2X is spanning in M iff V \X is coindependent in M (independent in

M

⇤).

3X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M

⇤).

4X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M

⇤).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.83/162)

Page 84: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B⇤(M), the bases of M⇤, are called cobases of M .

The circuits of M⇤ are called cocircuits of M .

The hyperplanes of M⇤ are called cohyperplanes of M .

The independent sets of M⇤ are called coindependent sets of M .

The spanning sets of M⇤ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ✓ V . Then

1X is independent in M iff V \X is cospanning in M (spanning in M

⇤).

2X is spanning in M iff V \X is coindependent in M (independent in

M

⇤).

3X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M

⇤).

4X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M

⇤).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.84/162)

Page 85: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B⇤(M), the bases of M⇤, are called cobases of M .

The circuits of M⇤ are called cocircuits of M .

The hyperplanes of M⇤ are called cohyperplanes of M .

The independent sets of M⇤ are called coindependent sets of M .

The spanning sets of M⇤ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ✓ V . Then

1X is independent in M iff V \X is cospanning in M (spanning in M

⇤).

2X is spanning in M iff V \X is coindependent in M (independent in

M

⇤).

3X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M

⇤).

4X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M

⇤).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.85/162)

Page 86: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

An exercise in duality Terminology

B⇤(M), the bases of M⇤, are called cobases of M .

The circuits of M⇤ are called cocircuits of M .

The hyperplanes of M⇤ are called cohyperplanes of M .

The independent sets of M⇤ are called coindependent sets of M .

The spanning sets of M⇤ are called cospanning sets of M .

Proposition 8.5.4 (from Oxley 2011)

Let M = (V, I) be a matroid, and let X ✓ V . Then

1X is independent in M iff V \X is cospanning in M (spanning in M

⇤).

2X is spanning in M iff V \X is coindependent in M (independent in

M

⇤).

3X is a hyperplane in M iff V \X is a cocircuit in M (circuit in M

⇤).

4X is a circuit in M iff V \X is a cohyperplane in M (hyperplane in M

⇤).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F26/40 (pg.86/162)

Page 87: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example duality: graphic matroid

Using a graphic/cycle matroid, we can already see how dual matroidconcepts demonstrates the extraordinary flexibility and power that amatroid can have.

Recall, in cycle matroid, a spanning set of G is any set of edges that areincident to all nodes (i.e., any superset of a spanning forest), a minimalspanning set is a spanning tree (or forest), and a circuit has a nice visualinterpretation (a cycle in the graph).A cut in a graph G is a set of edges, the removal of which increases thenumber of connected components. I.e., X ✓ E(G) is a cut in G ifk(G) < k(G \X).A minimal cut in G is a cut X ✓ E(G) such that X \ {x} is not a cutfor any x 2 X.A cocycle (cocircuit) in a graphic matroid is a minimal graph cut.A mincut is a circuit in the dual “cocycle” (or “cut”) matroid.All dependent sets in a cocycle matroid are cuts (i.e., a dependent set isa minimal cut or contains one).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F27/40 (pg.87/162)

Page 88: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example duality: graphic matroid

Using a graphic/cycle matroid, we can already see how dual matroidconcepts demonstrates the extraordinary flexibility and power that amatroid can have.Recall, in cycle matroid, a spanning set of G is any set of edges that areincident to all nodes (i.e., any superset of a spanning forest), a minimalspanning set is a spanning tree (or forest), and a circuit has a nice visualinterpretation (a cycle in the graph).

A cut in a graph G is a set of edges, the removal of which increases thenumber of connected components. I.e., X ✓ E(G) is a cut in G ifk(G) < k(G \X).A minimal cut in G is a cut X ✓ E(G) such that X \ {x} is not a cutfor any x 2 X.A cocycle (cocircuit) in a graphic matroid is a minimal graph cut.A mincut is a circuit in the dual “cocycle” (or “cut”) matroid.All dependent sets in a cocycle matroid are cuts (i.e., a dependent set isa minimal cut or contains one).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F27/40 (pg.88/162)

*-

I

Page 89: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example duality: graphic matroid

Using a graphic/cycle matroid, we can already see how dual matroidconcepts demonstrates the extraordinary flexibility and power that amatroid can have.Recall, in cycle matroid, a spanning set of G is any set of edges that areincident to all nodes (i.e., any superset of a spanning forest), a minimalspanning set is a spanning tree (or forest), and a circuit has a nice visualinterpretation (a cycle in the graph).A cut in a graph G is a set of edges, the removal of which increases thenumber of connected components. I.e., X ✓ E(G) is a cut in G ifk(G) < k(G \X).

A minimal cut in G is a cut X ✓ E(G) such that X \ {x} is not a cutfor any x 2 X.A cocycle (cocircuit) in a graphic matroid is a minimal graph cut.A mincut is a circuit in the dual “cocycle” (or “cut”) matroid.All dependent sets in a cocycle matroid are cuts (i.e., a dependent set isa minimal cut or contains one).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F27/40 (pg.89/162)

. *¥¥:*¥¥

Page 90: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example duality: graphic matroid

Using a graphic/cycle matroid, we can already see how dual matroidconcepts demonstrates the extraordinary flexibility and power that amatroid can have.Recall, in cycle matroid, a spanning set of G is any set of edges that areincident to all nodes (i.e., any superset of a spanning forest), a minimalspanning set is a spanning tree (or forest), and a circuit has a nice visualinterpretation (a cycle in the graph).A cut in a graph G is a set of edges, the removal of which increases thenumber of connected components. I.e., X ✓ E(G) is a cut in G ifk(G) < k(G \X).A minimal cut in G is a cut X ✓ E(G) such that X \ {x} is not a cutfor any x 2 X.

A cocycle (cocircuit) in a graphic matroid is a minimal graph cut.A mincut is a circuit in the dual “cocycle” (or “cut”) matroid.All dependent sets in a cocycle matroid are cuts (i.e., a dependent set isa minimal cut or contains one).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F27/40 (pg.90/162)

Page 91: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example duality: graphic matroid

Using a graphic/cycle matroid, we can already see how dual matroidconcepts demonstrates the extraordinary flexibility and power that amatroid can have.Recall, in cycle matroid, a spanning set of G is any set of edges that areincident to all nodes (i.e., any superset of a spanning forest), a minimalspanning set is a spanning tree (or forest), and a circuit has a nice visualinterpretation (a cycle in the graph).A cut in a graph G is a set of edges, the removal of which increases thenumber of connected components. I.e., X ✓ E(G) is a cut in G ifk(G) < k(G \X).A minimal cut in G is a cut X ✓ E(G) such that X \ {x} is not a cutfor any x 2 X.A cocycle (cocircuit) in a graphic matroid is a minimal graph cut.

A mincut is a circuit in the dual “cocycle” (or “cut”) matroid.All dependent sets in a cocycle matroid are cuts (i.e., a dependent set isa minimal cut or contains one).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F27/40 (pg.91/162)

Page 92: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example duality: graphic matroid

Using a graphic/cycle matroid, we can already see how dual matroidconcepts demonstrates the extraordinary flexibility and power that amatroid can have.Recall, in cycle matroid, a spanning set of G is any set of edges that areincident to all nodes (i.e., any superset of a spanning forest), a minimalspanning set is a spanning tree (or forest), and a circuit has a nice visualinterpretation (a cycle in the graph).A cut in a graph G is a set of edges, the removal of which increases thenumber of connected components. I.e., X ✓ E(G) is a cut in G ifk(G) < k(G \X).A minimal cut in G is a cut X ✓ E(G) such that X \ {x} is not a cutfor any x 2 X.A cocycle (cocircuit) in a graphic matroid is a minimal graph cut.A mincut is a circuit in the dual “cocycle” (or “cut”) matroid.

All dependent sets in a cocycle matroid are cuts (i.e., a dependent set isa minimal cut or contains one).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F27/40 (pg.92/162)

Page 93: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example duality: graphic matroid

Using a graphic/cycle matroid, we can already see how dual matroidconcepts demonstrates the extraordinary flexibility and power that amatroid can have.Recall, in cycle matroid, a spanning set of G is any set of edges that areincident to all nodes (i.e., any superset of a spanning forest), a minimalspanning set is a spanning tree (or forest), and a circuit has a nice visualinterpretation (a cycle in the graph).A cut in a graph G is a set of edges, the removal of which increases thenumber of connected components. I.e., X ✓ E(G) is a cut in G ifk(G) < k(G \X).A minimal cut in G is a cut X ✓ E(G) such that X \ {x} is not a cutfor any x 2 X.A cocycle (cocircuit) in a graphic matroid is a minimal graph cut.A mincut is a circuit in the dual “cocycle” (or “cut”) matroid.All dependent sets in a cocycle matroid are cuts (i.e., a dependent set isa minimal cut or contains one).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F27/40 (pg.93/162)

Page 94: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example: cocycle matroid (sometimes “cut matroid”)

The dual of the cycle matroid is called the cocycle matroid. Recall,I⇤

= {A ✓ V : V \A is a spanning set of M}

It consists of all sets of edges the complement of which contains aspanning tree — i.e., an independent set can’t consist of edges that, ifremoved, would render the graph non-spanning.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F28/40 (pg.94/162)

Page 95: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example: cocycle matroid (sometimes “cut matroid”)

The dual of the cycle matroid is called the cocycle matroid. Recall,I⇤

= {A ✓ V : V \A is a spanning set of M}It consists of all sets of edges the complement of which contains aspanning tree — i.e., an independent set can’t consist of edges that, ifremoved, would render the graph non-spanning.

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

A graph G

Minimally spanning in M (and thusa base (maximally independent) in M)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F28/40 (pg.95/162)

/ '

,-

'

T

Page 96: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example: cocycle matroid (sometimes “cut matroid”)

The dual of the cycle matroid is called the cocycle matroid. Recall,I⇤

= {A ✓ V : V \A is a spanning set of M}It consists of all sets of edges the complement of which contains aspanning tree — i.e., an independent set can’t consist of edges that, ifremoved, would render the graph non-spanning.

Minimally spanning in M (and thusa base (maximally independent) in M)

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Maximally independent in M* (thusa base, minimally spanning, in M*)

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F28/40 (pg.96/162)

Page 97: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example: cocycle matroid (sometimes “cut matroid”)

The dual of the cycle matroid is called the cocycle matroid. Recall,I⇤

= {A ✓ V : V \A is a spanning set of M}It consists of all sets of edges the complement of which contains aspanning tree — i.e., an independent set can’t consist of edges that, ifremoved, would render the graph non-spanning.

Spanning in M, but not a base, andnot independent (has cycles)

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Minimally spanning in M (and thusa base (maximally independent) in M)

Maximally independent in M* (thusa base, minimally spanning, in M*)

Independent in M* (does not contain a cut)

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F28/40 (pg.97/162)

Page 98: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example: cocycle matroid (sometimes “cut matroid”)

The dual of the cycle matroid is called the cocycle matroid. Recall,I⇤

= {A ✓ V : V \A is a spanning set of M}It consists of all sets of edges the complement of which contains aspanning tree — i.e., an independent set can’t consist of edges that, ifremoved, would render the graph non-spanning.

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Independent but not spanning in M, and not closed in M.

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Dependent in M* (contains a cocycle, is a nonminimal cut)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F28/40 (pg.98/162)

Page 99: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example: cocycle matroid (sometimes “cut matroid”)

The dual of the cycle matroid is called the cocycle matroid. Recall,I⇤

= {A ✓ V : V \A is a spanning set of M}It consists of all sets of edges the complement of which contains aspanning tree — i.e., an independent set can’t consist of edges that, ifremoved, would render the graph non-spanning.

Spanning in M, but not a base, andnot independent (has cycles)

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Independent in M* (does not contain a cut)

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F28/40 (pg.99/162)

Page 100: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example: cocycle matroid (sometimes “cut matroid”)

The dual of the cycle matroid is called the cocycle matroid. Recall,I⇤

= {A ✓ V : V \A is a spanning set of M}It consists of all sets of edges the complement of which contains aspanning tree — i.e., an independent set can’t consist of edges that, ifremoved, would render the graph non-spanning.

Independent but not spanning in M, and not closed in M.

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Dependent in M* (contains a cocycle, is a nonminimal cut)

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F28/40 (pg.100/162)

Page 101: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example: cocycle matroid (sometimes “cut matroid”)

The dual of the cycle matroid is called the cocycle matroid. Recall,I⇤

= {A ✓ V : V \A is a spanning set of M}It consists of all sets of edges the complement of which contains aspanning tree — i.e., an independent set can’t consist of edges that, ifremoved, would render the graph non-spanning.

A hyperplane in M, dependent but not spanning in M

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

A cycle in M* (minimally dependentin M*, a cocycle, or a minimal cut)

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F28/40 (pg.101/162)

Page 102: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Example: cocycle matroid (sometimes “cut matroid”)

The dual of the cycle matroid is called the cocycle matroid. Recall,I⇤

= {A ✓ V : V \A is a spanning set of M}It consists of all sets of edges the complement of which contains aspanning tree — i.e., an independent set can’t consist of edges that, ifremoved, would render the graph non-spanning.

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

A hyperplane in M, dependentbut not spanning in M

2

1

3

4

7

6

5

81

2

3

4

6

7

8

5

912

10

11

A cycle in M* (minimally dependentin M*, a cocycle, or a minimal cut)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F28/40 (pg.102/162)

Page 103: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

The dual of a matroid is (indeed) a matroid

Theorem 8.5.5

Given matroid M = (V, I), let M⇤= (V, I⇤

) be as previously defined.

Then M

⇤is a matroid.

Proof.

Clearly ; 2 I⇤, so (I1’) holds.

Also, if I ✓ J 2 I⇤, then clearly also I 2 I⇤ since if V \ J is spanningin M , so must V \ I. Therefore, (I2’) holds.

. . .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F29/40 (pg.103/162)

Page 104: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

The dual of a matroid is (indeed) a matroid

Theorem 8.5.5

Given matroid M = (V, I), let M⇤= (V, I⇤

) be as previously defined.

Then M

⇤is a matroid.

Proof.

Clearly ; 2 I⇤, so (I1’) holds.

Also, if I ✓ J 2 I⇤, then clearly also I 2 I⇤ since if V \ J is spanningin M , so must V \ I. Therefore, (I2’) holds.

. . .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F29/40 (pg.104/162)

:E VII

or -

Page 105: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

The dual of a matroid is (indeed) a matroid

Theorem 8.5.5

Given matroid M = (V, I), let M⇤= (V, I⇤

) be as previously defined.

Then M

⇤is a matroid.

Proof.

Consider I, J 2 I⇤ with |I| < |J |. We need to show that there is somemember v 2 J \ I such that I + v is independent in M

⇤, which meansthat V \ (I + v) = (V \ I) \ v is still spanning in M . That is, removingv from V \ I doesn’t make (V \ I) \ v not spanning in M .

Since V \ J is spanning in M , V \ J contains some base (sayB ✓ V \ J) of M . Also, V \ I contains a base of M , say B

0 ✓ V \ I.Since B \ I ✓ V \ I, and B \ I is independent in M , we can choosethe base B

0 of M s.t. B \ I ✓ B

0 ✓ V \ I.Since B and J are disjoint, we have both: 1) B \ I and J \ I aredisjoint; and 2) B \ I ✓ I \ J . Also note, B0 and I are disjoint.

. . .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F29/40 (pg.105/162)

VII z vlcttr )

Page 106: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

The dual of a matroid is (indeed) a matroid

Theorem 8.5.5

Given matroid M = (V, I), let M⇤= (V, I⇤

) be as previously defined.

Then M

⇤is a matroid.

Proof.

Consider I, J 2 I⇤ with |I| < |J |. We need to show that there is somemember v 2 J \ I such that I + v is independent in M

⇤, which meansthat V \ (I + v) = (V \ I) \ v is still spanning in M . That is, removingv from V \ I doesn’t make (V \ I) \ v not spanning in M .

Since V \ J is spanning in M , V \ J contains some base (sayB ✓ V \ J) of M . Also, V \ I contains a base of M , say B

0 ✓ V \ I.

Since B \ I ✓ V \ I, and B \ I is independent in M , we can choosethe base B

0 of M s.t. B \ I ✓ B

0 ✓ V \ I.Since B and J are disjoint, we have both: 1) B \ I and J \ I aredisjoint; and 2) B \ I ✓ I \ J . Also note, B0 and I are disjoint.

. . .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F29/40 (pg.106/162)

- -

Page 107: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

The dual of a matroid is (indeed) a matroid

Theorem 8.5.5

Given matroid M = (V, I), let M⇤= (V, I⇤

) be as previously defined.

Then M

⇤is a matroid.

Proof.

Consider I, J 2 I⇤ with |I| < |J |. We need to show that there is somemember v 2 J \ I such that I + v is independent in M

⇤, which meansthat V \ (I + v) = (V \ I) \ v is still spanning in M . That is, removingv from V \ I doesn’t make (V \ I) \ v not spanning in M .

Since V \ J is spanning in M , V \ J contains some base (sayB ✓ V \ J) of M . Also, V \ I contains a base of M , say B

0 ✓ V \ I.Since B \ I ✓ V \ I, and B \ I is independent in M , we can choosethe base B

0 of M s.t. B \ I ✓ B

0 ✓ V \ I.

Since B and J are disjoint, we have both: 1) B \ I and J \ I aredisjoint; and 2) B \ I ✓ I \ J . Also note, B0 and I are disjoint.

. . .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F29/40 (pg.107/162)

Page 108: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

The dual of a matroid is (indeed) a matroid

Theorem 8.5.5

Given matroid M = (V, I), let M⇤= (V, I⇤

) be as previously defined.

Then M

⇤is a matroid.

Proof.

Consider I, J 2 I⇤ with |I| < |J |. We need to show that there is somemember v 2 J \ I such that I + v is independent in M

⇤, which meansthat V \ (I + v) = (V \ I) \ v is still spanning in M . That is, removingv from V \ I doesn’t make (V \ I) \ v not spanning in M .

Since V \ J is spanning in M , V \ J contains some base (sayB ✓ V \ J) of M . Also, V \ I contains a base of M , say B

0 ✓ V \ I.Since B \ I ✓ V \ I, and B \ I is independent in M , we can choosethe base B

0 of M s.t. B \ I ✓ B

0 ✓ V \ I.Since B and J are disjoint, we have both: 1) B \ I and J \ I aredisjoint; and 2) B \ I ✓ I \ J . Also note, B0 and I are disjoint.

. . .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F29/40 (pg.108/162)

B €€€J

Page 109: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

The dual of a matroid is (indeed) a matroid

Theorem 8.5.5

Given matroid M = (V, I), let M⇤= (V, I⇤

) be as previously defined.

Then M

⇤is a matroid.

Proof.

Now J \ I 6✓ B

0, since otherwise (i.e., assuming J \ I ✓ B

0):

|B| = |B \ I|+ |B \ I| (8.15)

|I \ J |+ |B \ I| (8.16)

< |J \ I|+ |B \ I| |B0| (8.17)

which is a contradiction. The last inequality on the right follows since

J \ I ✓ B0(by assumption) and B \ I ✓ B0

implies that (J \ I)[ (B \ I) ✓ B0, but

since J and B are disjoint, we have that |J \ I|+ |B \ I| |B0|.

Therefore, J \ I 6✓ B

0, and there is a v 2 J \ I s.t. v /2 B

0.

So B

0 is disjoint with I [ {v}, means B0 ✓ V \ (I [ {v}), orV \ (I [ {v}) is spanning in M , and therefore I [ {v} 2 I⇤.

. . .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F29/40 (pg.109/162)

Page 110: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

The dual of a matroid is (indeed) a matroid

Theorem 8.5.5

Given matroid M = (V, I), let M⇤= (V, I⇤

) be as previously defined.

Then M

⇤is a matroid.

Proof.

Now J \ I 6✓ B

0, since otherwise (i.e., assuming J \ I ✓ B

0):

|B| = |B \ I|+ |B \ I| (8.15)

|I \ J |+ |B \ I| (8.16)

< |J \ I|+ |B \ I| |B0| (8.17)

which is a contradiction.

Therefore, J \ I 6✓ B

0, and there is a v 2 J \ I s.t. v /2 B

0.

So B

0 is disjoint with I [ {v}, means B0 ✓ V \ (I [ {v}), orV \ (I [ {v}) is spanning in M , and therefore I [ {v} 2 I⇤.

. . .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F29/40 (pg.110/162)

Page 111: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

The dual of a matroid is (indeed) a matroid

Theorem 8.5.5

Given matroid M = (V, I), let M⇤= (V, I⇤

) be as previously defined.

Then M

⇤is a matroid.

Proof.

Now J \ I 6✓ B

0, since otherwise (i.e., assuming J \ I ✓ B

0):

|B| = |B \ I|+ |B \ I| (8.15)

|I \ J |+ |B \ I| (8.16)

< |J \ I|+ |B \ I| |B0| (8.17)

which is a contradiction.

Therefore, J \ I 6✓ B

0, and there is a v 2 J \ I s.t. v /2 B

0.

So B

0 is disjoint with I [ {v}, means B0 ✓ V \ (I [ {v}), orV \ (I [ {v}) is spanning in M , and therefore I [ {v} 2 I⇤.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F29/40 (pg.111/162)

Page 112: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid Duals and Representability

Theorem 8.5.6

Let M be an F-representable matroid (i.e., one that can be represented by

a finite sized matrix over field F). Then M

⇤is also F-representable.

Hence, for matroids as general as matric matroids, duality does not extendthe space of matroids that can be used.

Theorem 8.5.7

Let M be a graphic matroid (i.e., one that can be represented by a graph

G = (V,E)). Then M

⇤is not necessarily also graphic.

Hence, for graphic matroids, duality can increase the space and power ofmatroids, and since they are based on a graph, they are relatively easy touse: 1) all cuts are dependent sets; 2) minimal cuts are cycles; 3) bases areone edge less than minimal cuts; and 4) independent sets are edges that arenot cuts.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F30/40 (pg.112/162)

Page 113: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid Duals and Representability

Theorem 8.5.6

Let M be an F-representable matroid (i.e., one that can be represented by

a finite sized matrix over field F). Then M

⇤is also F-representable.

Hence, for matroids as general as matric matroids, duality does not extendthe space of matroids that can be used.

Theorem 8.5.7

Let M be a graphic matroid (i.e., one that can be represented by a graph

G = (V,E)). Then M

⇤is not necessarily also graphic.

Hence, for graphic matroids, duality can increase the space and power ofmatroids, and since they are based on a graph, they are relatively easy touse: 1) all cuts are dependent sets; 2) minimal cuts are cycles; 3) bases areone edge less than minimal cuts; and 4) independent sets are edges that arenot cuts.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F30/40 (pg.113/162)

Page 114: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual Matroid Rank

Theorem 8.5.8

The rank function r

M

⇤of the dual matroid M

⇤may be specified in terms

of the rank r

M

in matroid M as follows. For X ✓ V :

r

M

⇤(X) = |X|+ r

M

(V \X)� r

M

(V ) (8.18)

Note, we again immediately see that this is submodular by theproperties of submodular functions we saw in lectures 1 and 2. I.e., |X|is modular, complement f(V \X) is submodular if f is submodular, rM (V ) is aconstant, and summing submodular functions and a constant preserves

submodularity.

Non-negativity integral follows since|X|+ r

M

(V \X) � r

M

(X) + r

M

(V \X) � r

M

(V ).

Monotone non-decreasing follows since, as X increases by one, |X|always increases by 1, while r

M

(V \X) decreases by one or zero.

Therefore, rM

⇤ is the rank function of a matroid. That it is the dualmatroid rank function is shown in the next proof.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F31/40 (pg.114/162)

Page 115: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual Matroid Rank

Theorem 8.5.8

The rank function r

M

⇤of the dual matroid M

⇤may be specified in terms

of the rank r

M

in matroid M as follows. For X ✓ V :

r

M

⇤(X) = |X|+ r

M

(V \X)� r

M

(V ) (8.18)

Note, we again immediately see that this is submodular by theproperties of submodular functions we saw in lectures 1 and 2.Non-negativity integral follows since|X|+ r

M

(V \X) � r

M

(X) + r

M

(V \X) � r

M

(V ). The right inequality

follows since rM is submodular.

Monotone non-decreasing follows since, as X increases by one, |X|always increases by 1, while r

M

(V \X) decreases by one or zero.

Therefore, rM

⇤ is the rank function of a matroid. That it is the dualmatroid rank function is shown in the next proof.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F31/40 (pg.115/162)

Page 116: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual Matroid Rank

Theorem 8.5.8

The rank function r

M

⇤of the dual matroid M

⇤may be specified in terms

of the rank r

M

in matroid M as follows. For X ✓ V :

r

M

⇤(X) = |X|+ r

M

(V \X)� r

M

(V ) (8.18)

Note, we again immediately see that this is submodular by theproperties of submodular functions we saw in lectures 1 and 2.

Non-negativity integral follows since|X|+ r

M

(V \X) � r

M

(X) + r

M

(V \X) � r

M

(V ).

Monotone non-decreasing follows since, as X increases by one, |X|always increases by 1, while r

M

(V \X) decreases by one or zero.

Therefore, rM

⇤ is the rank function of a matroid. That it is the dualmatroid rank function is shown in the next proof.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F31/40 (pg.116/162)

Page 117: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual Matroid Rank

Theorem 8.5.8

The rank function r

M

⇤of the dual matroid M

⇤may be specified in terms

of the rank r

M

in matroid M as follows. For X ✓ V :

r

M

⇤(X) = |X|+ r

M

(V \X)� r

M

(V ) (8.18)

Note, we again immediately see that this is submodular by theproperties of submodular functions we saw in lectures 1 and 2.

Non-negativity integral follows since|X|+ r

M

(V \X) � r

M

(X) + r

M

(V \X) � r

M

(V ).

Monotone non-decreasing follows since, as X increases by one, |X|always increases by 1, while r

M

(V \X) decreases by one or zero.

Therefore, rM

⇤ is the rank function of a matroid. That it is the dualmatroid rank function is shown in the next proof.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F31/40 (pg.117/162)

Page 118: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual Matroid Rank

Theorem 8.5.8

The rank function r

M

⇤of the dual matroid M

⇤may be specified in terms

of the rank r

M

in matroid M as follows. For X ✓ V :

r

M

⇤(X) = |X|+ r

M

(V \X)� r

M

(V ) (8.18)

Proof.

A set X is independent in (V, r

M

⇤) if and only if

r

M

⇤(X) = |X|+ r

M

(V \X)� r

M

(V ) = |X| (8.19)

or

r

M

(V \X) = r

M

(V ) (8.20)

But a subset X is independent in M

⇤ only if V \X is spanning in M (bythe definition of the dual matroid).

. . .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F31/40 (pg.118/162)

Page 119: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual Matroid Rank

Theorem 8.5.8

The rank function r

M

⇤of the dual matroid M

⇤may be specified in terms

of the rank r

M

in matroid M as follows. For X ✓ V :

r

M

⇤(X) = |X|+ r

M

(V \X)� r

M

(V ) (8.18)

Proof.

A set X is independent in (V, r

M

⇤) if and only if

r

M

⇤(X) = |X|+ r

M

(V \X)� r

M

(V ) = |X| (8.19)

or

r

M

(V \X) = r

M

(V ) (8.20)

But a subset X is independent in M

⇤ only if V \X is spanning in M (bythe definition of the dual matroid).

. . .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F31/40 (pg.119/162)

Page 120: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Dual Matroid Rank

Theorem 8.5.8

The rank function r

M

⇤of the dual matroid M

⇤may be specified in terms

of the rank r

M

in matroid M as follows. For X ✓ V :

r

M

⇤(X) = |X|+ r

M

(V \X)� r

M

(V ) (8.18)

Proof.

A set X is independent in (V, r

M

⇤) if and only if

r

M

⇤(X) = |X|+ r

M

(V \X)� r

M

(V ) = |X| (8.19)

or

r

M

(V \X) = r

M

(V ) (8.20)

But a subset X is independent in M

⇤ only if V \X is spanning in M (bythe definition of the dual matroid).Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F31/40 (pg.120/162)

Page 121: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid restriction/deletion

Let M = (V, I) be a matroid and let Y ✓ V , then

IY

= {Z : Z ✓ Y, Z 2 I} (8.21)

is such that MY

= (Y, IY

) is a matroid with rank r(M

Y

) = r(Y ).

This is called the restriction of M to Y , and is often written M |Y .

If Y = V \X, then we have that M |Y has the form:

IY

= {Z : Z \X = ;, Z 2 I} (8.22)

is considered a deletion of X from M , and is often written M \X.

Hence, M |Y = M \ (V \ Y ), and M |(V \X) = M \X.

The rank function is of the same form. I.e., rY

: 2

Y ! Z+, wherer

Y

(Z) = r(Z) for Z ✓ Y .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F32/40 (pg.121/162)

Page 122: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid restriction/deletion

Let M = (V, I) be a matroid and let Y ✓ V , then

IY

= {Z : Z ✓ Y, Z 2 I} (8.21)

is such that MY

= (Y, IY

) is a matroid with rank r(M

Y

) = r(Y ).

This is called the restriction of M to Y , and is often written M |Y .

If Y = V \X, then we have that M |Y has the form:

IY

= {Z : Z \X = ;, Z 2 I} (8.22)

is considered a deletion of X from M , and is often written M \X.

Hence, M |Y = M \ (V \ Y ), and M |(V \X) = M \X.

The rank function is of the same form. I.e., rY

: 2

Y ! Z+, wherer

Y

(Z) = r(Z) for Z ✓ Y .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F32/40 (pg.122/162)

Page 123: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid restriction/deletion

Let M = (V, I) be a matroid and let Y ✓ V , then

IY

= {Z : Z ✓ Y, Z 2 I} (8.21)

is such that MY

= (Y, IY

) is a matroid with rank r(M

Y

) = r(Y ).

This is called the restriction of M to Y , and is often written M |Y .

If Y = V \X, then we have that M |Y has the form:

IY

= {Z : Z \X = ;, Z 2 I} (8.22)

is considered a deletion of X from M , and is often written M \X.

Hence, M |Y = M \ (V \ Y ), and M |(V \X) = M \X.

The rank function is of the same form. I.e., rY

: 2

Y ! Z+, wherer

Y

(Z) = r(Z) for Z ✓ Y .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F32/40 (pg.123/162)

Page 124: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid restriction/deletion

Let M = (V, I) be a matroid and let Y ✓ V , then

IY

= {Z : Z ✓ Y, Z 2 I} (8.21)

is such that MY

= (Y, IY

) is a matroid with rank r(M

Y

) = r(Y ).

This is called the restriction of M to Y , and is often written M |Y .

If Y = V \X, then we have that M |Y has the form:

IY

= {Z : Z \X = ;, Z 2 I} (8.22)

is considered a deletion of X from M , and is often written M \X.

Hence, M |Y = M \ (V \ Y ), and M |(V \X) = M \X.

The rank function is of the same form. I.e., rY

: 2

Y ! Z+, wherer

Y

(Z) = r(Z) for Z ✓ Y .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F32/40 (pg.124/162)

Page 125: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid restriction/deletion

Let M = (V, I) be a matroid and let Y ✓ V , then

IY

= {Z : Z ✓ Y, Z 2 I} (8.21)

is such that MY

= (Y, IY

) is a matroid with rank r(M

Y

) = r(Y ).

This is called the restriction of M to Y , and is often written M |Y .

If Y = V \X, then we have that M |Y has the form:

IY

= {Z : Z \X = ;, Z 2 I} (8.22)

is considered a deletion of X from M , and is often written M \X.

Hence, M |Y = M \ (V \ Y ), and M |(V \X) = M \X.

The rank function is of the same form. I.e., rY

: 2

Y ! Z+, wherer

Y

(Z) = r(Z) for Z ✓ Y .

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F32/40 (pg.125/162)

Page 126: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid contraction M/Z

Contraction is dual to deletion, and is like a forced inclusion ofcontained base, but with a similar ground set removal. Contracting Z

is written M/Z.

Let Z ✓ V and let X be a base of Z. Then a subset I of V \ Z isindependent in M/Z iff I [X is independent in M .

The rank function takes the form

r

M/Z

(Y ) = r(Y [ Z)� r(Z) = r(Y |Z) (8.23)

So given I ✓ V \Z and X is a base of Z, rM/Z

(I) = |I| is identical tor(I [ Z) = |I|+ r(Z) = |I|+ |X| but r(I [ Z) = r(I [X). Thisimplies r(I [X) = |I|+ |X|, or I [X is independent in M .

A minor of a matroid is any matroid obtained via a series of deletionsand contractions of some matroid.

In fact, it is the case M/Z = (M

⇤ \ Z)

⇤ (Exercise: show why).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F33/40 (pg.126/162)

Page 127: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid contraction M/Z

Contraction is dual to deletion, and is like a forced inclusion ofcontained base, but with a similar ground set removal. Contracting Z

is written M/Z.

Let Z ✓ V and let X be a base of Z. Then a subset I of V \ Z isindependent in M/Z iff I [X is independent in M .

The rank function takes the form

r

M/Z

(Y ) = r(Y [ Z)� r(Z) = r(Y |Z) (8.23)

So given I ✓ V \Z and X is a base of Z, rM/Z

(I) = |I| is identical tor(I [ Z) = |I|+ r(Z) = |I|+ |X| but r(I [ Z) = r(I [X). Thisimplies r(I [X) = |I|+ |X|, or I [X is independent in M .

A minor of a matroid is any matroid obtained via a series of deletionsand contractions of some matroid.

In fact, it is the case M/Z = (M

⇤ \ Z)

⇤ (Exercise: show why).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F33/40 (pg.127/162)

Page 128: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid contraction M/Z

Contraction is dual to deletion, and is like a forced inclusion ofcontained base, but with a similar ground set removal. Contracting Z

is written M/Z.

Let Z ✓ V and let X be a base of Z. Then a subset I of V \ Z isindependent in M/Z iff I [X is independent in M .

The rank function takes the form

r

M/Z

(Y ) = r(Y [ Z)� r(Z) = r(Y |Z) (8.23)

So given I ✓ V \Z and X is a base of Z, rM/Z

(I) = |I| is identical tor(I [ Z) = |I|+ r(Z) = |I|+ |X| but r(I [ Z) = r(I [X). Thisimplies r(I [X) = |I|+ |X|, or I [X is independent in M .

A minor of a matroid is any matroid obtained via a series of deletionsand contractions of some matroid.

In fact, it is the case M/Z = (M

⇤ \ Z)

⇤ (Exercise: show why).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F33/40 (pg.128/162)

Page 129: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid contraction M/Z

Contraction is dual to deletion, and is like a forced inclusion ofcontained base, but with a similar ground set removal. Contracting Z

is written M/Z.

Let Z ✓ V and let X be a base of Z. Then a subset I of V \ Z isindependent in M/Z iff I [X is independent in M .

The rank function takes the form

r

M/Z

(Y ) = r(Y [ Z)� r(Z) = r(Y |Z) (8.23)

So given I ✓ V \Z and X is a base of Z, rM/Z

(I) = |I| is identical tor(I [ Z) = |I|+ r(Z) = |I|+ |X| but r(I [ Z) = r(I [X). Thisimplies r(I [X) = |I|+ |X|, or I [X is independent in M .

A minor of a matroid is any matroid obtained via a series of deletionsand contractions of some matroid.

In fact, it is the case M/Z = (M

⇤ \ Z)

⇤ (Exercise: show why).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F33/40 (pg.129/162)

Page 130: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid contraction M/Z

Contraction is dual to deletion, and is like a forced inclusion ofcontained base, but with a similar ground set removal. Contracting Z

is written M/Z.

Let Z ✓ V and let X be a base of Z. Then a subset I of V \ Z isindependent in M/Z iff I [X is independent in M .

The rank function takes the form

r

M/Z

(Y ) = r(Y [ Z)� r(Z) = r(Y |Z) (8.23)

So given I ✓ V \Z and X is a base of Z, rM/Z

(I) = |I| is identical tor(I [ Z) = |I|+ r(Z) = |I|+ |X| but r(I [ Z) = r(I [X). Thisimplies r(I [X) = |I|+ |X|, or I [X is independent in M .

A minor of a matroid is any matroid obtained via a series of deletionsand contractions of some matroid.

In fact, it is the case M/Z = (M

⇤ \ Z)

⇤ (Exercise: show why).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F33/40 (pg.130/162)

Page 131: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid contraction M/Z

Contraction is dual to deletion, and is like a forced inclusion ofcontained base, but with a similar ground set removal. Contracting Z

is written M/Z.

Let Z ✓ V and let X be a base of Z. Then a subset I of V \ Z isindependent in M/Z iff I [X is independent in M .

The rank function takes the form

r

M/Z

(Y ) = r(Y [ Z)� r(Z) = r(Y |Z) (8.23)

So given I ✓ V \Z and X is a base of Z, rM/Z

(I) = |I| is identical tor(I [ Z) = |I|+ r(Z) = |I|+ |X| but r(I [ Z) = r(I [X). Thisimplies r(I [X) = |I|+ |X|, or I [X is independent in M .

A minor of a matroid is any matroid obtained via a series of deletionsand contractions of some matroid.

In fact, it is the case M/Z = (M

⇤ \ Z)

⇤ (Exercise: show why).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F33/40 (pg.131/162)

Page 132: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid IntersectionLet M1 = (V, I1) and M2 = (V, I2) be two matroids. Consider theircommon independent sets I1 \ I2.

While (V, I1 \ I2) is typically not a matroid (Exercise: show graphicalexample.), we might be interested in finding the maximum sizecommon independent set. That is, find max |X| such that bothX 2 I1 and X 2 I2.

Theorem 8.6.1

Let M1 and M2 be given as above, with rank functions r1 and r2. Then the

size of the maximum size set in I1 \ I2 is given by

(r1 ⇤ r2)(V ) , min

X✓V

⇣r1(X) + r2(V \X)

⌘(8.24)

This is an instance of the convolution of two submodular functions, f1and f2 that, evaluated at Y ✓ V , is written as:

(f1 ⇤ f2)(Y ) = min

X✓Y

⇣f1(X) + f2(Y \X)

⌘(8.25)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F34/40 (pg.132/162)

Page 133: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid IntersectionLet M1 = (V, I1) and M2 = (V, I2) be two matroids. Consider theircommon independent sets I1 \ I2.While (V, I1 \ I2) is typically not a matroid (Exercise: show graphicalexample.), we might be interested in finding the maximum sizecommon independent set. That is, find max |X| such that bothX 2 I1 and X 2 I2.

Theorem 8.6.1

Let M1 and M2 be given as above, with rank functions r1 and r2. Then the

size of the maximum size set in I1 \ I2 is given by

(r1 ⇤ r2)(V ) , min

X✓V

⇣r1(X) + r2(V \X)

⌘(8.24)

This is an instance of the convolution of two submodular functions, f1and f2 that, evaluated at Y ✓ V , is written as:

(f1 ⇤ f2)(Y ) = min

X✓Y

⇣f1(X) + f2(Y \X)

⌘(8.25)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F34/40 (pg.133/162)

Page 134: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid IntersectionLet M1 = (V, I1) and M2 = (V, I2) be two matroids. Consider theircommon independent sets I1 \ I2.While (V, I1 \ I2) is typically not a matroid (Exercise: show graphicalexample.), we might be interested in finding the maximum sizecommon independent set. That is, find max |X| such that bothX 2 I1 and X 2 I2.

Theorem 8.6.1

Let M1 and M2 be given as above, with rank functions r1 and r2. Then the

size of the maximum size set in I1 \ I2 is given by

(r1 ⇤ r2)(V ) , min

X✓V

⇣r1(X) + r2(V \X)

⌘(8.24)

This is an instance of the convolution of two submodular functions, f1and f2 that, evaluated at Y ✓ V , is written as:

(f1 ⇤ f2)(Y ) = min

X✓Y

⇣f1(X) + f2(Y \X)

⌘(8.25)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F34/40 (pg.134/162)

Page 135: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid IntersectionLet M1 = (V, I1) and M2 = (V, I2) be two matroids. Consider theircommon independent sets I1 \ I2.While (V, I1 \ I2) is typically not a matroid (Exercise: show graphicalexample.), we might be interested in finding the maximum sizecommon independent set. That is, find max |X| such that bothX 2 I1 and X 2 I2.

Theorem 8.6.1

Let M1 and M2 be given as above, with rank functions r1 and r2. Then the

size of the maximum size set in I1 \ I2 is given by

(r1 ⇤ r2)(V ) , min

X✓V

⇣r1(X) + r2(V \X)

⌘(8.24)

This is an instance of the convolution of two submodular functions, f1and f2 that, evaluated at Y ✓ V , is written as:

(f1 ⇤ f2)(Y ) = min

X✓Y

⇣f1(X) + f2(Y \X)

⌘(8.25)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F34/40 (pg.135/162)

Page 136: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Convolution and Hall’s Theorem

Recall Hall’s theorem, that a transversal exists iff for all X ✓ V , wehave |�(X)| � |X|.

, |�(X)|� |X| � 0, 8X, min

X

|�(X)|� |X| � 0

, min

X

|�(X)|+ |V |� |X| � |V |

, min

X

⇣|�(X)|+ |V \X|

⌘� |V |

, [�(·) ⇤ | · |](V ) � |V |So Hall’s theorem can be expressed as convolution. Exercise: defineg(A) = [�(·) ⇤ | · |](A), prove that g is submodular.

Note, in general, convolution of two submodular functions does notpreserve submodularity (but in certain special cases it does).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F35/40 (pg.136/162)

Page 137: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Convolution and Hall’s Theorem

Recall Hall’s theorem, that a transversal exists iff for all X ✓ V , wehave |�(X)| � |X|., |�(X)|� |X| � 0, 8X

, min

X

|�(X)|� |X| � 0

, min

X

|�(X)|+ |V |� |X| � |V |

, min

X

⇣|�(X)|+ |V \X|

⌘� |V |

, [�(·) ⇤ | · |](V ) � |V |So Hall’s theorem can be expressed as convolution. Exercise: defineg(A) = [�(·) ⇤ | · |](A), prove that g is submodular.

Note, in general, convolution of two submodular functions does notpreserve submodularity (but in certain special cases it does).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F35/40 (pg.137/162)

Page 138: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Convolution and Hall’s Theorem

Recall Hall’s theorem, that a transversal exists iff for all X ✓ V , wehave |�(X)| � |X|., |�(X)|� |X| � 0, 8X, min

X

|�(X)|� |X| � 0

, min

X

|�(X)|+ |V |� |X| � |V |

, min

X

⇣|�(X)|+ |V \X|

⌘� |V |

, [�(·) ⇤ | · |](V ) � |V |So Hall’s theorem can be expressed as convolution. Exercise: defineg(A) = [�(·) ⇤ | · |](A), prove that g is submodular.

Note, in general, convolution of two submodular functions does notpreserve submodularity (but in certain special cases it does).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F35/40 (pg.138/162)

Page 139: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Convolution and Hall’s Theorem

Recall Hall’s theorem, that a transversal exists iff for all X ✓ V , wehave |�(X)| � |X|., |�(X)|� |X| � 0, 8X, min

X

|�(X)|� |X| � 0

, min

X

|�(X)|+ |V |� |X| � |V |

, min

X

⇣|�(X)|+ |V \X|

⌘� |V |

, [�(·) ⇤ | · |](V ) � |V |So Hall’s theorem can be expressed as convolution. Exercise: defineg(A) = [�(·) ⇤ | · |](A), prove that g is submodular.

Note, in general, convolution of two submodular functions does notpreserve submodularity (but in certain special cases it does).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F35/40 (pg.139/162)

Page 140: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Convolution and Hall’s Theorem

Recall Hall’s theorem, that a transversal exists iff for all X ✓ V , wehave |�(X)| � |X|., |�(X)|� |X| � 0, 8X, min

X

|�(X)|� |X| � 0

, min

X

|�(X)|+ |V |� |X| � |V |

, min

X

⇣|�(X)|+ |V \X|

⌘� |V |

, [�(·) ⇤ | · |](V ) � |V |So Hall’s theorem can be expressed as convolution. Exercise: defineg(A) = [�(·) ⇤ | · |](A), prove that g is submodular.

Note, in general, convolution of two submodular functions does notpreserve submodularity (but in certain special cases it does).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F35/40 (pg.140/162)

Page 141: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Convolution and Hall’s Theorem

Recall Hall’s theorem, that a transversal exists iff for all X ✓ V , wehave |�(X)| � |X|., |�(X)|� |X| � 0, 8X, min

X

|�(X)|� |X| � 0

, min

X

|�(X)|+ |V |� |X| � |V |

, min

X

⇣|�(X)|+ |V \X|

⌘� |V |

, [�(·) ⇤ | · |](V ) � |V |

So Hall’s theorem can be expressed as convolution. Exercise: defineg(A) = [�(·) ⇤ | · |](A), prove that g is submodular.

Note, in general, convolution of two submodular functions does notpreserve submodularity (but in certain special cases it does).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F35/40 (pg.141/162)

Page 142: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Convolution and Hall’s Theorem

Recall Hall’s theorem, that a transversal exists iff for all X ✓ V , wehave |�(X)| � |X|., |�(X)|� |X| � 0, 8X, min

X

|�(X)|� |X| � 0

, min

X

|�(X)|+ |V |� |X| � |V |

, min

X

⇣|�(X)|+ |V \X|

⌘� |V |

, [�(·) ⇤ | · |](V ) � |V |So Hall’s theorem can be expressed as convolution. Exercise: defineg(A) = [�(·) ⇤ | · |](A), prove that g is submodular.

Note, in general, convolution of two submodular functions does notpreserve submodularity (but in certain special cases it does).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F35/40 (pg.142/162)

Page 143: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Convolution and Hall’s Theorem

Recall Hall’s theorem, that a transversal exists iff for all X ✓ V , wehave |�(X)| � |X|., |�(X)|� |X| � 0, 8X, min

X

|�(X)|� |X| � 0

, min

X

|�(X)|+ |V |� |X| � |V |

, min

X

⇣|�(X)|+ |V \X|

⌘� |V |

, [�(·) ⇤ | · |](V ) � |V |So Hall’s theorem can be expressed as convolution. Exercise: defineg(A) = [�(·) ⇤ | · |](A), prove that g is submodular.

Note, in general, convolution of two submodular functions does notpreserve submodularity (but in certain special cases it does).

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F35/40 (pg.143/162)

Page 144: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid UnionDefinition 8.6.2

Let M1 = (V1, I1), M2 = (V2, I2), . . . , Mk

= (V

k

, Ik

) be matroids. Wedefine the union of matroids asM1 _M2 _ · · · _M

k

= (V1 ] V2 ] · · · ] V

k

, I1 _ I2 _ · · · _ Ik

), where

I1 _ I2 _ · · · _ Ik

= {I1 ] I2 ] · · · ] I

k

|I1 2 I1, . . . , Ik

2 Ik

} (8.26)

Note A ]B designates the disjoint union of A and B.

Theorem 8.6.3

Let M1 = (V1, I1), M2 = (V2, I2), . . . , Mk

= (V

k

, Ik

) be matroids, with

rank functions r1, . . . , rk

. Then the union of these matroids is still a

matroid, having rank function

r(Y ) = min

X✓Y

⇣|Y \X|+ r1(X \ V1) + · · ·+ r

k

(X \ V

k

)

⌘(8.27)

for any Y ✓ V1 [ . . . V

k

.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F36/40 (pg.144/162)

Page 145: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroid UnionDefinition 8.6.2

Let M1 = (V1, I1), M2 = (V2, I2), . . . , Mk

= (V

k

, Ik

) be matroids. Wedefine the union of matroids asM1 _M2 _ · · · _M

k

= (V1 ] V2 ] · · · ] V

k

, I1 _ I2 _ · · · _ Ik

), where

I1 _ I2 _ · · · _ Ik

= {I1 ] I2 ] · · · ] I

k

|I1 2 I1, . . . , Ik

2 Ik

} (8.26)

Note A ]B designates the disjoint union of A and B.

Theorem 8.6.3

Let M1 = (V1, I1), M2 = (V2, I2), . . . , Mk

= (V

k

, Ik

) be matroids, with

rank functions r1, . . . , rk

. Then the union of these matroids is still a

matroid, having rank function

r(Y ) = min

X✓Y

⇣|Y \X|+ r1(X \ V1) + · · ·+ r

k

(X \ V

k

)

⌘(8.27)

for any Y ✓ V1 [ . . . V

k

.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F36/40 (pg.145/162)

Page 146: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Exercise: Matroid Union, and Matroid duality

Exercise: Describe M _M

⇤.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F37/40 (pg.146/162)

Page 147: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroids of three or fewer elements are graphic

All matroids up to and including three elements (edges) are graphic.

(a) The onlymatroid with zeroelements.

(b) The twoone-elementmatroids.

(c) The fourtwo-elementmatroids.

(d) The eightthree-elementmatroids.

This is a nice way to show matroids with low ground set sizes. Whatabout matroids that are low rank but with many elements?

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F38/40 (pg.147/162)

Page 148: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroids of three or fewer elements are graphic

All matroids up to and including three elements (edges) are graphic.

(a) The onlymatroid with zeroelements.

(b) The twoone-elementmatroids.

(c) The fourtwo-elementmatroids.

(d) The eightthree-elementmatroids.

This is a nice way to show matroids with low ground set sizes. Whatabout matroids that are low rank but with many elements?

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F38/40 (pg.148/162)

Page 149: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Matroids of three or fewer elements are graphic

All matroids up to and including three elements (edges) are graphic.

(a) The onlymatroid with zeroelements.

(b) The twoone-elementmatroids.

(c) The fourtwo-elementmatroids.

(d) The eightthree-elementmatroids.

This is a nice way to show matroids with low ground set sizes. Whatabout matroids that are low rank but with many elements?

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F38/40 (pg.149/162)

Page 150: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

A�ne MatroidsGiven an n⇥m matrix with entries over some field F, we say that asubset S ✓ {1, . . . ,m} of indices (with corresponding column vectors{v

i

: i 2 S}, with |S| = k) is a�nely dependent if m � 1 and thereexists elements {a1, . . . , a

k

} 2 F, not all zero withP

k

i=1 ai = 0, such

thatP

k

i=1 aivi = 0.

Otherwise, the set is called a�nely independent.Concisely: points {v1, v2, . . . , v

k

} are a�nely independent ifv2 � v1, v3 � v1, . . . , v

k

� v1 are linearly independent.Example: in 2D, three collinear points are a�nely dependent, threenon-collear points are a�nely independent, and � 4 non-collinearpoints are a�nely dependent.

Proposition 8.7.1 (a�ne matroid)

Let ground set E = {1, . . . ,m} index column vectors of a matrix, and let Ibe the set of subsets X of E such that X indices a�nely independent

vectors. Then (E, I) is a matroid.

Exercise: prove this.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F39/40 (pg.150/162)

Page 151: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

A�ne MatroidsGiven an n⇥m matrix with entries over some field F, we say that asubset S ✓ {1, . . . ,m} of indices (with corresponding column vectors{v

i

: i 2 S}, with |S| = k) is a�nely dependent if m � 1 and thereexists elements {a1, . . . , a

k

} 2 F, not all zero withP

k

i=1 ai = 0, such

thatP

k

i=1 aivi = 0.Otherwise, the set is called a�nely independent.

Concisely: points {v1, v2, . . . , vk

} are a�nely independent ifv2 � v1, v3 � v1, . . . , v

k

� v1 are linearly independent.Example: in 2D, three collinear points are a�nely dependent, threenon-collear points are a�nely independent, and � 4 non-collinearpoints are a�nely dependent.

Proposition 8.7.1 (a�ne matroid)

Let ground set E = {1, . . . ,m} index column vectors of a matrix, and let Ibe the set of subsets X of E such that X indices a�nely independent

vectors. Then (E, I) is a matroid.

Exercise: prove this.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F39/40 (pg.151/162)

Page 152: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

A�ne MatroidsGiven an n⇥m matrix with entries over some field F, we say that asubset S ✓ {1, . . . ,m} of indices (with corresponding column vectors{v

i

: i 2 S}, with |S| = k) is a�nely dependent if m � 1 and thereexists elements {a1, . . . , a

k

} 2 F, not all zero withP

k

i=1 ai = 0, such

thatP

k

i=1 aivi = 0.Otherwise, the set is called a�nely independent.Concisely: points {v1, v2, . . . , v

k

} are a�nely independent ifv2 � v1, v3 � v1, . . . , v

k

� v1 are linearly independent.

Example: in 2D, three collinear points are a�nely dependent, threenon-collear points are a�nely independent, and � 4 non-collinearpoints are a�nely dependent.

Proposition 8.7.1 (a�ne matroid)

Let ground set E = {1, . . . ,m} index column vectors of a matrix, and let Ibe the set of subsets X of E such that X indices a�nely independent

vectors. Then (E, I) is a matroid.

Exercise: prove this.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F39/40 (pg.152/162)

Page 153: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

A�ne MatroidsGiven an n⇥m matrix with entries over some field F, we say that asubset S ✓ {1, . . . ,m} of indices (with corresponding column vectors{v

i

: i 2 S}, with |S| = k) is a�nely dependent if m � 1 and thereexists elements {a1, . . . , a

k

} 2 F, not all zero withP

k

i=1 ai = 0, such

thatP

k

i=1 aivi = 0.Otherwise, the set is called a�nely independent.Concisely: points {v1, v2, . . . , v

k

} are a�nely independent ifv2 � v1, v3 � v1, . . . , v

k

� v1 are linearly independent.Example: in 2D, three collinear points are a�nely dependent, threenon-collear points are a�nely independent, and � 4 non-collinearpoints are a�nely dependent.

Proposition 8.7.1 (a�ne matroid)

Let ground set E = {1, . . . ,m} index column vectors of a matrix, and let Ibe the set of subsets X of E such that X indices a�nely independent

vectors. Then (E, I) is a matroid.

Exercise: prove this.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F39/40 (pg.153/162)

Page 154: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

A�ne MatroidsGiven an n⇥m matrix with entries over some field F, we say that asubset S ✓ {1, . . . ,m} of indices (with corresponding column vectors{v

i

: i 2 S}, with |S| = k) is a�nely dependent if m � 1 and thereexists elements {a1, . . . , a

k

} 2 F, not all zero withP

k

i=1 ai = 0, such

thatP

k

i=1 aivi = 0.Otherwise, the set is called a�nely independent.Concisely: points {v1, v2, . . . , v

k

} are a�nely independent ifv2 � v1, v3 � v1, . . . , v

k

� v1 are linearly independent.Example: in 2D, three collinear points are a�nely dependent, threenon-collear points are a�nely independent, and � 4 non-collinearpoints are a�nely dependent.

Proposition 8.7.1 (a�ne matroid)

Let ground set E = {1, . . . ,m} index column vectors of a matrix, and let Ibe the set of subsets X of E such that X indices a�nely independent

vectors. Then (E, I) is a matroid.

Exercise: prove this.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F39/40 (pg.154/162)

Page 155: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

A�ne MatroidsGiven an n⇥m matrix with entries over some field F, we say that asubset S ✓ {1, . . . ,m} of indices (with corresponding column vectors{v

i

: i 2 S}, with |S| = k) is a�nely dependent if m � 1 and thereexists elements {a1, . . . , a

k

} 2 F, not all zero withP

k

i=1 ai = 0, such

thatP

k

i=1 aivi = 0.Otherwise, the set is called a�nely independent.Concisely: points {v1, v2, . . . , v

k

} are a�nely independent ifv2 � v1, v3 � v1, . . . , v

k

� v1 are linearly independent.Example: in 2D, three collinear points are a�nely dependent, threenon-collear points are a�nely independent, and � 4 non-collinearpoints are a�nely dependent.

Proposition 8.7.1 (a�ne matroid)

Let ground set E = {1, . . . ,m} index column vectors of a matrix, and let Ibe the set of subsets X of E such that X indices a�nely independent

vectors. Then (E, I) is a matroid.

Exercise: prove this.Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F39/40 (pg.155/162)

Page 156: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Euclidean Representation of Low-rank Matroids

Consider the a�ne matroid with n⇥m = 2⇥ 6 matrix on the fieldF = R, and let the elements be {(0, 0), (1, 0), (2, 0), (0, 1), (0, 2), (1, 1)}.

We can plot the points in R2 as on the right:

Points have rank 1, lines have rank 2, planes haverank 3.

Flats (points, lines, planes, etc.) have rank equalto one more than their geometric dimension.

Any two points constitute a line, but lines withonly two points are not drawn.

Lines indicate collinear sets with � 3 points, whileany two points have rank 2.

Dependent sets consist of all subsets with � 4

elements (rank 3), or 3 collinear elements (rank2). Any two points have rank 2.

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F40/40 (pg.156/162)

Page 157: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Euclidean Representation of Low-rank Matroids

Consider the a�ne matroid with n⇥m = 2⇥ 6 matrix on the fieldF = R, and let the elements be {(0, 0), (1, 0), (2, 0), (0, 1), (0, 2), (1, 1)}.We can plot the points in R2 as on the right:

Points have rank 1, lines have rank 2, planes haverank 3.

Flats (points, lines, planes, etc.) have rank equalto one more than their geometric dimension.

Any two points constitute a line, but lines withonly two points are not drawn.

Lines indicate collinear sets with � 3 points, whileany two points have rank 2.

Dependent sets consist of all subsets with � 4

elements (rank 3), or 3 collinear elements (rank2). Any two points have rank 2.

x

y

(0,1) (0,2)

(1,1)(1,0)

(2,0)

(0,0)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F40/40 (pg.157/162)

Page 158: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Euclidean Representation of Low-rank Matroids

Consider the a�ne matroid with n⇥m = 2⇥ 6 matrix on the fieldF = R, and let the elements be {(0, 0), (1, 0), (2, 0), (0, 1), (0, 2), (1, 1)}.We can plot the points in R2 as on the right:

Points have rank 1, lines have rank 2, planes haverank 3.

Flats (points, lines, planes, etc.) have rank equalto one more than their geometric dimension.

Any two points constitute a line, but lines withonly two points are not drawn.

Lines indicate collinear sets with � 3 points, whileany two points have rank 2.

Dependent sets consist of all subsets with � 4

elements (rank 3), or 3 collinear elements (rank2). Any two points have rank 2.

x

y

(0,1) (0,2)

(1,1)(1,0)

(2,0)

(0,0)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F40/40 (pg.158/162)

Page 159: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Euclidean Representation of Low-rank Matroids

Consider the a�ne matroid with n⇥m = 2⇥ 6 matrix on the fieldF = R, and let the elements be {(0, 0), (1, 0), (2, 0), (0, 1), (0, 2), (1, 1)}.We can plot the points in R2 as on the right:

Points have rank 1, lines have rank 2, planes haverank 3.

Flats (points, lines, planes, etc.) have rank equalto one more than their geometric dimension.

Any two points constitute a line, but lines withonly two points are not drawn.

Lines indicate collinear sets with � 3 points, whileany two points have rank 2.

Dependent sets consist of all subsets with � 4

elements (rank 3), or 3 collinear elements (rank2). Any two points have rank 2.

x

y

(0,1) (0,2)

(1,1)(1,0)

(2,0)

(0,0)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F40/40 (pg.159/162)

Page 160: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Euclidean Representation of Low-rank Matroids

Consider the a�ne matroid with n⇥m = 2⇥ 6 matrix on the fieldF = R, and let the elements be {(0, 0), (1, 0), (2, 0), (0, 1), (0, 2), (1, 1)}.We can plot the points in R2 as on the right:

Points have rank 1, lines have rank 2, planes haverank 3.

Flats (points, lines, planes, etc.) have rank equalto one more than their geometric dimension.

Any two points constitute a line, but lines withonly two points are not drawn.

Lines indicate collinear sets with � 3 points, whileany two points have rank 2.

Dependent sets consist of all subsets with � 4

elements (rank 3), or 3 collinear elements (rank2). Any two points have rank 2.

x

y

(0,1) (0,2)

(1,1)(1,0)

(2,0)

(0,0)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F40/40 (pg.160/162)

Page 161: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Euclidean Representation of Low-rank Matroids

Consider the a�ne matroid with n⇥m = 2⇥ 6 matrix on the fieldF = R, and let the elements be {(0, 0), (1, 0), (2, 0), (0, 1), (0, 2), (1, 1)}.We can plot the points in R2 as on the right:

Points have rank 1, lines have rank 2, planes haverank 3.

Flats (points, lines, planes, etc.) have rank equalto one more than their geometric dimension.

Any two points constitute a line, but lines withonly two points are not drawn.

Lines indicate collinear sets with � 3 points, whileany two points have rank 2.

Dependent sets consist of all subsets with � 4

elements (rank 3), or 3 collinear elements (rank2). Any two points have rank 2.

x

y

(0,1) (0,2)

(1,1)(1,0)

(2,0)

(0,0)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F40/40 (pg.161/162)

Page 162: Submodular Functions, Optimization, and Applications to ......L2(3/30): Machine Learning Apps (diversity, complexity, parameter, learning target, surrogate). L3(4/4): Info theory exs,

Transversal Matroid Matroid and representation Dual Matroid Other Matroid Properties Combinatorial Geometries

Euclidean Representation of Low-rank Matroids

Consider the a�ne matroid with n⇥m = 2⇥ 6 matrix on the fieldF = R, and let the elements be {(0, 0), (1, 0), (2, 0), (0, 1), (0, 2), (1, 1)}.We can plot the points in R2 as on the right:

Points have rank 1, lines have rank 2, planes haverank 3.

Flats (points, lines, planes, etc.) have rank equalto one more than their geometric dimension.

Any two points constitute a line, but lines withonly two points are not drawn.

Lines indicate collinear sets with � 3 points, whileany two points have rank 2.

Dependent sets consist of all subsets with � 4

elements (rank 3), or 3 collinear elements (rank2). Any two points have rank 2.

x

y

(0,1) (0,2)

(1,1)(1,0)

(2,0)

(0,0)

Prof. Je↵ Bilmes EE596b/Spring 2016/Submodularity - Lecture 8 - Apr 25th, 2016 F40/40 (pg.162/162)


Recommended