+ All Categories
Home > Documents > Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M....

Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M....

Date post: 22-Sep-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
31
Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton University Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 1 / 28
Transcript
Page 1: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Mixed Membership Stochastic Blockmodels(2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg

and Eric P. Xing

Herrissa Lamothe

Princeton University

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 1 / 28

Page 2: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Outline

1 Overview

2 The MMSB ModelMixed MembershipModel Estimation

3 Application of Mixed Membership ModelEmpirical and Synthetic DataDrawbacks to the MMSBModel Flexibility

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 2 / 28

Page 3: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Motivating question

Are there certain ”rules” dictating how individual vertices (or nodes) make”decisions” to connect/not to connect to other vertices?

Let’s suppose we can draw a network graph, G from a generativemodel, so that G comes from a probability distribution Pr(G |θ),governed by parameter θ

so that if θ is partly known, it can act as a constraint in generatingsynthetic graphs (similar to G )

if G is partly known, we can use it to infer a θ that make G likely

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 3 / 28

Page 4: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

How do we obtain θ?

Pr(G |θ) =∏

ij

Pr(Aij |θ)

Vertex-level attributes: too chaotic (over-fit)

Global-network attributes: too general (over-generalize)

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 4 / 28

Page 5: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Introducing structure

Differentiates data from noise

Captures relevant patterns

Describing patterns and predicting them

Vertices → Communities

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 5 / 28

Page 6: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Let’s Rephrase the Motivating Question

How do groups of vertices make decisions to connect/not to connect toother groups?

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 6 / 28

Page 7: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

The stochastic block model

Class of models of which Mixed Membership Stochastic Blockmodel is avariant.

Aaron Clauset (UColorado at Boulder, Computer Science)

has amazing slides...Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 7 / 28

Page 8: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

• each vertex has type ( vertex types or groups)

• stochastic block matrix of group-level connection probabilities

• probability that are connected =

community = vertices with same pattern of inter-community connections

the stochastic block model

i zi 2 {1, . . . , k}M

k

i, j Mzi,zj

00.5

1

a(t) 0

1

0

1

020

040

060

0

0

1

alig

nmen

t pos

ition

t

1

23

4

56

78

9

calcu

late

alig

nmen

t sco

res

conv

ert t

o al

ignm

ent i

ndica

tors

rem

ove

shor

t alig

ned

regi

ons

extra

ct h

ighl

y va

riabl

e re

gion

s

NGDYKEKVSNNLRAIFNKIYENLNDPKLKKHYQKDAPNY

NGDYKKKVSNNLKTIFKKIYDALKDTVKETYKDDPNY

NGDYKEKVSNNLRAIFKKIYDALEDTVKETYKDDPNY

16

6

13

166

13

A

B

C

D

00.5

1

a(t) 0

1

0

1

020

040

060

0

0

1

alig

nmen

t pos

ition

t

1

23

4

56

78

9

calcu

late

alig

nmen

t sco

res

conv

ert t

o al

ignm

ent i

ndica

tors

rem

ove

shor

t alig

ned

regi

ons

extra

ct h

ighl

y va

riabl

e re

gion

s

NGDYKEKVSNNLRAIFNKIYENLNDPKLKKHYQKDAPNY

NGDYKKKVSNNLKTIFKKIYDALKDTVKETYKDDPNY

NGDYKEKVSNNLRAIFKKIYDALEDTVKETYKDDPNY

16

6

13

166

13

A

B

C

D

inferred M

Page 9: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

the stochastic block model

assortativeedges within groups

disassortativeedges between groups

orderedlinear group hierarchy

core-peripherydense core, sparse periphery

Page 10: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

✓a,⇤

the most general SBM

the stochastic block model

Pr(A | z, ✓) =Y

i,j

f�Aij | ✓R(zi,zj)

f

RAij : value of adjacency

: partition of adjacencies: probability function: pattern for -type adjacencies

Binomial = simple graphsPoisson = multi-graphsNormal = weighted graphsetc.

a

✓11

✓22

✓33

✓44

✓12

✓21

✓31 ✓32

✓41 ✓42 ✓43

✓34

✓24

✓14✓13

✓23

Page 11: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Interactions across blocks

Classes of SBMs are looking at interactions across ”blocks”

They are not directly looking for patches of ”connectedness” amongnodes.

Assume that individual nodes’ behavior can be explained entirely bygroup membership.

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 8 / 28

Page 12: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Sociological contributions

If this way of operationalizing the problem (in terms of group membership)seems familiar, that is because sociologists have made many contributionsto this line of research.

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 9 / 28

Page 13: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Implications for interactions within blocks

Given this framework, what are we implying about connections betweenvertices i and j if they belong to the same group k?

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 10 / 28

Page 14: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Outline

1 Overview

2 The MMSB ModelMixed MembershipModel Estimation

3 Application of Mixed Membership ModelEmpirical and Synthetic DataDrawbacks to the MMSBModel Flexibility

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 11 / 28

Page 15: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Mixed Membership

Mixed membership stochastic block model (MMSB) (f = Bernoulli)

Similar to SBM, but with an extra layer of parameters to estimate.

Key assumptions remain: Pr(i → j) = Mzi ,zj

M = Stochastic Block Matrix

But, zi and zj must be estimated for each dyadic interaction between alli and j vertices, based on a latent mixed membership vector for each i.

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 12 / 28

Page 16: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Mixed Membership

each vertex i has a mixed membership vector θi ∼ Dirichlet(α)

vertex i takes on a single group membership with probability θi in thecontext of a directed dyadic interaction with vertex j

For each pair of vertices (i , j) ∈ [1,N]x [1,N] (adjacency matrix),

Sample group membership of i and j independently

Sample group zi→j ∼ Multinomial(θi , 1)

Sample group zi←j ∼ Multinomial(θi , 1)

Note: This model specification can be adapted to undirected interactionseasily, zi↔j

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 13 / 28

Page 17: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Mixed Membership

Key Innovations:

nodes belong to more than one group

nodes belong to groups with different strengths of membership

nodes take on a specific group membership for the duration of aninteraction

include sparsity parameter, control model’s sensitivity to zeros inadjacency matrix due to noise.

Final sampling of Aij ∼ Bernoulli (ρ zi→j M zi←j + (1− ρ) δ0)

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 14 / 28

Page 18: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Mixed Membership

joint probability distribution:

p(A, θ, zi , zj |α,M) =N∏

i=1

p(θi |α)N∏

j=1

p(zi→j |θi )p(zi←j |θj)p(Aij |zi→j , zi←j ,M)

Where,

A is the observed adjacency matrix

M is the block matrix

θi and θj are the mixed membership vectors for i and j

zi and zj are the group membership indicators for i and j during theirinteraction

The only input to this model is the number of groups.

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 15 / 28

Page 19: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Outline

1 Overview

2 The MMSB ModelMixed MembershipModel Estimation

3 Application of Mixed Membership ModelEmpirical and Synthetic DataDrawbacks to the MMSBModel Flexibility

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 16 / 28

Page 20: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

MMSB Estimation

We won’t focus on this too much...

Only to note that the actual marginal probability (likelihood) ofp(A|α,M) is not tractable to compute (i.e. we cannot integrateout z and α).

Airoldi et al. carry out an approximate inference and parameterestimation.

In order to compute the posterior degrees of membership for all i givenhyperparameters (θ and α):

p(θ|A, α,M) =p(θ,A|α,M)

p(A|α,M)

They use variational methods: ”find a lower bound of the likelihood andapproximate posterior distributions for each objects membership vector.”(Airoldi et al, 2015 p. 7)

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 17 / 28

Page 21: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

MMSB Estimation

What are the implications of this estimation strategy for the model?

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 18 / 28

Page 22: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Outline

1 Overview

2 The MMSB ModelMixed MembershipModel Estimation

3 Application of Mixed Membership ModelEmpirical and Synthetic DataDrawbacks to the MMSBModel Flexibility

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 19 / 28

Page 23: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Sampson monk factions

(a) MMSB with LDA(b) Airoldi et al.: Variational Methods

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 20 / 28

Page 24: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Synthetic data

How to create ”good” synthetic data?

various levels of difficulty of detection? (cin − cout)

specific block patterns?

always the problem of linking recovered partitions to actual theorizedgroups

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 21 / 28

Page 25: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Outline

1 Overview

2 The MMSB ModelMixed MembershipModel Estimation

3 Application of Mixed Membership ModelEmpirical and Synthetic DataDrawbacks to the MMSBModel Flexibility

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 22 / 28

Page 26: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Potential Issues

1 Estimation procedure

2 Selecting ”K” number of groups

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 23 / 28

Page 27: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Selecting K

Two Goals: Prediction and Interpretation

1 Prediction: We want to identify the number of group affiliations, K ,most predictive of observed patterns of interaction, G .

2 Interpretation (Hypothesis-Driven): We want to determinehow predictive a specific set of hypothesized group affiliations, K , isof observed patterns of interaction, G .

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 24 / 28

Page 28: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Selecting K

Our goal affects how we deal with two key issuesin community detection:

1 How do we determine the right number of groups in analyzing theinteractions in G?

prediction: Find the most predictive K (i.e. BIC,or cross-validation).interpretation: ?

2 How do we know that the way our algorithm (”heuristic”) partitionedthe data is always the same for K groups?

prediction: permutation of components of θi tointerpret [E |θi (k)|A], if we know components of eachfunctional group.interpretation: ?

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 25 / 28

Page 29: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Outline

1 Overview

2 The MMSB ModelMixed MembershipModel Estimation

3 Application of Mixed Membership ModelEmpirical and Synthetic DataDrawbacks to the MMSBModel Flexibility

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 26 / 28

Page 30: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

There are also times when we, as researchers, have prior (if partial)information that we want to test and inject in our model...

partial information on K

partial information on M (block, or ”mixing” matrix)

partial information on θ (mixed membership vectors

partial information on decision rule for selecting zi→j or zi←j . (i.e.we might want to break the independence assumption).

or any combination of the above

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 27 / 28

Page 31: Mixed Membership Stochastic Blockmodels...Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton

Questions to the room

Are there tools available to allow researchers to take advantage ofthis potential flexibility in the MMSB and SBMs in general?

What types of research question can we address using the MMSBmodel?

What types of research question can we address if we can take fulladvantage of the model’s potential flexibility?

and as always, there are questions about the assumptions made by thismodel.

Under what circumstances (for what research questions) are wewilling to make them?

Under what circumstances would these assumptions not hold?

Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 28 / 28


Recommended