+ All Categories
Home > Documents > An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International...

An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International...

Date post: 26-Aug-2020
Category:
Upload: others
View: 2 times
Download: 0 times
Share this document with a friend
34
IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams Yifeng Zeng Prashant Doshi Aalborg University, Denmark University of Georgia, USA Speaker Muthukumaran C. University of Georgia, USA
Transcript
Page 1: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

An Information-Theoretic Approach to ModelIdentification in Interactive Influence

Diagrams

Yifeng Zeng Prashant DoshiAalborg University, Denmark University of Georgia, USA

SpeakerMuthukumaran C.

University of Georgia, USA

Page 2: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Outline

Outline

I Problem StatementI Related WorkI Interactive Influence Diagrams (I - ID)I Bayesian Model IdentificationI Information-Theoretic Model IdentificationI Experimental Results

Page 3: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Opponent Modeling

Guess Your Opponent!

I Repeated GamesI Observe previous

actionsI Predict next actionsI Win the rewards

I Model OpponentI How and What will

he/she play?

Page 4: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Related Work

Review

I Carmel&Markovitch(1996)I Model agents’ strategies using finite state automata

I Suryadi&Gmytrasiewicz(1999)I Learn influence diagrams to be consistent with

observationsI Saha et al.(2005)

I Approximate agents’ decision functions using Chebyshevpolynomials

Page 5: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Our Representation

Interactive Influence Diagrams (I - ID)

Interactive Influence Diagram (I - ID, Doshi et al. 2007)

I A generic level l Interactive-ID (I-ID) foragent i situated with one other agent j

I Model Node: Mj,l−1I Models of agent j at level l − 1

I Policy link: dashed lineI Distribution over agent j ’s actions

given its modelsI Beliefs on Mj,l−1: P(Mj,l−1|s)

I Be updated over time

Page 6: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Our Representation

Model Node

Details of the Model Node

I Members of the model nodeI Different chance nodes: solutions of

models mj,l−1I Mod [Mj ] represents the different

models of agent jI CPT of the chance node Aj is a

multiplexerI Assumes the distribution of each of the

action nodes (A1j , A2

j ) depending onthe value of Mod [Mj ]

Page 7: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Our Representation

Typical Case

Public Good (PG) Game

I There are two agents initially endowed with XT amount ofresources. Each agent may choose: Fully Contribute (FC),Partially Contribute (PC) the resources to a public pot, ornot contribute (D: called defect here)

I The value of resources in the public pot is discounted by ci(≤1) for each agent i , where ci is the marginal privatereturn

I In order to encourage contributions, the contributingagents punish free riders P but incur a small cost cp foradministering the punishment

Page 8: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Our Representation

Typical Case

Public Good (PG) Game

I There are two agents initially endowed with XT amount ofresources. Each agent may choose: Fully Contribute (FC),Partially Contribute (PC) the resources to a public pot, ornot contribute (D: called defect here)

I The value of resources in the public pot is discounted by ci(≤1) for each agent i , where ci is the marginal privatereturn

I In order to encourage contributions, the contributingagents punish free riders P but incur a small cost cp foradministering the punishment

Page 9: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Our Representation

Typical Case

Public Good (PG) Game

I There are two agents initially endowed with XT amount ofresources. Each agent may choose: Fully Contribute (FC),Partially Contribute (PC) the resources to a public pot, ornot contribute (D: called defect here)

I The value of resources in the public pot is discounted by ci(≤1) for each agent i , where ci is the marginal privatereturn

I In order to encourage contributions, the contributingagents punish free riders P but incur a small cost cp foradministering the punishment

Page 10: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Our Representation

Reward Function

Payoff Matrix

i , j FC PC DFC 2ciXT , 3

2 XT ci − 12 cp, ciXT − cp,

2cjXT12 XT + 3

2 XT cj − 12 P XT + cjXT − P

PC 12 XT + 3

2 XT ci − 12 P, 1

2 XT + ciXT , 12 XT + 1

2 ciXT − 12 P,

32 XT cj − 1

2 cp12 XT + cjXT XT + 1

2 cjXT − PD XT + ciXT − P, XT + 1

2 ciXT − P, XT ,cjXT − cp

12 XT + 1

2 cjXT − 12 P XT

Table: PG game with punishment. Based on punishment, P, andmarginal return, ci , agents may choose to contribute than defect.

Page 11: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Our Representation

Candidate Models

Agent j ’s Types

I m1j : A reciprocal agent who contributes only when it

expects the other agent to contribute as wellI Low values of ci

I m2j : An altruistic agent who prefers to contribute during the

playI High values of ci

I m3j : Relies on both its own and opponent actions in the

previous time stepI m4

j : Relies more on the past interaction - up to twoprevious time steps

Page 12: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Our Representation

I-ID for PG Game

Page 13: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Model Identification

Two Cases

I Case 1: m∗j ∈ Mj (Traditional)

I Bayesian Model IdentificationI Case 2: m∗

j 6∈ MjI Information-Theoretic Model Identification

Page 14: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Case 1: m∗j ∈ Mj - Bayesian Model Identification

Belief Update

Bayesian Learning (Traditional)

Pr(mnj |o

ti ) =

Pr(oti |m

nj )Pr(mn

j |o1:t−1)∑mj∈Mj

Pr(oti |mj)Pr(mj)

(1)

I If an agent’s prior belief assigns a non-zero probability tothe true model of the other agent, its posterior beliefsupdated using Bayesian learning will converge withprobability 1

I Don’t always converge to the true model of the other agentI Observationally equivalent models

Page 15: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Case 1: m∗j ∈ Mj - Bayesian Model Identification

Observational Equivalence

Observational Equivalence

I Two j ’s ModelsI Model 1: Select FC for an infinite number of steps, but if at

any time i chooses PC, j would also do so at the next timestep and then continue selecting PC

I Model 2: Play tit-for-tat strategy: j performs the actionwhich i did in the previous time step

I i selects FC for an infinite number of times

Page 16: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Case 2: m∗j 6∈ Mj - Information-Theoretic Model Identification

Relevant Models

Relevant Models

I Relevant model mnj

I A relevant model predicts an action that is likely to correlatewith a particular observed action of the other agent

I Pr(a1j |mn

j , a∗j ) ≥ Pr(a1j |mn

j , a∗j ), where a1j ∈ OPT (mn

j )

I We interpret the existence of a mutual pattern as evidencethat the candidate model shares some behavioral aspectsof the true model

I Assign large probabilities to mnj in Mod [Mj ] over time

Page 17: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Case 2: m∗j 6∈ Mj - Information-Theoretic Model Identification

Parameter Learning

Learning Naive Bayesian Models

Figure: History of interaction

Page 18: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Case 2: m∗j 6∈ Mj - Information-Theoretic Model Identification

Mutual Information

Mutual Information as Model Weight

MI(mnj , m∗

j )def= Pr(An

j , Aj)log[Pr(An

j ,Aj )

Pr(Anj )Pr(Aj )

]

= Pr(Anj |Aj)Pr(Aj)log[

Pr(Anj |Aj )

Pr(Anj )

](2)

I Anj : the chance node mapped from mn

jI Aj : the observed actions generated by m∗

j

Page 19: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Case 2: m∗j 6∈ Mj - Information-Theoretic Model Identification

Algorithm

Model Weight Update

Step 1: Update the training set using i ’sobservations and model mp

j solutions

Step 2: Learn the parameters of the naive BNincluding the chance nodes A1

j ,. . ., Anj , and Aj

LoopStep 3: Compute MI(mp

j , m∗j )

Step 4: Obtain Pr(Aj |Apj ) from the learned

naive BNStep 5: Populate CPD row of the chance

node Aj using Pr(Aj |Apj , mp

j )

Step 6: Normalize MI(mpj , m∗

j )

Step 7: Populate CPD of the chance nodeMod [Mj ] using MI

Page 20: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Case 2: m∗j 6∈ Mj - Information-Theoretic Model Identification

Algorithm

Model Weight Update

Step 1: Update the training set using i ’sobservations and model mp

j solutionsStep 2: Learn the parameters of the naive BNincluding the chance nodes A1

j ,. . ., Anj , and Aj

LoopStep 3: Compute MI(mp

j , m∗j )

Step 4: Obtain Pr(Aj |Apj ) from the learned

naive BNStep 5: Populate CPD row of the chance

node Aj using Pr(Aj |Apj , mp

j )

Step 6: Normalize MI(mpj , m∗

j )

Step 7: Populate CPD of the chance nodeMod [Mj ] using MI

Page 21: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Case 2: m∗j 6∈ Mj - Information-Theoretic Model Identification

Algorithm

Model Weight Update

Step 1: Update the training set using i ’sobservations and model mp

j solutionsStep 2: Learn the parameters of the naive BNincluding the chance nodes A1

j ,. . ., Anj , and Aj

Loop

Step 3: Compute MI(mpj , m∗

j )

Step 4: Obtain Pr(Aj |Apj ) from the learned

naive BNStep 5: Populate CPD row of the chance

node Aj using Pr(Aj |Apj , mp

j )

Step 6: Normalize MI(mpj , m∗

j )

Step 7: Populate CPD of the chance nodeMod [Mj ] using MI

Page 22: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Case 2: m∗j 6∈ Mj - Information-Theoretic Model Identification

Algorithm

Model Weight Update

Step 1: Update the training set using i ’sobservations and model mp

j solutionsStep 2: Learn the parameters of the naive BNincluding the chance nodes A1

j ,. . ., Anj , and Aj

LoopStep 3: Compute MI(mp

j , m∗j )

Step 4: Obtain Pr(Aj |Apj ) from the learned

naive BNStep 5: Populate CPD row of the chance

node Aj using Pr(Aj |Apj , mp

j )

Step 6: Normalize MI(mpj , m∗

j )

Step 7: Populate CPD of the chance nodeMod [Mj ] using MI

Page 23: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Case 2: m∗j 6∈ Mj - Information-Theoretic Model Identification

Algorithm

Model Weight Update

Step 1: Update the training set using i ’sobservations and model mp

j solutionsStep 2: Learn the parameters of the naive BNincluding the chance nodes A1

j ,. . ., Anj , and Aj

LoopStep 3: Compute MI(mp

j , m∗j )

Step 4: Obtain Pr(Aj |Apj ) from the learned

naive BN

Step 5: Populate CPD row of the chancenode Aj using Pr(Aj |Ap

j , mpj )

Step 6: Normalize MI(mpj , m∗

j )

Step 7: Populate CPD of the chance nodeMod [Mj ] using MI

Page 24: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Case 2: m∗j 6∈ Mj - Information-Theoretic Model Identification

Algorithm

Model Weight Update

Step 1: Update the training set using i ’sobservations and model mp

j solutionsStep 2: Learn the parameters of the naive BNincluding the chance nodes A1

j ,. . ., Anj , and Aj

LoopStep 3: Compute MI(mp

j , m∗j )

Step 4: Obtain Pr(Aj |Apj ) from the learned

naive BNStep 5: Populate CPD row of the chance

node Aj using Pr(Aj |Apj , mp

j )

Step 6: Normalize MI(mpj , m∗

j )

Step 7: Populate CPD of the chance nodeMod [Mj ] using MI

Page 25: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Case 2: m∗j 6∈ Mj - Information-Theoretic Model Identification

Algorithm

Model Weight Update

Step 1: Update the training set using i ’sobservations and model mp

j solutionsStep 2: Learn the parameters of the naive BNincluding the chance nodes A1

j ,. . ., Anj , and Aj

LoopStep 3: Compute MI(mp

j , m∗j )

Step 4: Obtain Pr(Aj |Apj ) from the learned

naive BNStep 5: Populate CPD row of the chance

node Aj using Pr(Aj |Apj , mp

j )

Step 6: Normalize MI(mpj , m∗

j )

Step 7: Populate CPD of the chance nodeMod [Mj ] using MI

Page 26: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Case 2: m∗j 6∈ Mj - Information-Theoretic Model Identification

Algorithm

Model Weight Update

Step 1: Update the training set using i ’sobservations and model mp

j solutionsStep 2: Learn the parameters of the naive BNincluding the chance nodes A1

j ,. . ., Anj , and Aj

LoopStep 3: Compute MI(mp

j , m∗j )

Step 4: Obtain Pr(Aj |Apj ) from the learned

naive BNStep 5: Populate CPD row of the chance

node Aj using Pr(Aj |Apj , mp

j )

Step 6: Normalize MI(mpj , m∗

j )

Step 7: Populate CPD of the chance nodeMod [Mj ] using MI

Page 27: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Case 2: m∗j 6∈ Mj - Information-Theoretic Model Identification

Theoretical Results

Some Properties

I Property 1I Irrelevance: Pr(aj |mn

j , a∗j ) = Pr (aj | mnj , a∗j )

I MI(mnj , m∗

j ) = 0

I Property 2

I Relevance Ordering(mnj is more relevant than mp

j ):Pr(a1

j |mnj , a∗j ) ≥ Pr(a1

j |mpj , a∗j ) and

Pr(a1j |mn

j , a∗j ) ≤ Pr(a1j |m

pj , a∗j )

I Larger MI is assigned to mnj : MI (mn

j , m∗j ) ≥ MI (mp

j , m∗j )

I Property 3

I Convergence

I Given that the true model m∗j ∈ Mj and is assigned a

non-zero probability, the normalized distribution of mutualinformation of the models converges with probability 1

Page 28: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Case 2: m∗j 6∈ Mj - Information-Theoretic Model Identification

Theoretical Results

Some Properties

I Property 1I Irrelevance: Pr(aj |mn

j , a∗j ) = Pr (aj | mnj , a∗j )

I MI(mnj , m∗

j ) = 0I Property 2

I Relevance Ordering(mnj is more relevant than mp

j ):Pr(a1

j |mnj , a∗j ) ≥ Pr(a1

j |mpj , a∗j ) and

Pr(a1j |mn

j , a∗j ) ≤ Pr(a1j |m

pj , a∗j )

I Larger MI is assigned to mnj : MI (mn

j , m∗j ) ≥ MI (mp

j , m∗j )

I Property 3

I Convergence

I Given that the true model m∗j ∈ Mj and is assigned a

non-zero probability, the normalized distribution of mutualinformation of the models converges with probability 1

Page 29: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Case 2: m∗j 6∈ Mj - Information-Theoretic Model Identification

Theoretical Results

Some Properties

I Property 1I Irrelevance: Pr(aj |mn

j , a∗j ) = Pr (aj | mnj , a∗j )

I MI(mnj , m∗

j ) = 0I Property 2

I Relevance Ordering(mnj is more relevant than mp

j ):Pr(a1

j |mnj , a∗j ) ≥ Pr(a1

j |mpj , a∗j ) and

Pr(a1j |mn

j , a∗j ) ≤ Pr(a1j |m

pj , a∗j )

I Larger MI is assigned to mnj : MI (mn

j , m∗j ) ≥ MI (mp

j , m∗j )

I Property 3I Convergence

I Given that the true model m∗j ∈ Mj and is assigned a

non-zero probability, the normalized distribution of mutualinformation of the models converges with probability 1

Page 30: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Case 2: m∗j 6∈ Mj - Information-Theoretic Model Identification

Potential Limitations

MI Equivalence

I One exampleI True model: j always plays FCI Candidate model: j always plays DI Both models are assigned equal MI

I Dependency is elicited between D and FC

I Set of MI equivalence ⊇ Set of Observational equivalenceI NOT affect prediction performance

I The perceived dependency classifies D into FC through thelearned parameters Pr(Aj |Ap

j )

Page 31: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Evaluation

Method Evaluation

I MethodsI Bayesian Learning (BL)I Mutual Information (MI)I Adaptation Bayesian Learning (A− BL)

I Restart the BL process when the likelihoods become zero byassigning candidate models prior weights using thefrequency with which the observed action has beenpredicted by the candidate models so far

I KL DivergenceI Measure difference between An

j and Aj distributions

I Scenarios

I PG GamesI Negotiation Games (4 types of opponents)

Page 32: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Evaluation

Method Evaluation

I MethodsI Bayesian Learning (BL)I Mutual Information (MI)I Adaptation Bayesian Learning (A− BL)

I Restart the BL process when the likelihoods become zero byassigning candidate models prior weights using thefrequency with which the observed action has beenpredicted by the candidate models so far

I KL DivergenceI Measure difference between An

j and Aj distributionsI Scenarios

I PG GamesI Negotiation Games (4 types of opponents)

Page 33: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Experimental Results

Case 1: m∗j = m4

j , Mj={m1j , m3

j , m4j } Case 2: m∗

j = m1j , Mj={m2

j , m3j , m4

j }

Page 34: An Information-Theoretic Approach to Model Identification ...€¦ · IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08) An Information-Theoretic Approach

IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT’08)

An Information-Theoretic Approach to Model Identification in Interactive Influence Diagrams

Conclusions

Conclusions

I I-ID in Repeated GamesI Two Cases for Model Identification in I-IDI MI Complements BL


Recommended