+ All Categories
Home > Documents > SIF8072 Distributed Artificial Intelligence and Intelligent Agents

SIF8072 Distributed Artificial Intelligence and Intelligent Agents

Date post: 12-Feb-2016
Category:
Upload: amil
View: 41 times
Download: 0 times
Share this document with a friend
Description:
Lecture 2: Multi-agent Interactions. SIF8072 Distributed Artificial Intelligence and Intelligent Agents. http://www.idi.ntnu.no/~agent/. Lecturer: Sobah Abbas Petersen Email: [email protected]. Lecture Outline. Multi-agent Systems Utility and Preferences Game Theory and Payoff Matrices - PowerPoint PPT Presentation
Popular Tags:
43
SIF8072 Distributed Artificial Intelligence and Intelligent Agents http://www.idi.ntnu.no/~agent/ Lecture 2: Multi-agent Interactions Lecturer: Sobah Abbas Petersen Email: [email protected]
Transcript
Page 1: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

SIF8072 Distributed Artificial Intelligence

andIntelligent Agents

http://www.idi.ntnu.no/~agent/

Lecture 2: Multi-agent Interactions

Lecturer: Sobah Abbas PetersenEmail: [email protected]

Page 2: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

2

Lecture Outline

1. Multi-agent Systems

2. Utility and Preferences

3. Game Theory and Payoff Matrices

4. Strategies

5. Negotiation - Auctions

6. Summary

Page 3: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

3

References

Wooldridge: ”Introduction to MAS”– Multi-agent Interactions: Chapters 6– Auctions: Chapter 7

Page 4: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

4

Interactions

”The world functions through interacting agents. Each person pursues his/her own goals through encounters with other people or machines.”

”Rules of Encouter” by Rosenchein and Zlotskin, 1994

Page 5: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

5

Example 1

Two students decide to work together on their

exercises. They have to decide upon a time. One

prefers to work on Thursday afternoons after the

lecture while the other prefers to work on Friday

morning. How do they decide upon a time to do

the work?

Page 6: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

6

Example 2

A friend invites you out for a drink and the cinema

tonight. But your favourite TV program is on tonight.

You think:

– It would be nice to go out with my friend, but it’s cheaper to

watch TV.

– If you stay at home and watch TV, you might not have a chance

to go out with your friend for a long time.

– I can always record the program and watch it afterwards.

– I can invite my friend home.

Page 7: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

7

Multi-agent Systems (MAS)

• Contains a number of agents which:

– interact with one another through communication

– are able to act in an environment

– have different ”spheres of influence”

– may be linked by other relationships, e.g. organisational

• It is important to understand the type of interaction.

• Each agent can be assumed to be self-interested:

– has its own preferences and desires about how the world should be.

Page 8: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

8

Multi-agent Systems (MAS)

Environment

Sphere of influence

Multi-agent System

AgentInteractionOrganisational relationship

Page 9: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

9

Utilities and Preferences

• Assume we have 2 agents: Ag = {i,j}.

• Assume ={ 1, 2,….} is the set of ”outcomes” that

agents have preferences over.

• We capture preferences by utility functions:• ui : IR

• uj : IR

• Utility functions lead to preference orderings over outcomes:

≥ i ’ means ui() ≥ ui(’)

> i ’ means ui() > ui(’)

Page 10: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

10

What is Utility?

• Utility is not money, but a useful analogy

• Typical relationship between utility and money: Utility

Money

Page 11: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

11

Multi-agent Encounters 1

• Need a model of the environment in which the agents will act.

• Agents simultaneously choose an action and, as a result, an outcome in will result.

• Actual outcome depends on a combination of actions.

• Environment behaviour given by state transformer function: (reference: p31 of textbook)

: Ac Ac

Agent i’s action Agent j’s action

Page 12: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

12

Multi-agent Encounters 2

• Assume that each agent has two possible actions:

1. C: cooperate

2. D: defect Let Ac = {C,D}

Page 13: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

13

State Transformer Funtions

• Environment sensitive to actions of both agents:

(D,D)= 1 (D,C)= 2 (C,D)= 3 (C,C)= 4

• Environment where neither agent has any influence:

(D,D)= 1 (D,C)= 1 (C,D)= 1 (C,C)= 1

• Environment controlled by j:

(D,D)= 1 (D,C)= 2 (C,D)= 1 (C,C)= 2

Let Ac = {C,D}

Page 14: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

14

Agent’s Preference

• Consider the case where both agents influence the outcome and they have the following utility functions:

ui(1 )=1 ui(2 )=1 ui(3)=4 ui(4 )=4

uj(1 )=1 uj(2 )=4 uj(3)=1 uj(4 )=4

ui(D,D)=1 ui(D,C)=1 ui(C,D)=4 ui(C,C)=4

uj(D,D)=1 uj(D,C)=4 uj(C,D)=1 uj(C,C)=4

• Then, agenti’s preferences are:

C,C i C,D i D,C i D,D

Agenti preferes all outcomes that arise through C over all

outcomes that arise through D

Page 15: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

15

Payoff MatricesWe can characterise the previous scenario in a payoff matrixe.g. Top right cell:

i cooperates, j defects

Defect

Coop

Defect Coop

j

i

1

1

1

4

4

1

4

4

•Agent i is the column player

(payoff received by i shown in top right of each cell)

•Agent j is the row player

Page 16: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

16

Game Theory

• A mathematical theory that studies interactions about

self-interested agents.

• Essential elements of a game are:– Players (2 or more)

– Some choice of action (strategy)

– One or more outcomes (someone wins, someone loses)

– Information

• Suitable for situations where the other agent’s (player’s)

behaviour matters.

Page 17: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

17

The Prisoner’s Dilemma 1

• 2 men are collectively charged with a crime and held in separate cells. They have no way of communicating with each other or making an agreement. They are told:

– if one confesses and the other does not, confessor will be freed and the other jailed for 3 years.

– if both confess, then each will be jailed for 2 years.– If neither confess, then each will be jailed for 1 year.

•Confessing => defecting (D)•Not confessing => cooperating (C)

If you were one of the prisoners, what would you do?Discuss your answer with your neighbour.

Page 18: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

18

The Prisoner’s Dilemma 2

Payoff matrix for Prisoner’s Dilemma:

Defect

Coop

Defect Coop

j

i

22

41

14

33

• Top left: If both defect, punishment for mutual defection.

• Top right: if i cooperates and j defects,i gets sucker’s payoff of 1 while j gets 4.

• Bottom left: if j cooperates and i defects,j gets sucker’s payoff of 1 while i gets 4.

• Bottom right: Reward for mutual cooperation.

Numbers in the payoff matrix reflect how good an outcome is for the agent. e.g.ui(D,D)=2 ui(D,C)=4 ui(C,D)=1 ui(C,C)=3

uj(D,D)=2 uj(D,C)=1 uj(C,D)=4 uj(C,C)=3

Page 19: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

19

The Prisoner’s Dilemma 3

• The individual rational agent will defect!

– This guarantees a payoff of no worse than 2

– Cooperating guarantees a payoff of at most 1

• Defection is the best response to all possible strategies

– Both agents defect and get a payoff = 2.

• If both agents cooperate, they will each get payoff = 3.– (The other prisoner is my twin!)

Can we recover cooperation?The Iterated Prisoner’s Dilemma

Page 20: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

20

Let’s take a minute…..

How can we apply the Prisoner’s Dilemma to real

situations?

• e.g. Arms races – nuclear weapons

compliance treaty between two countries.

• Can you think of other situations?

Page 21: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

21

Strategies

• ”A strategy is the way an agent behaves in

an interaction”. (Ref: Rosenchein and Zlotskin, 1994)

– From game theory: strategies are actions of agents (Ac)

• When 2 agents encounter, important

question: What should I do?

Page 22: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

22

Dominance

• Given any particular strategy s (e.g. C or D),

there will be a number of outcomes.

• We say that s1 dominates s2 if every outcome

possible by i playing s1 is preferred over every

outcome possible by i playing s2.

Page 23: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

23

Nash Equilibrium

• 2 strategies s1 and s2 are in Nash Equilibrium if:

– Under the assumption that agent i plays s1, agent j can do no better

than play s2;

– Under the assumption that agent j plays s2, agent i can do no better

than play s1;

• Neither agant has any incentive to deviate from a Nash Equilibrium.

• Unfortunately:– Not every interaction scenario has a Nash Equilibrium.

– Some interaction scenarios have more than one Nash Equilibrium.

Page 24: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

24

Nash Equilibrium - Example

The Battle of the Sexes• Conflict between a man and a woman,

where the man wants to go to a Prize Fight and the woman wants to go to a Ballet

• They are deeply in love. So, they would make a sacrifice to be with each other.

2 Nash Equilibria Strategy combination (Prize Fight, Prize

Fight) Strategy combination (Ballet, Ballet)

Woman

Prize Fight

Ballet

Prize Fight Ballet

Man 21

00

00

12

Ref: ”Games and Information, E. Rasmussen, 2001

Page 25: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

25

Let’s play a little game…..

Guess half the average

• Choose a number between 0 and 100. Your

aim is to choose a number that is closest to

half the average of the numbers chosen by all

the students.

• What is your number?

Page 26: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

26

Competitive and zero-sum Interactions

• One agent can only get a more preferred

outcome at the expense of the other agent strictly competitive.

• Zero-sum encounters

– ui () + uj () = 0, for all .

– e.g. A football game where only one team can win.

Page 27: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

27

Assumptions in Game Theory

• All Players behave rationally– Not always the case with all agents!

• Each player knows the rule.

• Payoffs are known and fixed.

These are limitations!

Page 28: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

28

Multi-agent Interaction: Summary

• MAS: a number of agents which interact with one another through

communication.

• An agent’s action results in an outcome in the environment.

• Utility functions are used for preference orderings.

• Game theory – a mathematical theory that studies interactions among

agents.

• An agent’s action is a strategy:

– Dominant

– Nash Equilibrium

Page 29: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

29

Negotiation

• ”The process of several agents searching for an agreement”e.g. about price.

Reaching consensus

”Rules of Encouter” by Rosenchein and Zlotskin, 1994

Page 30: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

30

Auction: Example 1

Several millions of $ paid for art

at auction houses such as

Sotheby’s.

Ears 2 u, Vincent!

Page 31: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

31

Auction: Example 2

Online Auctions

You want to buy some exciting video games.

You see that there are some available on eBay.

You register at eBay and offer a bid for some of

these games.

                         

Page 32: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

32

Auctions

• An Auction takes place between an auctioneer and a

collection of bidders.

• Goal is for the auctioneer to allocate the goods to

one of the bidders.

• In most settings, the auctioneer desires to maximise

the price; bidders desire to minimise the price.

auctioneer

bidders

auctioneer

bidder

Price

Page 33: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

33

Auction Parameters

Value of goods Private,public/common, Correlated

Winner determination First price,second price

Bids may be Open cry, Sealed

Bidding may be One shot,ascending,descending

Page 34: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

34

English Auctions

• English auctions are:

– First price

– Open cry

– Ascending

• Dominant strategy: successively bid a small amount more than the

highest current bid until it reaches the valuation, then withdraw.

• Susceptible to Winners curse

– Winner is the one who overvalues the goods on offer and may end up

paying more than its worth.

auctioneerBidder 1

Price

Bidder x

Page 35: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

35

Dutch Auctions

• Dutch auctions are:

– Open cry

– Descending

• Auctioneer starts at an artificially high price. Then continually

lowers the offer price until an agent makes a bid which is equal

to the current offer price.

• Dominant strategy: None

• Susceptible to Winners curse

auctioneer

Bidder

Price

auctioneer

Page 36: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

36

First-price Sealed-bid Auctions

• One shot auction

• Single round, where bidders submit a sealed-bid for

the good.

• Good is awarded to agent that made the highest bid.

• Winner pays price of highest bid.

• Best strategy: bid less than true value.

auctioneer

Bidders

Page 37: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

37

Vickrey Auctions

• Vickrey auctions are:

– second-price

– sealed-bids

• Good is awarded to agent that made the highest bid.

• Winner pays price of second highest bid.

• Best strategy: bid the true value.

• Susceptible to anti-social behaviour

Page 38: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

38

Lies and Collusions

• Lies:

– By the bidders (e.g. In Vickrey auctions)

– By the auctioneer (shills, in Vickrey auction)

• Collusion of bidders

– Coalition of bidders where they agree beforehand to put

forward artificially low bids for the good on offer. When

the good is obtained, the bidders can then get the true value

of the good and share the profits.

Page 39: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

39

Limitations of Auctions

• Only concerned with the allocation of goods;

• Not adequate for settling agreements that

concerns matters of mutual interest.

Negotiation

Page 40: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

40

Let’s take a minute……

• Can you think of any auctions that you have come

across?

• How about offering your notebook to the highest

bidder at the end of the year…..

• Discuss with your neighbour.

Page 41: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

41

…..Selecting a Bid

Page 42: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

42

Auctions: Summary• An Auction takes place between an auctioneer and a collection of

bidders.

• In most settings, the auctioneer desires to maximise the price; bidders

desire to minimise the price.

• Types of Auctions:

– English auction

– Dutch auction

– First-price sealed bids

– Vickrey (Second-price sealed bids)

• Useful for allocating goods. But too simple for many other settings.

Page 43: SIF8072  Distributed Artificial Intelligence and Intelligent Agents

43

Next Lecture: Negotiation

Will be based on:

”Reaching Agreements”, Chapter 7 in Wooldridge: ”Introduction to MultiAgent Systems”

Coordination – Working together, Chapter 9


Recommended