+ All Categories
Home > Documents > GTAP Sessions 11-15 Mixed Strategy

GTAP Sessions 11-15 Mixed Strategy

Date post: 02-Jun-2018
Category:
Upload: pratheek-ps
View: 224 times
Download: 0 times
Share this document with a friend

of 58

Transcript
  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    1/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    GAME THEORY & APPLICATIONS

    J. Ajith Kumar, TAPMI, Manipal

    Mixed Strategy

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    2/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    You are asked to make an investment in ONEof two

    ventures, given the following information:

    2

    qIf you invest in Venture A, there is a 5% chance of

    making a profit of Rs. 80 crores, and a 95% chance

    of no profit at all.qBut, if you invest in Venture B, you are fully assured

    of a profit of Rs. 8 crores.

    Which venture will youinvest in A or B?

    (there is no correctanswer).

    OPENING EXERCISE

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    3/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    Lottery A: Is for Rs. 10 crore, and 250,000 people will

    purchase it.

    3

    There is no one correctanswer

    OPENING EXERCISE

    You have Rs. 100 with you, with which you have to

    purchase either of two lotteries.

    The only winner of each lottery gets the full prize but a

    loser loses even the Rs. 100. Which lottery will you

    take?

    Lottery B: Is for Rs. 2 crore, and 25,000 people willpurchase it.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    4/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 4

    MATCHING PENNIES GAME (MULTIPLE ROUNDS)

    Each player chooses Redor Black. If players choose the same

    colour, Player 1 wins. If players choose different colours, Player 2wins.

    We will do 30 rounds. In each round, each player must guess

    what the other player will choose, and then make his own

    choice.

    At the end, each player will describe his thinking to all us.

    What is the result?

    After each round, players get to see what their respective

    choices were and who won the round.

    Players will be given a few seconds between rounds to think

    over their choice of the next round.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    5/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 5

    UNDERSTANDING MIXED STRATEGY

    MANOJ

    Park Beach

    ALOK

    Park 100, 80 0, 0

    Beach

    0, 0 60, 120

    Alok and Manoj are two elderly

    citizens who stay in two different

    localities of the same town. Every

    evening both of them visit either

    the park or the beach (there is only

    one of each in the town).

    Does either player have a dominant strategy?

    What are the PSNE? Are there any MSNE in this game?

    If they both visit the same place then they enjoy each

    otherscompany and that leads to positive payoffs for

    both. If they visit different places, neither benefits.

    The payoffs are given here.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    6/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 6

    UNDERSTANDING MIXED STRATEGY

    A mixed strategy of a given player is a probability

    distribution that the player assigns to his set of pure

    strategies.

    e.g. in this game, Aalok = (Park, Beach)

    The pure strategies that Alok can choose are Park, Beach.

    However, Alok can mixthe two and choose to go to

    the Park with probability p, and to the Beach with

    probability1p.

    If p = 0.1, then Alok = (0.1, 0.9) is a mixed strategy

    played by Alok.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    7/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 7

    UNDERSTANDING MIXED STRATEGY

    Similarly, Manoj can decide to mix his strategies andplay Park and Beach with probabilities q and 1q

    respectively.

    If q = 0.2, then Manoj = (0.2, 0.8) is a mixed strategy

    played by Manoj.

    Since bothpandqcan take infinite values between 0

    and 1, Alok and Manoj can choose between infinite

    possible strategies.

    A pure strategy is a special case of mixed strategy,

    when p (or q) is 0 or 1.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    8/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 8

    MIXED STRATEGY PROFILES

    We can assign to each mixed strategy an additional row

    or an additional column, depending upon the playerthat is playing it.

    MANOJ

    Park Manoj Beach

    ALOK

    Park 100, 80 ?, ? 0, 0

    Alok

    ?, ? ?, ? ?, ?

    Beach

    0, 0 ?, ? 60, 120

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    9/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 9

    THE MEANING OF MIXED STRATEGYWhat does it mean to say Alok willmix strategies and play (0.1, 0.9)?

    q From Aloks viewpoint: On 10% of the days, Alok will go to the Park, and on

    the remaining 90%, he will go to the Beach. Thus, when the game is

    played over a large number of rounds, the mixing of the two pure

    strategies happens in the proportions indicated by the mixed strategies.

    q However, on any given day, Alok will either go to the Park or the Beach.

    Thus, whenever Alok plays the game, he actually plays only a pure strategy.

    Manoj also knows this.

    Simple interpretation 1: Randomization

    This means that the mixed strategy is an abstraction and is never

    actually playedby the player in any particular round of the game.

    q From Manojs viewpoint: Manojs perception of how likely Alok is, to

    choose either place. In this example, Manoj might think there is a 10%

    chance that Alok will go to the Park and a 90% chance that he will go to the

    beach. If so, Manoj will say that Alok is playing the mixed strategy - (0.1,

    0.9).

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    10/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 1

    q Alok and Manoj are not particular individuals but represent two different

    populations. In this example chosen, a member of each type gets value

    only in interaction with a member of the other type and no value

    otherwise. Further, each population has its preferred distribution.

    In any given game, these are two useful ways of interpreting

    to create meanings as suitable to the context of that game.

    Simple interpretation 2: Population preference

    q Thus, Alok plays (0.1, 0.9)means that when asked what they prefer, 10%

    members of the Alok population will choose Park and 90% will choose

    Beach. Thus the mixed strategy represents the preference distribution of

    the population.

    THE MEANING OF MIXED STRATEGYWhat does it mean to say Alok willmix strategies and play (0.1, 0.9)?

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    11/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 11

    MIXED STRATEGY & EXPECTED PAYOFFS

    IfAlok = (0.1, 0.9) and Manoj= (0.2, 0.8) then what are their respective payoffs?

    We answer this by computing expected values that

    are probability-weighted averages of the payoffs

    they get by playing their respective pure strategies

    MANOJ

    Park Beach

    ALOK

    Park 100, 80 0, 0

    Beach

    0, 0 60, 120

    If Alok plays the pure strategy Park,given that Manoj is playing (0.2, 0.8),

    Aloks payoffs will be:

    Payoffs for Alok

    UAlok(Park ,Manoj) = 100*0.2 + 0*0.8 = 20

    Likewise, if Alok plays the pure strategy

    Beach

    :

    UAlok(Beach,Manoj) = 0*0.2 + 60*0.8 = 48

    Now, Alok himself is playing a mixed

    strategy (0.1, 0.9). Hence,

    UAlok(Alok,Manoj)=20*0.1 + 48*0.9 = 45.2

    Payoffs for Manoj(do this yourself)

    UManoj(Alok ,Manoj) = 88

    Hence, the payoffs are: (45.3, 88)

    Note: we work with

    absolute values here, not

    ordinals.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    12/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    We can now generalize the calculation of expected payoffs

    MANOJ

    Park (q) Beach (1-q)

    ALOK

    Park(p)

    p.q p.(1-q)

    Beach(1-p)

    (1-p).q (1-p).(1-q)

    Pure strategy payoff matrix Likelihood that a given strategy

    profile will be playedU1(1,2) = U1(Park, Park)*p.q + U1(Park, Beach)*p.(1q) +

    U1(Beach, Park)*(1p).q + U1(Beach, Beach)*(1p).(1q)

    U2(1,2) = U2(Park, Park)*p.q + U2(Park, Beach)*p.(1q) +

    U2(Beach, Park)*(1p).q + U2(Beach, Beach)*(1p).(1q)

    MANOJ

    Park (q) Beach (1-q)

    ALOK

    Park(p)

    100, 80 0, 0

    Beach(1-p)

    0, 0 60, 120

    12

    MIXED STRATEGY & EXPECTED PAYOFFS

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    13/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    MIXED STRATEGY NASH EQUILIBRIUM

    Being a rational actor, each player is keen to maximize his own

    payoffs.First, we look for the PSNEs. Are there any?

    Yes: (Park, Park) and (Beach, Beach) are both PSNEs. If one player

    knows with certainty that the other player is choosing Park, then

    he will also choose Park. Likewise, for Beach.

    However, here the players cannot know with certainty what the

    other player might do. They can at best make intelligent

    guesses about the likelihoods that other players will attach to

    their choices. In response, they can also play their choices with

    probabilities, i.e. they can each randomize.

    Since players can randomize, are there MSNEs also? Are there

    mixed strategies for each player, that are best responses to each

    other? If yes, how do we find them?

    13

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    14/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    LetAlokbe (p, 1p) and Manojbe (q, 1q), then

    The payoffs to Alok if he chooses Park / Beach:

    MANOJ

    Park (q) Beach (1-q)

    ALOK

    Park(p)

    100, 80 0, 0

    Beach(1-p)

    0, 0 60, 120

    UAlok(Park ,Manoj) = 100*q + 0*(1q) = 100q

    UAlok(Beach,Manoj) = 0*0.2 + 60*(1q) = 6060q

    What do the utility functions tell us?

    100

    60

    0

    1

    UAlok

    If Manoj is playing q < 3/8, then Alok will

    receive a higher payoff by playing Beach. If

    q > 3/8, then playing Park gives a higher

    payoff. But if Manoj plays q = 3/8, then Alok

    gets the same payoffs forBeachand Park.

    3/8

    37.5

    At q = 3/8, Alok becomes

    indifferent between his choices.

    Beach

    Park

    q 14

    MIXED STRATEGY NASH EQUILIBRIUM

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    15/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    Now, computing utilities for Manoj if Alok plays (p, 1p)

    MANOJ

    Park (q) Beach (1-q)

    ALOK

    Park(p)

    100, 80 0, 0

    Beach(1-p)

    0, 0 60, 120

    UManoj(Alok, Park) = 80*p + 0*(1p) = 80pUManok(Alok, Beach) = 0*p + 120*(1p) = 120120p

    120

    80

    01

    If Alok is playing p < 3/5, then Manoj

    will receive a higher payoff by playing

    Beach. If p > 3/5, then playing Park

    gives a higher payoff. But if Alok plays

    p = 3/5, then Manoj gets the samepayoffs for Beachand Park.

    3/5

    48

    At p = 3/5, Manoj becomes

    indifferent between his choices.

    Beach

    Park

    UManoj

    p15

    MIXED STRATEGY NASH EQUILIBRIUM

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    16/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    From the utility functions, we can extract the best response

    functions.

    1

    p = BAlok(q)

    3/8

    p=0

    beach q

    p=1

    park1

    q = BManoj(p)

    3/5

    q=0

    beach

    p

    q=1

    park

    q

    Best response of Alok to Manoj

    16

    We can get the Nash equilibrium by overlapping the best response

    functions.

    Best response of Manoj to Alok

    MIXED STRATEGY NASH EQUILIBRIUM

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    17/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    Overlapping the best response functions:

    p = BAlok(q)

    3/8

    p=0beach

    q

    p=1

    park

    q = BManoj(p)3/5

    q=0

    beachq=1

    park

    The best response functions

    intersect at: p = 3/5, q = 3/8

    17

    Hence, the Mixed Strategy Nash Equilibrium =

    { (3/5, 2/5), (3/8, 5/8) }

    This means that the

    equilibrium strategies of Alok

    and Manoj are:

    *Alok = (3/5, 2/5) and

    *Manoj = (3/8, 5/8)

    respectively.

    Can also be expressed as = { (0.6, 0.4), (0.375, 0.625) }

    MIXED STRATEGY NASH EQUILIBRIUM

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    18/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    Making sense of the MSNE

    The MSNE is: { (0.6, 0.4), (0.375, 0.625) }. What does ittell us?

    18

    The likelihood is 60% that Alok will go to the Park and

    40%, the Beach. The corresponding likelihoods forManoj are 37.5% and 62.5%.

    If the other person sticks to his strategy, then neither

    has an incentive to change his strategy. For eachperson, his strategy is now his best response to the

    others strategy.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    19/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    Making sense of the MSNE

    19

    Now suppose, Alok increases his likelihood of going to

    the Park to 75%. Then how should Manoj respond?

    What is his best response?

    As per our model, Manoj should respond by only going to thePark: (1, 0).

    For any likelihood of Aloks going to Park that is < 60%, Manojs

    payoff is always higher in going to the Beach than to the Park.

    Review Questions

    If Alok plays (0.55, 0.45), what is Manojs best response?

    If Manoj plays (0.45, 0.55), what is Aloks best response?

    If Manoj plays (0.35, 0.65), what is Aloks best response?

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    20/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    Opponents Indifference Property

    The strategy chosen by each player in a MSNE makes the

    other player indifferent between his choices (this can be

    proved as a general result.)

    20

    e.g. when Alok plays (0.6, 0.4), then Manoj gets the

    same payoff regardless of what strategy he plays.

    whether he plays (0.1, 0.9), (0.375, 0.625), (0.8, 0.2) or

    anything else he gets a payoff of exactly 48.

    Question: If that is true, then why should Manoj play

    (0.375, 0.625), his equilibrium strategy?Answer: If Manoj plays anything other than (0.375, 0.625), then

    Alok will respond by moving away from (0.6, 0.4) towards one of

    the pure strategiesThat will cause a disequilibriumand Manoj

    will then have to respond by changing his strategyand so on

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    21/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    Aloks equilibrium strategy (p*, 1p*) is such that it

    makes Manoj indifferent between his pure strategies.

    21

    80.p* + 0.(1p*) = 0.p* + 120.(1p*)

    200.p* = 120; p* = 0.6

    Hence, Aloks equilibrium strategy: (0.6, 0.4).

    Opponents Indifference Property

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    22/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 22

    Manoj

    s equilibrium strategy (q*, 1

    q*) is such that itmakes Alok indifferent between his pure strategies.

    100.q* + 0.(1q*) = 0.q* + 60.(1q*)

    160.q* = 60; q* = 0.375

    By this route, we dont need to determine the best

    response functions etc.!

    Hence, Manojs equilibrium strategy: (0.375, 0.625).

    Opponents Indifference Property

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    23/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    Opponents Indifference Property

    23

    LESSON: Each players strategy in a MNSE is suchthat it makes the other player indifferent between

    her pure strategies that she plays with a positive

    probability.

    In a MSNE, given the other players strategies, the

    focal players expected payoff is,

    q equal for all pure strategies to which sheassigns a

    positive probability,

    q which is at least as good as those for pure

    strategies to which sheassigns a zero probability.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    24/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    Opponents Indifference Property

    24

    Helps us find the MSNE in a game.

    Helps us test whether a given profile is a MNSE.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    25/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    TESTING FOR MSNE

    [ (3/4, 0, 1/4), (0, 1/3, 2/3) ]

    25

    PLAYER 2L C R

    PLAYER1T

    1, 2 3, 3 1, 1

    M 2, 1 0, 1 2, 0

    B 0, 4 5, 1 0, 7

    U1(T ,2) = (0)*1 + (1/3)*3 + (2/3)*1 = 5/3

    U1(M ,2) = (0)*2 + (1/3)*0 + (2/3)*2 = 4/3

    U1(B,2) = (0)*0 + (1/3)*5 + (2/3)*0 = 5/3

    U2(1, L) = (3/4)*2 + (0)*1 + (1/4)*4 = 5/2

    U2(1, C) = (3/4)*3 + (0)*1 + (1/4)*1 = 5/2

    U2(1, R) = (3/4)*1 + (0)*0 + (1/4)*7 = 5/2

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    26/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    TESTING FOR MSNE

    [ (3/4, 0, 1/4), (0, 1/3, 2/3) ]

    26

    are the same (i.e. 5/3) for the two pure strategies, T and B, to

    which she has assigned positive probability,

    which are at least as good as that for M (i.e. 4/3), to which shehas assigned zero probability.

    Player 1s expected payoffs, given Player 2 is playing (0, 1/3, 2/3):

    are the same (i.e. 5/2) for the two pure strategies, C and R, to

    which she has assigned positive probability,which are at least as good as that for L (i.e. 5/2), to which she

    has assigned zero probability.

    Player 2s expected payoffs, given Player 1 is playing (3/4, 0, 1/4):

    Hence, the given profile is a MSNE

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    27/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    TESTING FOR MSNE

    [ (1/3, 1/3, 1/3), (1/4, 1/2, 1/4) ]

    27

    PLAYER 2L C R

    PLAYER1T 1, 2 3, 3 1, 1

    M 2, 1 0, 1 2, 0

    B 0, 4 5, 1 0, 7

    U1(T ,2) = (1/4)*1 + (1/2)*3 + (1/4)*1 = 2

    U1(M ,2) = (1/4)*2 + (1/2)*0 + (1/4)*2 = 1

    U1(B,2) = (1/4)*0 + (1/2)*5 + (1/4)*0 = 5/2

    U2(1, L) = (1/3)*2 + (1/3)*1 + (1/3)*4 = 7/3

    U2(1, C) = (1/3)*3 + (1/3)*1 + (1/3)*1 = 5/3

    U2(1, R) = (1/3)*1 + (1/3)*0 + (1/3)*7 = 8/3

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    28/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    TESTING FOR MSNE

    28

    are different across the three pure strategies, T, M and B, to

    which she has assigned positive probabilities,

    Player 1s expected payoffs, given Player 2 is playing (1/4, 1/2, 1/4):

    are also different across the three pure strategies, L, C and R, to

    which she has assigned positive probabilities,

    Player 2s expected payoffs, given Player 1 is playing (1/3, 1/3, 1/3):

    Hence, the given profile is not a MSNE

    [ (1/3, 1/3, 1/3), (1/4, 1/2, 1/4) ]

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    29/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    FINDING THE MSNE

    29

    The method of using best responses is easy to use in 2-

    player 2-action games, but becomes difficult to apply

    when the number of actions is more than two.

    In 2-player games with more than 2 actions we can use a

    trail and error method, in conjunction with theOpponents Indifference Property to find the MSNE.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    30/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    FINDING THE MSNE

    30

    PLAYER 2B S X

    PLA

    YER1B 4, 2 0, 0 0, 1

    S 0, 0 2, 4 1, 3

    Find all the NE in this game:

    Enumerate all possible profile types, by varying each

    players playing his pure strategies with a positive or zero

    probability.

    A1: {B, S} and A2: {B, S, X}

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    31/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    FINDING THE MSNE

    31

    (1, 0), (1, 0, 0)

    (1, 0), (0, 1, 0)

    (1, 0), (0, 0, 1)

    (1, 0), (q, 0, 1q)

    (1, 0), (0, q, 1q)(1, 0), (q, 1q, 0)

    (1, 0), (q1, q2, 1q1q2)

    (0, 1), (1, 0, 0)

    (0, 1), (0, 1, 0)

    (0, 1), (0, 0, 1)

    Both players assign positive probability to

    only one of their actions.

    Player 1 assigns positive probability to only one action; Player 2 to

    more than one action.

    (0, 1), (q, 0, 1q)

    (0, 1), (0, q, 1q)(0, 1), (q, 1q, 0)

    (0, 1), (q1, q2, 1q1q2)

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    32/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    FINDING THE MSNE

    32

    (p, 1p ), (1, 0, 0)

    (p, 1p ), (0, 1, 0)

    (p, 1p ), (0, 0, 1)

    (p, 1p ), (q, 0, 1q)

    (p, 1p ), (0, q, 1q)

    (p, 1p ), (q, 1q, 0)

    (p, 1p ), (q1, q2, 1q1q2)

    Both players assign

    positive probability to

    more than one action.

    Player 2 assigns positiveprobability to only one action;

    Player 1 to more than one action.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    33/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    FINDING THE MSNE

    33

    PLAYER 2B S X

    P

    LAYER1B 4, 2 0, 0 0, 1

    S 0, 0 2, 4 1, 3

    This game has three MSNEs, of which two are PSNEs and

    one is not:

    [ (1, 0), (1, 0, 0) ],

    [ (0, 1), (0, 1, 0) ],

    [ (3/4, 1/4), (1/5, 0, 4/5) ]

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    34/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    NASH EQUILIBRIUM PURE & OTHERWISE

    NASH EQUILIBRIUM

    34

    Those that are PSNE

    (made up of only pure

    strategies)

    Those that are not PSNE

    (at least one player is not

    playing a pure strategy)

    Remember: a pure strategy is a special case of a mixed strategy.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    35/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    NASH EXISTENCE THEOREM

    Every finite game has at least one equilibrium.

    35

    Finite game: has a finite number of players, each with a

    finite number of pure strategies (actions).

    Is it possible that in a game that is not finite, there is

    no NE at all?

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    36/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    EXERCISE

    Find all the NEs in this game.

    36

    MANOJ

    Park (q) Beach (1-q)

    ALOK P

    ark(p)

    40, 40 10, 70

    Beach(1-p)

    70, 10 20, 20

    q We cannot find any that is not a PSNE.

    q We only have one NE that is a PSNE [(0,1), (0,1)].

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    37/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    EXERCISE

    Find all the NEs in this game.

    37

    SAILESH

    Red (q) Black (1q)

    KAVISH R

    ed(p)

    1, 1 1, 1

    Black(1

    p)

    1, 1 1, 1

    q Only NE is: [(0.5, 0.5), (0.5, 0.5)}there is no PSNE.

    q Recollect the results we got in the class??

    MANISH

    Right (q) Left (1q)

    NIKHIL R

    ight(p)

    1, 1 1, 1

    Left(1

    p)

    1, 1 1, 1

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    38/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    EXERCISE

    Find all the NEs in this game.

    38

    WIFE

    C (q) M (1q)

    HUSBA

    ND C

    (p)

    30, 20 0, 0

    M

    (1p)

    0, 0 20, 30

    q Three NEs:[(1, 0), (1, 0)], [(0, 1), (0, 1)], [(0.6, 0.4), (0.4, 0.6)].

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    39/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    GENERAL REALIZATIONS

    39

    q Sometimes, a finite game can have more than one equilibrium.

    q A PSNE is a degenerate case of a MSNE, where in the player

    chooses one action with probability 1 and all others withprobability 0.

    qWe can have three types of games:

    q Games in which all equilibria are only PSNE.

    q Games in which no equilibrium is a PSNE (e.g. Matching

    Pennies),

    q Games in which some equilibria are PSNE and some are

    not (e.g. Battle of the Sexes).

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    40/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    EXERCISE SET C

    40

    q Problem 6: Hawk Dove Game.

    q Problem 9: Swimming with sharks.

    q Problem 10: Testing for MSNE.

    q Problem 13: Defending Territory.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    41/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 41

    Do people actually

    play mixed

    strategies as

    predicted by Game

    Theory?

    Palacious-Huerta, I. (2003),

    Professionals Play

    Minimax, Review of

    Economic Studies, Vol. 70,

    pg. 395-415.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    42/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 42

    Do people actually play mixed strategies as

    predicted by Game Theory?

    qAnalyzed data from 1417 Penalty kicks from FIFA games: Spain,England, Italy.

    The payoff (success) matrix.

    q Given this payoff matrix, identify the kickers and goalies

    probabilities of going left and right, in a Nash Equilibrium.

    Is this Nash Equilibrium actually observed in practice?

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    43/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 43

    Do people actually play mixed strategies as

    predicted by Game Theory?

    This was observed in the data!

    qPeople dont use payoff matrices to calculate

    equilibrium values, before deciding how to act.

    qYet, in this study, Kicker and Goalie aggregatedbehaviour over time is consistent with the Nash

    Equilibrium prediction!

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    44/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 44

    Does randomization happen in the business

    and administration?

    qPolice checking of drunken driving and motorists

    strategies.

    qTax raids on companies & companiesstrategies.

    qVVIP security.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    45/58

    apProf. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES

    Exercise: Swimming with SharksProblem 9 of Exercise Set C

    45

    Alternatives: Yes (swim on the first day),

    No (dont swim on first day)

    Friend

    You

    Yes No

    No

    Yes

    What doesc represent?

    cis the ratio of the magnitude of displeasure experienced in a shark attack,

    to the magnitude of pleasure experienced in swimming without being

    attacked.

    What does represent?

    is players perceived likelihood of sharksexisting in the water.

    TRY THIS OUT ON YOUR OWN. AFTER YOU HAVE ATTEMPTED IT

    THOROUGHLY, COMPARE YOUR WORK WITH MY EFFORTS HERE.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    46/58

    ap Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 46

    Possible situations that can be modeled by this type of game

    q You are considering watching the first day first show of a new movie (there

    is a certain thrill in doing so and ideally, you wouldnt want to lose that). If

    you find the movie good, you will surely watch it a second time. But you

    also realize that there is a probability the movie will not be good and

    associate a pain cwith a bad experience. Will you watch it first and then

    tell your friend, or wait for friends to watch it and then tell you?

    q Two firms from the same country are considering investing in a new

    country, about whose market there is some uncertainty. Both can benefit

    from investing in the first year itself, however there is a possibility of things

    going wrong. Thus each might benefit by waiting for the other firm to

    invest, watching the result, and then deciding whether to invest in the

    second year or not.

    q Any situation where being the first mover can sometimes be better andsometimes worse than being a late mover.

    Other situations where this can be used?

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    47/58

    ap Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 47

    CREATIVE EXERCISE

    q groups of 4 we will have 15 posters from each

    section.

    q Form your groups & enter into Google Spreadsheet

    (Monday, Oct 20th, 11:59 pm). No change after that.

    q Two PLs in each section to plan & execute the

    whole project.

    qA project of GTAPers of 2013-15 batch.

    q demonstrate that Game Theory is useful to

    management in organizations.

    qIn-class presentation & Poster exhibition.

    MORE INSTRUCTIONS BY OCT 23RD

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    48/58

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    49/58

    ap Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 49

    Iterated Elimination of Strictly Dominated Strategies

    Consider the Prisoners Dilemma game PRISONER BQuiet Admit

    PRISONERA

    Quiet

    1, 1 10, 0

    Admit

    0, 10 5, 5

    Instead of examining each action profile and testing to

    see if it is a Nash Equilibrium, we can instead proceed

    with reasoning as follows:

    Player 1 thinks: Is there any strategy that Player 2 will surely not play?

    He examines Player 2s payoffs and finds that Player 2s strategy Quiet is

    strictly dominated by his strategy Admit. Hence, Player 1 concludes that

    Player 2 will never play Quiet. Which means that Player 2 can play only

    Admit. If that is so, Player 1 asks what should my response be?

    Obviously, Admitsince, for him too,Quietis strictly dominated by Admit.

    He then asks: Player 2 will think about what I will do. How will he think?. He

    realizes that Player 2 will reason about him (Player 1) similar to the way he

    reasoned about Player 2 and conclude that he (Player 1) would playAdmit.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    50/58

    ap Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 50

    Iterated Elimination of Strictly Dominated Strategies

    More generally speaking, each player can think over

    q What is the best strategy for me / others as well as what is never a best

    strategy for myself / others?

    q What the other player(s) think(s) about what I think as the best strategy

    for me / themselves as well as what is never a best strategy for me /

    themselves?

    q What the other player(s) think(s) about what I think about their thinking

    about the best strategy for me / themselves as well as what is never a

    best strategy for me / themselves? on so on

    It becomes clear from this way of thinking that each player caneliminate all strategies of himself / others that can never be a best

    strategy since that strategy(-ies) will never be played.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    51/58

    ap Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 51

    Iterated Elimination of Strictly Dominated Strategies

    Consider this game.L C R

    U 2, 0 2, 1 0, 0

    M 1, 1 1, 1 5, 0

    D 0, 1 4, 2 0, 1

    Identify all the strictly dominated strategies

    and eliminate them one by one.

    1. R is strictly dominated by C

    Eliminate R and create a smaller game. In the

    smaller game, we find,

    L C

    U 2, 0 2, 1

    M 1, 1 1, 1

    D 0, 1 4, 2

    2. M is strictly dominated by UEliminate M and create a smaller

    game. In the smaller game,

    L C

    U 2, 0 2, 1

    D 0, 1 4, 2

    3. L is strictly dominated by C

    Eliminate L and create a smaller

    game. In the smaller game,C

    U 2, 1

    D 4, 2

    C

    D 4, 2

    4. U is strictly

    dominated by D

    Eliminate U.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    52/58

    ap Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 52

    Iterated Elimination of Strictly Dominated Strategies

    L C R

    U 2, 0 2, 1 0, 0

    M 1, 1 1, 1 5, 0

    D 0, 1 4, 2 0, 1

    By iteration, the procedure led us to a

    single strategy profile. Hence, we infer

    that this is the unique Nash Equilibrium of

    the game.

    If there were other Nash Equilibria, the game would not terminate this way,

    in a single strategy profile.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    53/58

    ap Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 53

    Iterated Elimination of Strictly Dominated Strategies

    L C R

    U 3, 1 0, 1 0, 0

    M 1, 1 1, 1 5, 0

    D 0, 1 4, 1 0, 0

    A different example 1. R is strictly dominated by both C and L

    Eliminate R.

    2. M is strictly dominated by (, 0, ) and others

    Eliminate M.

    L C

    U 3, 1 0, 1

    M 1, 1 1, 1

    D 0, 1 4, 1L C

    U 3, 1 0, 1

    D 0, 1 4, 1

    This game has infinite Nash

    Equilibria. Find them!

    But now there is no strategy that is strictly

    dominated by another pure strategy. Can we find

    a mixed strategy that strictly dominates a pure

    strategy?

    By trial and error, we find that if Player 1 plays

    (, 0, ) or any (p, 0, 1p) such that 1/3 < p < 3/4,

    then this mixed strategy strictly dominates M.

    Why?

    Now we can find no more pure / mixed strategies that

    dominate an existing pure strategy. Terminate.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    54/58

    ap Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 54

    Iterated Elimination of Strictly Dominated Strategies

    q Helps us narrow down a complex game to a simpler one. Hence can be

    used as a pre-processing step, before using another technique to find Nash

    Equilibria.

    q The process preserves the games Nash Equilibria, since in every iteration

    we eliminate only those strategies that can never be best responses.

    q Dominance Solvability: Games that are fully solvable using this technique

    are said to be dominance solvable. In such a game, we terminate at a

    unique strategy profile after the iterative elimination procedure. Only

    some games are dominance solvable.

    q Order of Removal: If there are multiple strictly dominated strategies, then

    does the order of their removal matter to the solution? No. Verify this

    by using a different order for the first game demonstrated.

    q Weakly dominated strategies: Will this procedure work if we eliminate

    weakly dominated strategies also? IT MAY OR MAY NOT. A Nash

    Equilibrium can consist of weakly dominated strategies!.... At least one

    equilibrium will be preservedbut the order of removal can matter.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    55/58

    ap Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 55

    You are asked to make an investment

    in ONEof two ventures, given the

    following information:

    q If you invest in Venture A, there

    is a 5% chance of making a

    profit of Rs. 80 crores, and a

    95% chance of no profit at all.

    q But, if you invest in Venture B,you are fully assured of a profit

    of Rs. 8 crores.

    Which venture should you invest in

    A or B? There is no one correct

    answer. Both answers can be correct,

    and we are looking at how individualsmake choices.

    Think and answer Which venture

    will you invest in? Why?

    Lottery A: Is for Rs. 10 crore, and 250,000

    people will purchase it.

    There is no one correctanswer. Each

    person can have different reasons for his

    choice.

    What is your answer?

    You have Rs. 100 with you, with which you

    have to purchase either of two lotteries.

    The only winner of each lottery gets the fullprize but a loser loses even the Rs. 100.

    Which lottery will you take?

    Lottery B: Is for Rs. 2 crore, and 25,000

    people will purchase it.

    Experiment done in classroom.

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    56/58

    ap Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 56

    Experiment done in the classroom

    ? ?

    ? ?Venture

    Lottery

    A B

    B

    A

    In solving for mixed strategies, we used the concept ofexpected value. But do people actually make their

    preference on the basis of expected value?

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    57/58

    ap Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 57

    Expected Value Calculation

    For the decision to purchase lottery A

    EV(A) = 0.000004*100000000 + 0.999996*(-100) = 400 99.9996

    EV(A) = ~300.

    Investment

    Lottery

    For the decision to invest in Venture A

    EV(A) = 0.95*0 + 0.05*80 = 4

    For the decision to invest in Venture BEV(B) = 1.00*8 = 8

    Are peoples preferences consistent with EV?

    For the decision to purchase lottery B

    EV(B) = 0.00004*20000000 + 0.99996*(-100) = 800 99.996

    EV(B) = ~700.

    Are peoples preferences consistent across experiments?

  • 8/10/2019 GTAP Sessions 11-15 Mixed Strategy

    58/58

    p P f J Ajith K TAPMI M i l GAME THEORY NOTES

    How does Expected Value Relate to Preference?

    In both experiments, EV(B) > EV(A).

    If people are consistent with their choices, those who choseVenture A should choose Lottery A and those who chose Venture

    B should choose Lottery B.

    ? ?

    ? ?Investment

    Lottery

    A B

    B

    A 4, 300 4, 700

    8, 300 8, 700Investment

    Lottery

    A B

    B

    A

    No. of people who played the

    different choice combinationsThe payoffs associated with

    the different choice

    combinations


Recommended