2. Bayesian Games and Bayesian Nash Equilibrium (BNE) (Game Theory Playlist 9)

00:14:30
https://www.youtube.com/watch?v=pjKX4l_egJU

Sintesi

TLDRThe text explains Bayesian games in which players have distinct types and strategies. Each player 'i' has a set of strategies (si), types (ti), and payoff functions (ui). Payoffs depend on both the player's strategy and their type, which can lead to different optimal strategies for different types. The text emphasizes the concept of Bayesian Nash Equilibrium, where each player's strategy is the best response considering their type and the strategies of other players. It details how to calculate expected payoffs and describes the importance of conditioning on other players' types.

Punti di forza

  • 📚 Bayesian games involve players with strategies and types.
  • 🔍 Payoff functions depend on both the player's strategy and their type.
  • 🤝 Bayesian Nash Equilibrium requires that each player's strategy be a best response given others' strategies.
  • 📈 Expected payoffs are calculated using probabilities of different outcomes and corresponding payoffs.
  • 🌧️ Types can be correlated due to external factors, affecting strategic decisions.

Linea temporale

  • 00:00:00 - 00:05:00

    The Bayesian game consists of a list defining players' strategies, types, and payoff functions. Each player has a set of strategies denoted as s1, s2, ..., sn, which can be finite or infinite. Payoff functions for each player depend not only on their chosen strategy but also on the strategies of other players and their types. Therefore, a player’s payoff varies depending on their type, leading to different optimal strategies for each type.

  • 00:05:00 - 00:14:30

    The Bayesian Nash equilibrium is an extension of Nash equilibrium in Bayesian games, characterized by strategy profiles where for every type of a player, their chosen strategy maximizes expected utility given the strategies of others. The expected payoff is computed by summing the products of probabilities and outcomes across all potential types, which typically requires analyzing conditional probabilities related to the players' types and their strategic choices. This framework helps in verifying the existence and nature of equilibria in strategic environments.

Mappa mentale

Video Domande e Risposte

  • What is a Bayesian game?

    A Bayesian game is defined by a list of players, each with a set of strategies, types, and payoff functions that depend on the strategies and types of all players.

  • What is the significance of player types in Bayesian games?

    Player types affect the payoff functions, which can lead to different optimal strategies for different types.

  • What is a Bayesian Nash Equilibrium?

    A Bayesian Nash Equilibrium is the Nash Equilibrium of a Bayesian game where each player's strategy is a best response to the strategies of others, considering their types.

  • How are payoffs calculated in a Bayesian game?

    Payoffs are calculated based on the strategies chosen by all players and the types of each player, using a formula that sums the payoffs multiplied by probabilities.

  • What is meant by correlated types?

    Correlated types refer to instances where the types of different players may depend on a common factor, such as external conditions like weather.

Visualizza altre sintesi video

Ottenete l'accesso immediato ai riassunti gratuiti dei video di YouTube grazie all'intelligenza artificiale!
Sottotitoli
en
Scorrimento automatico:
  • 00:00:00
    a bayesian game is a list
  • 00:00:03
    uh s1 s2 up to sn so
  • 00:00:06
    each si is basically set of strategy for
  • 00:00:10
    player i so if each player has a
  • 00:00:13
    discrete or finite strategy
  • 00:00:15
    then that means si is a finite strategy
  • 00:00:18
    set
  • 00:00:18
    or it can be an infinite strategy set so
  • 00:00:21
    these are strategy sets for each player
  • 00:00:24
    and then t1 t2 up to tn
  • 00:00:27
    uh well for example in the uh double
  • 00:00:30
    ocean
  • 00:00:31
    i'm sorry a second price auction example
  • 00:00:33
    i previously mentioned
  • 00:00:34
    so each si was a zero infinity interval
  • 00:00:38
    so t1 t2 up to tn so these are the
  • 00:00:42
    typeset for each player i and in the
  • 00:00:45
    previous example
  • 00:00:46
    in the simplest environment remember uh
  • 00:00:49
    each buyer could have three potential
  • 00:00:51
    types
  • 00:00:52
    so the the ti is you know the typeset
  • 00:00:55
    for each player
  • 00:00:56
    i i'm sorry
  • 00:01:00
    and then u1 u2 up to u n these are the
  • 00:01:02
    payoff vectors
  • 00:01:04
    i'm sorry payoff functions um well
  • 00:01:06
    obviously the payoff function
  • 00:01:08
    depends on not only on player eyes
  • 00:01:12
    strategy but it also depends on all the
  • 00:01:15
    others players
  • 00:01:16
    strategy and here those strategies
  • 00:01:20
    are not sort of discriminated for
  • 00:01:23
    types because i write those strategies
  • 00:01:27
    as
  • 00:01:27
    function which i will describe in a
  • 00:01:29
    moment
  • 00:01:30
    and the payoff also depends on the type
  • 00:01:33
    of player i
  • 00:01:34
    right the different types may have
  • 00:01:36
    different payoffs
  • 00:01:37
    so in the previous example again if
  • 00:01:40
    you are the buyer with valuation 110
  • 00:01:44
    your different your your payoff function
  • 00:01:46
    was is different than
  • 00:01:48
    if you are a buyer with type valuation
  • 00:01:51
    90
  • 00:01:52
    because that buyer's valuation is 90
  • 00:01:55
    minus
  • 00:01:56
    price not 110 minus price all right so
  • 00:01:59
    the different types
  • 00:02:00
    actually has different payoffs which
  • 00:02:03
    basically derives
  • 00:02:05
    different optimal strategies for
  • 00:02:06
    different types all right
  • 00:02:08
    so one more additional thing which is
  • 00:02:11
    not
  • 00:02:11
    uh described here is the p the
  • 00:02:14
    probability the player's
  • 00:02:16
    belief about other types all right so we
  • 00:02:19
    denote it by
  • 00:02:20
    pi it's a conditional probability
  • 00:02:23
    t sub minus i which means the type of
  • 00:02:27
    all the other players except player i
  • 00:02:30
    conditional on player i's type well this
  • 00:02:34
    notation allows correlated types
  • 00:02:37
    so maybe your type and the other guy's
  • 00:02:39
    types are correlated
  • 00:02:40
    which is very well possible for example
  • 00:02:44
    maybe uh so if it is for example a a
  • 00:02:47
    football game
  • 00:02:48
    um all right and so the types can be
  • 00:02:52
    uh determined whether uh uh the the
  • 00:02:55
    weather is
  • 00:02:56
    rainy or snowy or or shining those or
  • 00:02:59
    maybe the sun is shining so therefore
  • 00:03:01
    if this is the case if this is how the
  • 00:03:03
    types are determined
  • 00:03:04
    well then maybe your type and your
  • 00:03:07
    opponent's type may be correlated
  • 00:03:09
    depending on the weather you see what i
  • 00:03:10
    mean so this notation allows
  • 00:03:13
    correlation in that sense so if for
  • 00:03:15
    example
  • 00:03:16
    this probably is equal to the
  • 00:03:19
    multiplication of
  • 00:03:20
    the individual types probabilities p
  • 00:03:24
    j t j j is different than i if this is
  • 00:03:27
    the case
  • 00:03:27
    well that means independent types
  • 00:03:29
    remember if 2
  • 00:03:31
    probability of a given b for example um
  • 00:03:35
    so if this is uh equal to probability of
  • 00:03:38
    a well then we say that
  • 00:03:41
    you know a and b are independent types
  • 00:03:44
    this is exactly what's happening here
  • 00:03:46
    all right okay so what else
  • 00:03:51
    i already mentioned the payoff function
  • 00:03:53
    for player i
  • 00:03:54
    it depends on the strategies of all the
  • 00:03:56
    players because that is the essence of
  • 00:03:59
    strategic environment and also depends
  • 00:04:01
    on the type of player i
  • 00:04:03
    well here uh what if
  • 00:04:06
    the uh the player eyes other types
  • 00:04:10
    can they affect my payoff well maybe yes
  • 00:04:13
    but we
  • 00:04:14
    usually ignore this correlation because
  • 00:04:17
    it complicates the game so here the
  • 00:04:19
    strategy is
  • 00:04:21
    important is a function s-i-t-i
  • 00:04:25
    which specifies what strategy player i
  • 00:04:28
    would pick if he is type ti
  • 00:04:31
    so therefore a strategy you can think of
  • 00:04:34
    strategy
  • 00:04:36
    of player i as a function which maps
  • 00:04:39
    each ti
  • 00:04:40
    to some strategy in the set of
  • 00:04:43
    strategies
  • 00:04:44
    as i all right again in my previous
  • 00:04:46
    example i said it is b1
  • 00:04:48
    1 b1 2 b1 3
  • 00:04:52
    all right so basically this is what
  • 00:04:55
    b1 type i is so if
  • 00:04:58
    if for example b1 type one
  • 00:05:02
    so if if you're type one your bid is
  • 00:05:04
    gonna be b11 this is how i denoted it
  • 00:05:07
    so they're not equal i'm sorry for my
  • 00:05:09
    notational
  • 00:05:10
    uh mix up um b1
  • 00:05:14
    t2 is equal to b12 b1
  • 00:05:17
    t3 is equal to b13 all right
  • 00:05:20
    so this is just another way of
  • 00:05:24
    notating the strategy of player 1 as a
  • 00:05:27
    triple
  • 00:05:28
    all right you can just represent it as a
  • 00:05:30
    function
  • 00:05:31
    all right so you can either write b1
  • 00:05:34
    parenthesis t1 t2 t3
  • 00:05:36
    or b 1 1 b 1 2 b 1 3 this is
  • 00:05:40
    totally up to you i mean you'll see in
  • 00:05:42
    some examples
  • 00:05:43
    i'm going to use those notations in some
  • 00:05:45
    other i'm going to use
  • 00:05:46
    that notation so uh depending on the
  • 00:05:49
    question uh the you know some notations
  • 00:05:52
    are easier to follow
  • 00:05:53
    some notations are more and you know
  • 00:05:58
    uh making the notation hard to follow
  • 00:06:01
    and so it's totally up to you which
  • 00:06:02
    notation you want
  • 00:06:04
    you would like to prefer so i just
  • 00:06:05
    wanted to give you another so this is a
  • 00:06:08
    more uh sort of a neat way of
  • 00:06:11
    defining a strategy this strategy is a
  • 00:06:14
    function which maps each type
  • 00:06:16
    to a strategy from the set of strategies
  • 00:06:20
    of that player i obviously all right so
  • 00:06:22
    what is bayesian nash equilibrium
  • 00:06:24
    it's nothing but a nash equilibrium of
  • 00:06:28
    the bayesian game all right so the
  • 00:06:30
    bayesian game is given by easily by the
  • 00:06:32
    set of strategies
  • 00:06:33
    so instead of writing s1 s2 s3 i just
  • 00:06:36
    wrote the
  • 00:06:38
    cartesian product of the strategy sets
  • 00:06:41
    type
  • 00:06:42
    space so these are spaces so that means
  • 00:06:46
    s is nothing but s1 cross s2 cross all
  • 00:06:50
    the way cross
  • 00:06:51
    s answers the cartesian product so s1 is
  • 00:06:54
    a set
  • 00:06:55
    capital s is a space all right so the
  • 00:06:58
    set and space are two different things
  • 00:07:00
    uh why is that well set doesn't really
  • 00:07:03
    have
  • 00:07:04
    i mean they may be the same thing
  • 00:07:06
    obviously but set
  • 00:07:07
    is just you know finite or infinite just
  • 00:07:10
    one dimensional
  • 00:07:11
    set the capital s however is an n
  • 00:07:14
    dimensional
  • 00:07:15
    set all right so we call it space t
  • 00:07:19
    symmetrically t1 cross t2 cross all the
  • 00:07:22
    way up to tn so this
  • 00:07:24
    uh type space p is the beliefs
  • 00:07:27
    right you know how those types are
  • 00:07:30
    distributed
  • 00:07:31
    according to what probability this is
  • 00:07:33
    given by a p
  • 00:07:34
    it may be a correlated times
  • 00:07:36
    uncorrelated types
  • 00:07:38
    depending on the uh problem or the
  • 00:07:40
    strategic environment we're analyzing
  • 00:07:42
    and finally the payoff vector
  • 00:07:44
    so once you're given this this is the
  • 00:07:46
    bayesian game
  • 00:07:49
    the bayesian nash equilibrium of this
  • 00:07:51
    game is the nash equilibrium
  • 00:07:53
    of this game all right so bayesian nash
  • 00:07:55
    is not something new in fact
  • 00:07:57
    a strategy profile s star is
  • 00:08:00
    based in nash uh for that means
  • 00:08:04
    for every player and for every type
  • 00:08:07
    remember you have to check best
  • 00:08:08
    responding thing
  • 00:08:09
    for each player and for each type of
  • 00:08:12
    that player
  • 00:08:13
    so uh the si star ti so this is the
  • 00:08:17
    strategy of type i
  • 00:08:19
    uh given that fixing the other's
  • 00:08:21
    strategy
  • 00:08:23
    should be a best response meaning it has
  • 00:08:25
    to the expected utility of this thing
  • 00:08:28
    uh this this profile has to be greater
  • 00:08:30
    than the expected utility
  • 00:08:32
    of playing some other strategy si again
  • 00:08:36
    fixing the others are playing according
  • 00:08:37
    to this strategy profile
  • 00:08:39
    so this inequality should be true for
  • 00:08:42
    all s i element of
  • 00:08:46
    capital s i all right remember as
  • 00:08:49
    player i or type i you're allowed to
  • 00:08:51
    choose
  • 00:08:52
    any strategy in this set so if any
  • 00:08:54
    strategy you pick here
  • 00:08:56
    the expected utility should be less than
  • 00:08:58
    or equal to the expected utility
  • 00:09:01
    of playing si star ti if this is the
  • 00:09:04
    case then i'm gonna call that
  • 00:09:05
    this is the best response for type i
  • 00:09:08
    well
  • 00:09:09
    if this star strategy is the best
  • 00:09:11
    response for each player and for
  • 00:09:13
    each type well then we actually got
  • 00:09:17
    one nash equilibrium all right obviously
  • 00:09:19
    there can be a bunch of other nash
  • 00:09:21
    equilibria
  • 00:09:22
    but this is how we check or verify if
  • 00:09:24
    something is nash equilibrium or not
  • 00:09:27
    well here obviously it is important how
  • 00:09:29
    we write this expected payoff
  • 00:09:31
    okay so important question is what is
  • 00:09:34
    that expected utility expected payoff
  • 00:09:37
    so uh i'll let me give you a generic
  • 00:09:40
    uh formula for it because the
  • 00:09:43
    idea well if you get this uh generic
  • 00:09:46
    formula i think
  • 00:09:48
    of well your life is going to be very
  • 00:09:50
    very simple
  • 00:09:52
    but obviously it's not so easy and
  • 00:09:54
    straightforward
  • 00:09:55
    so try to uh picture this formula
  • 00:09:58
    every in every exercise you solve all
  • 00:10:01
    right so that's very very important once
  • 00:10:03
    you solve an exercise come back to this
  • 00:10:06
    formula
  • 00:10:06
    and see how it fits this is how you can
  • 00:10:10
    really understand this formula and again
  • 00:10:12
    once you understand this trust me
  • 00:10:15
    finding basic nash equilibrium
  • 00:10:17
    is just piece of cake so how do we write
  • 00:10:20
    the expected payoff
  • 00:10:22
    uh given that player i plays
  • 00:10:25
    some strategy as i and given that other
  • 00:10:28
    players are playing according to s star
  • 00:10:31
    minus i
  • 00:10:32
    all right well first of all this is
  • 00:10:34
    going to be sum
  • 00:10:36
    right sum of payoffs multiplied by
  • 00:10:38
    probabilities
  • 00:10:40
    so this is how we calculate the expected
  • 00:10:43
    utility
  • 00:10:43
    of a lottery p remember so if p is for
  • 00:10:46
    example
  • 00:10:47
    p1 p2 all the way up to pn
  • 00:10:51
    so what we were doing is you know p1
  • 00:10:53
    times u1
  • 00:10:54
    p2 times u2 so these are the payoffs in
  • 00:10:58
    each
  • 00:10:58
    outcome and these are the probabilities
  • 00:11:00
    of these events
  • 00:11:01
    so plus p n times u n so uh
  • 00:11:05
    which is equivalent to saying you know p
  • 00:11:07
    i times u i
  • 00:11:08
    i mean here j i'm sorry and obviously j
  • 00:11:12
    from one to n so here i am summing
  • 00:11:15
    through different type profiles
  • 00:11:18
    except player i so what is t sub minus i
  • 00:11:23
    well t sub minus i is the type profile
  • 00:11:26
    vector so it includes type one of player
  • 00:11:30
    one
  • 00:11:30
    type two i'm sorry type of player one
  • 00:11:33
    type of player two
  • 00:11:34
    type of player i minus one i plus one
  • 00:11:38
    so every other player's types except
  • 00:11:41
    type of player i alright so it's not
  • 00:11:43
    going to be in
  • 00:11:45
    a part of this vector all right why well
  • 00:11:48
    because
  • 00:11:48
    it is here the type i's player i's type
  • 00:11:51
    is
  • 00:11:52
    ti is on this conditional part so what
  • 00:11:55
    is the probability
  • 00:11:56
    that the others type is given this given
  • 00:12:00
    that
  • 00:12:00
    player i's type is ti so this
  • 00:12:02
    probability
  • 00:12:04
    remember in our previous example it was
  • 00:12:06
    sort of independent one third one third
  • 00:12:08
    one third so it was
  • 00:12:09
    life was very simple there and so here
  • 00:12:12
    is this
  • 00:12:13
    ui u1 u2 thing well
  • 00:12:16
    obviously as you change this type
  • 00:12:18
    profile this payoff will
  • 00:12:20
    change all right so that's that's very
  • 00:12:22
    important oh
  • 00:12:25
    well i am sorry because this notation is
  • 00:12:28
    not
  • 00:12:28
    100 percent true but now i'm going to
  • 00:12:31
    make it
  • 00:12:32
    true 100 s i
  • 00:12:35
    comma s i plus one star
  • 00:12:38
    t i plus one comma dot dot dot
  • 00:12:42
    comma okay now the payoff
  • 00:12:45
    of player i given that he is playing
  • 00:12:48
    s i and his opponents are playing s
  • 00:12:51
    minus i
  • 00:12:52
    so i just open this now all right so
  • 00:12:55
    expected utility this expected part
  • 00:12:58
    comes with
  • 00:12:59
    you know multiplication of this
  • 00:13:00
    probability well what about this utility
  • 00:13:02
    part
  • 00:13:03
    well utility is depend on
  • 00:13:06
    player 1's strategy player 1 of type 1's
  • 00:13:09
    strategy player 2 and his type
  • 00:13:13
    strategy player type player
  • 00:13:16
    i minus 1 and his type all right player
  • 00:13:19
    eyes
  • 00:13:20
    strategy which is s i player i plus
  • 00:13:23
    once as uh type uh i plus
  • 00:13:27
    i'm sorry uh player i plus one and his
  • 00:13:30
    type
  • 00:13:30
    and that strategy and so on and so forth
  • 00:13:34
    this is the player n and and the type uh
  • 00:13:37
    his type and his strategy so
  • 00:13:39
    as you change this type profile that
  • 00:13:42
    means you're changing this
  • 00:13:43
    tease inside the parenthesis and so
  • 00:13:47
    therefore you automatically change the
  • 00:13:49
    strategies so there's only one thing
  • 00:13:51
    that is going to be
  • 00:13:52
    fixed which is s i all right so
  • 00:13:56
    the thing is this s i is fixed
  • 00:13:59
    as you change this type profile this
  • 00:14:02
    probability will change
  • 00:14:03
    this payoff will probably change but
  • 00:14:06
    here
  • 00:14:06
    what is changing here is that you're
  • 00:14:08
    fixing si
  • 00:14:10
    and you're only changing those
  • 00:14:11
    strategies
  • 00:14:13
    simply because you are changing those
  • 00:14:15
    type
  • 00:14:16
    profiles all right so this is how we
  • 00:14:18
    calculate the expected payoff
  • 00:14:20
    um where we used it
  • 00:14:24
    to calculate the best response of
  • 00:14:27
    a type of a player
Tag
  • Bayesian games
  • Strategies
  • Types
  • Payoffs
  • Bayesian Nash Equilibrium
  • Expected utility
  • Independent types
  • Correlated types
  • Payoff functions
  • Best response