




















Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Game Theory in describes elements of game, agent strategy design, policies, the prisoners Dilemma and Goldon Balls.
Typology: Slides
1 / 28
This page cannot be seen from the preview
Don't miss anything!





















http://youtu.be/jILgxeNBK_
15781 Fall 2016: Lecture 22
๏ง A complete description of what the players can do: the set of all possible actions.
15781 Fall 2016: Lecture 22
๏ง A description of the payoff / consequences for each player for every possible combination of actions chosen by all players playing the game. ๏ง A description of all playersโ preferences over payoffs
๏ง Decision-making can involve choosing: ๏ง one single action or ๏ง a sequence of actions ๏ง Action outcomes can be certain or subject to uncertainty ๏ง A set ๐ด of alternative actions to choose from is given, it can be either discrete or continuous ๏ง Payoff (for a single agent): function ๐: ๐ด โ โ that associates a numerical values with every action in ๐ด ๏ง Optimal action ๐ โ (for a single agent scenario): ๐(๐ โ ) โฅ ๐ ๐ โ๐ โ ๐ด ๏ง Payoff (for a multi-agent scenario): The payoff of the action ๐ for agent ๐ depends on the actions of the other players! ๐: ๐ด ๐ โ โ ๏ง Strategy: rule for choosing an action at every point a decision might have to be made (depending or not on the other agents) ๏ง The strategy defines the behavior of an agent ๏ง The observed behavior of an agent following a given strategy is the outcome of the strategy
๏ง Pure strategy: a strategy in which there is no randomization , one specific action is selected with certainty at each decision node ๏ง All possible pure strategies define the pure strategy set ๐ ๏ง A decision tree can be used to represent a sequence of decisions 1 2 3
๏ง Three action sets (actions may the be same), that result in the pure strategy set: ๐ = {๐ 1 ๐ 1 ๐ 1 , ๐ 1 ๐ 1 ๐ 2 , ๐ 1 ๐ 2 ๐ 1 , ๐ 1 ๐ 2 ๐ 2 , ๐ 2 ๐ 1 ๐ 1 , ๐ 2 ๐ 1 ๐ 2 , ๐ 2 ๐ 2 ๐ 1 , ๐ 2 ๐ 2 ๐ 2 }
๐ด 1 = ๐ 1 , ๐ 2 , ๐ด 2 = ๐ 1 , ๐ 2 , ๐ด 3 = ๐ 1 , ๐ 2
15781 Fall 2016: Lecture 22
15781 Fall 2016: Lecture 22
15781 Fall 2016: Lecture 22 (STRATEGIC-) NORMAL-FORM GAME
๐
Payoff matrix
15781 Fall 2016: Lecture 22
๐
๐
๐ ๐+๐ ๐ 2
๐
๐
๐ ๐+๐ ๐ 2
๐
๐ 1 2
๐
๐
i
15781 Fall 2016: Lecture 22
15781 Fall 2016: Lecture 22 PRISONERโS DILEMMA: PAYOFF MATRIX
Donโt confess = Donโt rat out Cooperate with each other Confess = Defect Donโt cooperate to each other, act selfishly! Donโt Confess Confess
15781 Fall 2016: Lecture 22 ๏ง Confess (Defection, Acting selfishly) is a dominant strategy for B : no matters what A plays, the best reply strategy is always to confess ๏ง (Strictly) dominant strategy : yields a player strictly higher payoff,. no matter which decision(s) the other player(s) choose ๏ง Weakly: ties in some cases ๏ง Confess is a dominant strategy also for A ๏ง A will reason as follows: B โs dominant strategy is to Confess, therefore, given that we are both rational agents, B will also Confess and we will both get 6 years.
15781 Fall 2016: Lecture 22 ๏ง But, is the dominant strategy (C,C) the best strategy?