
















Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
The prevalence of central A in Hide-and-Seek games and compares three models - equilibrium, QRE, and level-k models - to explain this phenomenon. The document shows that logit QRE with perturbations of equal magnitudes but opposite signs across player roles can explain the prevalence of central A for both Hiders and Seekers, but it has difficulty explaining the greater prevalence of central A for Seekers. The level-k model, on the other hand, can explain all three of the robust patterns observed in RTH's data, including the role asymmetries, by using a behaviorally plausible specification of L0 and the type frequencies.
Typology: Study notes
1 / 24
This page cannot be seen from the preview
Don't miss anything!

















Vincent P. Crawford and Nagore Iriberri^1 University of California, San Diego 20 August 2004; revised 15 September 2004
Abstract: Rubinstein, Tversky, and Heller elicited subjects' initial responses to games in which a Hider and a Seeker choose simultaneously among four locations, with the Seeker winning a given amount if he chooses the same location as the Hider and the Hider otherwise winning that amount. The game has a unique equilibrium, in which both players randomize uniformly over locations; but the design framed locations non-neutrally and subjects deviated systematically from equilibrium in ways that were highly sensitive to the framing. This paper compares alternative explanations of the results and proposes a structural non-equilibrium model of initial responses to explain them.
Keywords: experimental game theory, framing effects, focal points, bounded rationality, strategic sophistication JEL classification numbers: C70, C
(^1) Email: [email protected] and [email protected]. We are grateful to the National Science Foundation (Crawford) and the Centro de Formacion del Banco de España (Iriberri) for research support; to Miguel Costa-Gomes, Victor Ferreira, Barry Nalebuff, Steven Scroggin, Ricardo Serrano-Padial, David Swinney, Joel Sobel, Joel Watson, and Mark Voorneveld for helpful comments or discussions; to Dale Stahl for providing a copy of Bacharach and Stahl (1997a); to Stahl and Daniel Zizzo for searching for Bacharach and Stahl (1997b); and to Ariel Rubinstein for providing a copy of Rubinstein and Tversky (1993), searching for additional data, and a helpful discussion. Glenn Close and Michael Douglas were no help at all.
1. Introduction Game theorists and economists have been intrigued by "Hide-and-Seek" games for more than 50 years (von Neumann (1953)). Rubinstein and Tversky (1993; henceforth "RT") and Rubinstein, Tversky, and Heller (1996; "RTH") were perhaps the first to study such games experimentally; see also Rubinstein (1999; "R"). RT, RTH, and R (collectively "RTH" from now on) elicited subjects' initial responses to several closely related Hide-and-Seek games. In a leading example of their games, one player, the Hider, hides a "treasure" or "prize" in one of four locations; and the other player, the Seeker, looks in one of the locations. Because the Seeker looks without observing the Hider's choice, their choices are strategically simultaneous. If the Seeker chooses the same location as the Hider he wins a given amount; if not, the Hider wins that amount.
Instead of giving subjects a payoff matrix, RTH explained the Hide-and-Seek games in "stories," probably increasing comprehension. Their key innovation was to present the games with non-neutral framing of the locations. R, for example, told Seekers: "You and another student are playing the following game: Your opponent has hidden a prize in one of four boxes arranged in a row. The boxes are marked as follows: A, B, A, A. Your goal is, of course, to find the prize. His goal is that you will not find it. You are allowed to open only one box. Which box are you going to open?" Hiders were told an analogous story. In effect the entire structure, including the order and labeling of locations, was publicly announced, with the goal of making it common knowledge.
This story makes the framing of locations non-neutral in two ways. The "B" location is uniquely distinguished by its label, and is thus focal in one of Schelling's (1960) senses. And the two "end A" locations, though not distinguished by their labels, may be inherently focal, as RT and RTH argue, citing Christenfeld (1995).^2 As RTH note, these two focalities interact to give the remaining location, "central A," its own brand of uniqueness as "the least salient location" (RT).
RTH's design is important as a tractable abstract model of applications in which games like Hide-and-Seek are played on naturally occurring, non-neutral, cultural or geographic "landscapes." Equilibrium analysis ignores such landscapes unless they directly influence the payoff structure. Consider, for instance, the complete-information Hide-and-Seek game derived from the above example, presented as a payoff matrix in Figure 1 with each player's winning payoff normalized to
(^2) See also R's discussion of Ayton and Falk's (1995) two-dimensional Hide-and-Seek experiment.
thinking." In RTH's words, "In the competitive games, however, the players employed a naïve strategy (avoiding the endpoints), that is not guided by valid strategic reasoning. In particular, the hiders in this experiment either did not expect that the seekers too, will tend to avoid the endpoints, or else did not appreciate the strategic consequences of this expectation." Table I. Aggregate Choice Frequencies in RTH's Experiments RTH-4 A B A A Hider (53) 9% 36% 40% 15% Seeker (62) 13% 31% 45% 11% RT-AABA-Treasure A A B A Hider (189) 22% 35% 19% 25% Seeker (85) 13% 51% 21% 15% RT-AABA-Mine A A B A Hider (132) 24% 39% 18% 18% Seeker (73) 29% 36% 14% 22% RT-1234-Treasure 1 2 3 4 Hider (187) 25% 22% 36% 18% Seeker (84) 20% 18% 48% 14% RT-1234-Mine 1 2 3 4 Hider (133) 18% 20% 44% 17% Seeker (72) 19% 25% 36% 19% R-ABAA A B A A Hider (50) 16% 18% 44% 22% Seeker (64) 16% 19% 54% 11% RTH-1 Triangle Circle Circle Circle Hider (53) 23% 23% 43% 11% Seeker (62) 29% 24% 42% 5% RTH-2 Polite Rude Honest Friendly Hider (53) 15% 26% 51% 8% Seeker (62) 8% 40% 40% 11% RTH-3 Smile Smile Smile Frown Hider (53) 21% 26% 34% 19% Seeker (62) 7% 25% 34% 34% RTH-5 Frown Smile Frown Frown Hider (53) 15% 40% 34% 11% Seeker (62) 16% 55% 21% 8% RTH-6 Hate Detest Love Dislike Hider (53) 11% 23% 38% 28% Seeker (62) 20% 21% 55% 14% Sample sizes in parentheses; focal labels in italics; order of presentation of locations to subjects as shown.
In our view, however, such robust patterns are unlikely to lack a coherent explanation; and given the simplicity of the strategic questions Hide-and-Seek games pose, the explanation is unlikely to be nonstrategic. On the contrary, such systematic deviations from equilibrium in games where its rationale is especially strong seem a particularly promising "proving ground" for alternative, non-equilibrium theories of strategic behavior. Understanding RTH's results is likely to prove helpful in many applications involving games like Hide-and-Seek played on non-neutral landscapes, where people's choices are sensitive to the landscape but equilibrium ignores it.
This paper compares alternative explanations of RTH's results and conducts an illustrative econometric analysis that helps to discriminate among them. We focus on explaining the patterns of deviation from equilibrium that are common to RTH-4 and, mutatis mutandis, the five most closely analogous RTH treatments in the top half of Table I: the prevalence of central A (or its analog, as explained in Section 2) for both Hiders and Seekers, its greater prevalence for Seekers (or their analogs), and the fact that Seekers (or their analogs) find the treasure more often than in equilibrium. We seek a parsimonious explanation that rests on behaviorally plausible assumptions.
Section 2 explains the other treatments in Table I and the senses in which they are analogous to RTH-4, and reports tests for differences in choice frequencies across treatments.
Section 3 considers what may be the simplest way to try to explain RTH's results: an equilibrium analysis of the Hide-and-Seek game with payoff perturbations that reflect "hard-wired" preferences about the salient focally labeled and/or end locations. This model can explain the prevalence of central A for both Hiders and Seekers by postulating behaviorally plausible perturbations of equal magnitudes but opposite signs across player roles, with Hiders averse to focally labeled and/or end locations and Seekers favoring them (Figure 2). But the model can only explain the greater prevalence of central A for Seekers by allowing large, unexplained differences across roles in the perturbations' magnitudes as well as their signs. Such differences give the model enough flexibility to explain almost any pattern, raising concerns about overfitting.
Section 4 considers explanations based on a generalization of equilibrium called Quantal Response Equilibrium ("QRE"; McKelvey and Palfrey (1995)), which explains the patterns of deviations from equilibrium in some other experiments. In a QRE players' choices are noisy, with the probability of each choice increasing in its expected payoff, given the distribution of others' choices; a QRE is thus a fixed point in the space of players' choice distributions. The specification is completed by a response distribution, whose noisiness is represented (inversely) by a precision
decision rules or types ; but it assumes that each player's type is drawn from the same distribution, thus eliminating unexplained differences in behavioral assumptions across player roles.^8
Our level- k model has five types, L0 , L1 , L2 , L3 , and L4. Type Lk for k > 0 anchors its beliefs in a naïve L0 type, described below, and then adjusts them via thought-experiments involving iterated best responses: L1 normally best responds to L0 , L2 to L1 , and so on.^9 Each type Lk for k > 0 also makes uniform errors, with a probability independent of k and player role.
Lk types have accurate models of the game and are rational; they depart from equilibrium only in basing their beliefs on simplified models of other players. This yields a workable model of others' choices while avoiding the cognitive complexity of equilibrium analysis. In Selten's (1998) words: "Basic concepts in game theory are often circular in the sense that they are based on definitions by implicit properties…. Boundedly…rational strategic reasoning seems to avoid circular concepts. It directly results in a procedure by which a problem solution is found. Each step of the procedure is simple, even if many case distinctions by simple criteria may have to be made."
Because Lk for k > 0 ignores the framing except as it affects the anchoring type L0 , L0 is the key to the model's potential to explain RTH's results. L0 plays multiple roles, as L1 's model of others, L2 's model of others' models of itself, and so on. We take L0 to be nonstrategic, as is usual in such analyses; but we allow L0 Hiders and Seekers to favor the salient focally labeled and/or end locations, to an equal extent in each player role. We also represent L0 's preferences directly as choice probabilities rather than payoff perturbations.^10 Our assumption that L0 's probabilities are non-uniform departs from much of the literature, but there is ample precedent for adapting L0 to the setting, and in RTH's games a uniform L0 would make Lk the same as equilibrium for all k.^11
Under these assumptions the level- k model can explain RTH's results for plausible values of L0 's choice probabilities and the population type frequencies, with no asymmetry across player
(^8) Symmetry in behavioral assumptions across roles is natural here because the types are general decision rules, meant to explain 9 differences in Hiders' and Seekers' behavior, and the roles were filled by subjects from the same pool. Costa-Gomes and Crawford (2004) provide experimental support for our assumptions that L2 best responds to an L without decision errors, and to L1 alone rather than a mixture of L1 and L0 , etc., unlike in Stahl and Wilson (1995) or Camerer, Ho, and Chong (2004). Our model with five types is the most general model of this kind, because in the unperturbed Hide-and-Seek game our types' choices cycle, so that 10 L5 is equivalent to L1 , L6 to L2 , and so on (Table II). This difference from our equilibrium model is less important than it may seem because it does not affect the range of possible best responses for 11 Lk for k > 0; and L0 's choices could be "purified" via privately observed perturbations. See for example Ho, Camerer, and Weigelt's (1998) analysis of guessing games; or Crawford's (2003) analysis of strategic deception via cheap talk, where the Sender's and Receiver's L0 types are based on truthfulness or credulity, as in the informal literature on deception. By contrast, the level- k model's other basic component, the adjustment of Lk beliefs for k > 0 via iterated best responses, appears to yield a satisfactory account of initial responses in many settings.
roles in behavioral assumptions. Given L0 's attraction to focally labeled and/or end locations, L Hiders choose central A to avoid L0 Seekers and L1 Seekers avoid central A in their searches for L0 Hiders. For similar but more complex reasons, L2 Hiders choose central A with probability between zero and one and L2 Seekers choose it with probability one; L3 Hiders avoid central A and L3 Seekers choose it with probability between zero and one; and L4 Hiders and Seekers both avoid central A (Table II). For our estimated type frequencies, which are similar to—but imply somewhat more sophistication than—those that have been estimated for other settings (see Costa-Gomes and Crawford (2004) and the papers discussed there), these choice patterns allow the level- k model to explain the prevalence of central A for both Hiders and Seekers, its greater prevalence for Seekers than Hiders, and the fact that Seekers find a Treasure more often than in equilibrium.^12
To put this explanation into perspective, compare it with a simpler alternative that has been suggested to us: "Hiders feel safer avoiding focal locations, so they are most likely to choose central A; and Seekers know this, so they are also most likely to choose central A." This sounds plausible, but it has two weaknesses: It implicitly assumes that Hiders are systematically less sophisticated ( L1 , in our terminology) than Seekers ( L2 ), and it does not explain the greater prevalence of central A for Seekers.^13 Our level- k model remedies both weaknesses by using a behaviorally plausible specification of L0 and the type frequencies, the same for Hiders and Seekers, to explain all three of the robust patterns RTH observed, including the role asymmetries.
Section 5's analysis shows that the level- k model is a promising alternative to equilibrium or QRE with payoff perturbations explanations of RTH's results, but all three models' predictions depend on behavioral parameters that must be estimated or translated from other settings. Section 6 compares the models more systematically, focusing on the equilibrium and level- k models because when estimated the QRE model effectively reduces to the equilibrium model. We first estimate the equilibrium and level- k models econometrically, pooling the data from the six treatments on which
(^12) Note that the level- k model's explanation goes against its specification of L0 , which avoids central A in either role. Our level- k analysis is similar in some respects to Bacharach and Stahl's (2000) "level- k variable-frame" analysis of coordination and Bacharach and Stahl's (1997a) analysis of Hide-and-Seek games (not in the published version, Bacharach and Stahl (2000); and Bacharach and Stahl (1997b), whose title suggests a more detailed version of their Hide-and-Seek analysis, is unavailable). However, our analysis makes much simpler behavioral assumptions, in that only 13 L0 responds directly to the framing, and our players need not reason about each other's knowledge of L. Note that levels of sophistication are meaningfully comparable across roles. The quotation in the text could be inverted to "Seekers are drawn to focal locations, so they are unlikely to choose central A; Hiders know this, so they are likely to choose central A." This version uses the same logic as the one in the text, with player roles interchanged; but it does not fit the patterns in RTH's data, suggesting that an explanation requires empirical knowledge in addition to logic.
Treasure, and R-ABAA. They also include two "Mine" treatments, RT-AABA-Mine and RT-1234- Mine, identical to the corresponding Treasure treatments except that the hidden object is undesirable, so that Hiders' and Seekers' payoffs are interchanged. This yields an equivalent normal form with players' roles reversed, leaving equilibrium predictions otherwise unchanged. However, because Hiders inherently move first, even though Seekers do not observe their choices Mine treatments have different extensive forms than Treasure treatments with roles reversed. RTH, suspecting that this difference might make it easier for Seekers to mentally simulate Hiders' choices, used Mine treatments to test whether it explains the role-asymmetric patterns in their data; but the Mine treatments yielded results very close to the corresponding Treasure treatments with roles reversed.^15 This suggests that the role-asymmetries were somehow driven by subjects' responses to the normal-form structure, as in all of the theoretical explanations considered here.
In the three ABAA or AABA Treasure treatments and the AABA Mine treatment, central A—RT's "least salient" location—was the modal choice for both Hiders and Seekers. This pattern extends to the 1234 Treasure and Mine treatments if we follow RT's suggestion that "the least salient response…may correspond to 3, or perhaps 2" and take 2 as analogous to B and 3 to central A. Given this identification, central A was more prevalent for Seekers in all four Treasure treatments and more prevalent for Hiders in both Mine treatments; thus this pattern is also the same in all six treatments if Hiders in Treasure and Seekers in Mine treatments are identified. Further, the frequencies with which Seekers found a Treasure or a Mine exceeded ¼, so that Seekers (Hiders) had higher (lower) expected payoffs than in equilibrium in Treasure treatments, and vice versa in Mine treatments. Accordingly, our analysis "builds in" these analogies by identifying 2 with B, and Mine treatments with the corresponding Treasure treatments with reversed player roles. To avoid unnecessary repetition, we use "central A" ("B") to refer to either a central A (B) or a 3 (2) location; and we refer to Mine treatments as if they were Treasure treatments with reversed roles.
After transforming the data as suggested by these identifications, chi-square tests for aggregate differences in subjects' choice frequencies across the six treatments in the top half of Table I reveal no significant difference for Seekers ( p -value 0.2297) and a marginally significant difference for Hiders ( p -value 0.0417). Pairwise tests show that this last difference can be attributed to RTH-4, which differed from the other treatments in having unusually high frequencies of B (as
(^14) Although equilibrium is also a general decision rule, it is less clear how to translate the payoff perturbations needed to explain RTH's results to new games, and we are unaware of any estimates of them to which ours can be compared. 15 Weber, Camerer, and Knez (2004) found weak effects of timing without observability in other games.
well as central A) for both Hiders and Seekers. Nonetheless, in keeping with our goal of explaining the prevalence of central A for both Hiders and Seekers, and its greater prevalence for Seekers, both of which hold in RTH-4 as well as the other five treatments, we pool all six treatments for the econometric analysis and remark how the results differ for RTH-4. The pooled sample includes 624 Hiders and 560 Seekers, with aggregate choice frequencies as shown in Table III.^16 Although our analysis focuses on the six treatments in the top half of Table I, the five treatments in the bottom half are worth some discussion because they provide additional evidence of the robustness of the patterns RTH observed. These treatments are all Treasure treatments with the same payoff structure as RTH-4, but with labels with positive or negative connotations and/or focally labeled end locations.^17 RTH-2 and RTH-5 are analogous to RTH-4 except for the connotations of the focal label. RTH-1 and RTH-3 are analogous to RTH-4 except that the focal label is at an end position, and in RTH-3 it has a negative connotation. RTH-6 is analogous to RTH-5 except that the focal label is in the third rather than second position; and is analogous to RTH-2 and RTH-4 except for this difference in position and that the focal label has a positive connotation in RTH-6 but negative or neutral connotations in RTH-2 or RTH-4. The choice frequencies for these imperfectly analogous RTH treatments echo those for the ones we analyze, with complicating shifts in expected directions. It seems likely that Section 5's level- k explanation of RTH's results could be adapted to these treatments by estimating new L0 choice probabilities.
3. Equilibrium with Payoff Perturbations This section considers explanations of RTH's results via equilibrium analysis of a perturbed game, whose players' payoffs are influenced by the salience of focally labeled and/or end locations as well as their strategic consequences. We take these payoff perturbations to reflect hard-wired preferences, independent of strategic reasoning. Following RTH's discussions of typical responses to salience, we assume that Seekers receive an additional, psychic payoff of e for choosing an end location (equal for both ends for simplicity) or f for choosing the focally labeled B location, and that Hiders lose payoffs of equal magnitudes for such choices (Figure 2). We start by taking e and f to have the same magnitudes for both player roles, and focus on the leading case where e , f > 0.^18
(^16) Minor corrections to the published data were needed to reconcile the reported frequencies and sample sizes. (^17) These subjects played a series of games with the same normal form but different labelings. They were anonymously and randomly paired, without feedback, to induce them to treat each of the games as "one-shot." 18 In Section 5 we report estimates of e = 0.2187 and f = 0.2010 without this restriction, confirming that e , f > 0 provides the best explanation of RTH's results for the equilibrium with restricted perturbations model.
highest expected payoff for Seekers, and so has probability greater than ¼ for them. But then some location other than L has higher expected payoff for Hiders, contradicting the initial hypothesis.
This section accordingly considers explanations that combine logit QRE with payoff perturbations as in Figure 2, which make QRE sensitive to framing and give it the potential to explain RTH's results. As is almost always the case, such models must be solved computationally. Figure 3 illustrates logit QRE with payoff perturbations restricted to be equal in magnitude
values that best fit RTH's data for Section 2's equilibrium with restricted perturbations model
is infinite, reducing the model to the equilibrium with restricted perturbations model (footnote 7). As in Figure 3, for all combinations of e , f = 0.1, 0.2, 0.3, or 0.4 (all consistent with a totally mixed
but never for Hiders; and it is always higher for Hiders, reversing the patterns in RTH's data.^20
Like equilibrium with payoff perturbations, logit QRE can only explain RTH's results by allowing the magnitudes of the perturbations e and f to differ across roles. This yields an effectively
equilibrium with unrestricted perturbations model (footnote 7). Figure 4 illustrates logit QRE with eH = 0.2910, fH = 0.2535, and eS = fS = 0.1539, the values that give the best fit for the equilibrium with unrestricted perturbations model.
5. A Model with Boundedly Rational "Level- k " Types Returning to the Hide-and-Seek game without payoff perturbations (Figure 1), we now consider a structural non-equilibrium model of initial responses with boundedly rational level- k types as discussed in the Introduction. Each player follows one of five types, L0 , L1 , L2 , L3 , or L4 , with given probabilities r , s , t , u , and v respectively, the same for each player role. Type L0 Hiders and Seekers both choose A, B, A, A with probabilities p /2, q , 1– p – q , p /2 respectively, taking the probabilities of the end locations to be equal for simplicity. We focus on the leading case in which p > ½ and q > ¼, reflecting L0 's nonstrategic attraction to salient locations. We also assume that for k > 0, an Lk Hider or Seeker normally chooses type k 's location for his role, defined by iterating
(^192) e + f must be larger for Hiders because larger values are associated with higher probabilities of central A, and Hiders' value determines Seekers' probability. In Section 5 we report estimates of eH = 0.2910, fH = 0.2535, and eS = fS = 0. for the equilibrium with unrestricted perturbations model: all positive as expected, and nearly twice as large for Hiders. 20 There is enough structure to suggest that this result is symptomatic of a theorem, but we have been unable to prove it.
best responses to L0 , with ties broken randomly with equal probabilities.^21 But with probability ε, equal across roles and values of k > 0, he makes an error, in which case he chooses each location with probability ¼; these errors are independently and identically distributed across players. This implies that a location can be chosen with probability greater than ¼ in the population of Hiders or Seekers only if it is a best response for some type that has positive probability in the population.
Figure 3. QRE with Payoff Perturbations of Equal Magnitudes Across Player Roles
Figure 4. QRE with Payoff Perturbations of Differing Magnitudes Across Player Roles
(^21) As noted in the Introduction, L0 plays multiple roles in this model, as L1 's model of others, L2 's model of others' model of itself, etc. This is a plausible simplification if the intuitions L0 reflects are widely understood. It is less strained than the knowledge assumptions of equilibrium with payoff perturbations because it refers only to L0 , which reflects more basic intuitions; and because here there is no need for the knowledge about L0 to be mutual or common.
Table II. T
yp
es' Ex
pected Pa
yoffs and Choice Probabilities
when
p^
q^ > 1 and 3
p^ + 2
q^ > 2
Hide
r^
Ex
p. Pa
yoff
Choice Pr.
Ex
p. Pa
yoff
Choice Pr.
Seeke
r^
Ex
p.^
Choice Pr.
Ex
p. Pa
yoff
Choice Pr.
p^ < 2
q^
p^ < 2
q^
p^ > 2
q^
p^ > 2
q^
p^ < 2
q^
p^ < 2
q^
p^ > 2
q^
p^ > 2
q
(Pr.
r )
(Pr.
r )
p /
p /
p /
p /
q^
q^
q^
q
1- p
1- p
1- p
1- p
p /
p /
p /
p /
(Pr.
s )
(Pr.
s )
p /2 < ¾
p /2 < ¾
p /2 > ¼
p /2 > ¼
q^
q^
q^ > ¼
q^ > ¼
p^ +
q^
p^ +
q^
p –
q^ < ¼
p –
q^ < ¼
p /2 < ¾
p /2 < ¾
p /2 > ¼
p /2 > ¼
(Pr.
t )^
(Pr.
t )
(Pr.
u )
(Pr.
u )
(Pr.
v )
(Pr.
v )
Total Probabilit
y^
p^ < 2
q^
p^ > 2
q^
Total Probabilit
y^
p^ < 2
q^
p^ > 2
q
rp /2+
ε )[
t /3+
u /3]+
r ) ε
rp
ε )[
u /3+
v /2]+
r ) ε
rp /2+
ε )[
u /3+
v /3]+
r ) ε
rp
ε )[
s /2+
v /3]+
r ) ε
rq
1- ε
)[ u
v ]+
r ) ε
rq +
ε )[
t /2+
u /3]+
r ) ε
rq +
ε )[
s +
v /3]+
r ) ε
rq
1- ε
)[ u
v /3]+
r ) ε
r (1-
p - q
ε )[
s +
t /3]+
r ) ε
r (1-
p - q
ε )[
s +
t /2]+
r ) ε
r (1-
p - q
1- ε
)[ t
r ) ε
r (1-
p - q
1- ε
)[ t
r ) ε
rp /2+
ε )[
t /3+
u /3]+
r ) ε
rp
ε )[
u /3+
v /2]+
r ) ε
rp /2+
ε )[
u /3+
v /3]+
r ) ε
rp
ε )[
s /2+
v /3]+
r ) ε
Figure 5. L1 's Through L4 's Choices as Functions of L0 's Choice Probabilities
analyses more concrete by using RTH's data to estimate those parameters econometrically. We focus on the equilibrium and level- k models because the estimated QRE model reduces to equilibrium. We then address the issue of overfitting by using estimates computed for each model, treatment by treatment, to "predict" the results of the other five treatments. Our purpose is to illustrate the possibilities of the level- k and equilibrium with perturbations models, not to take a definitive position on the behavioral parameters, which in our view would require more comprehensive experiments, perhaps with a design that tracks individual subjects' choices across different but related games as in Stahl and Wilson (1994, 1995); Costa-Gomes, Crawford, and Broseta (2001); or Costa-Gomes and Crawford (2004). Because both models have enough flexibility to fit the observed choice frequencies very well, our econometrics amount to
5 75 1^ p
q 1
Region 4 L1 H: B L1 L2 S: central AH: B & end As
3 p +2 q = 2 Region 6 L1 H: end As Region 1 L1 S: B^ L1 L1^ H: central AS: B L2L2 H: central & end AsS: end As L2 H: central & end As L3L3 H: B & central AS: central & end As L2 L3 S: central AH: B & end As L4 H: B^ L3 L4^ S: central & end AsH: B L4 S: B & central A L4 S: B & end As
L2 L3 S: BH: central & end As L3 L4 S: B & end AsH: central A L4 S: central & end As
p + 2 q = 1
Region 3 L1 H: B L1 L2 S: end AsH: B & central A
Region 2 L1 H: central A L1 L2 S: end AsH: B & central A Region 5^ p^ = 2 q^ L2L3^ S: central AH: B & end As L1 L1 H: end AsS: central A (^) L3L4 S: B & central AH: end As L2 L2 H: B & end AsS: end As L4 S: B & end As L3 L3 H: B & central AS: B & end As L4 L4 H: central AS: B & central A L2 L3 S: BH: B & central A L3 L4 S: B & central AH: end As L4 S: B & central A
We are unaware of any estimates with which to compare its estimated payoff perturbations, but their signs are behaviorally plausible; however, their magnitudes are nearly twice as large for Hiders than Seekers, a difference for which it seems hard to find a sensible explanation.
Table III. Parameter Estimates and Likelihoods for the Leading Models Model Ln L Parameter Estimates (^) Frequencies for Hiders and SeekersActual and Predicted Choice A B A A Observed choice H^ 0.2163^ 0.2115^ 0.3654^ 0. frequencies (^) S 0.1821 0.2054 0.4589 0. Equilibrium without -1641.4^ H^ 0.2500^ 0.2500^ 0.2500^ 0. perturbations/random (^) S 0.2500 0.2500 0.2500 0. Equilibrium with perturbations of equal -1568.5^ eH^ =^ eS^ = 0.2187^ H^ 0.1897^ 0.2085^ 0.4122^ 0. magnitudes across player roles fH^ =^ fS^ = 0.2010^ S^ 0.1897^ 0.2085^ 0.4122^ 0. Equilibrium with perturbations of -1562.4^ eH^ = 0.2910,^ fH^ = 0.2535^ H^ 0.2115^ 0.2115^ 0.3654^ 0. unrestricted magnitudes across player roles eS^ = 0.1539,^ fS^ = 0.1539^ S^ 0.1679^ 0.2054^ 0.4590^ 0. Level- k with -1564.4^ H^ 0.2052^ 0.2408^ 0.3488^ 0. p > ½, q > ¼, and p > 2 q
r = 0, s = 0.1896, t =0.3185, u =0.2446, v = 0.2473, ε = 0 (^) S 0.1772 0.2047 0.4408 0.
The level- k model's estimates are generally behaviorally plausible, with r = 0, so that L exists only in the minds of L1 through L4 ; and ε = 0, so there are no choices unexplained by L through L4.^25 Because the estimated r = 0, L0 's choice probabilities p and q are not identified; but the region in which the likelihood is maximized subject to p > ½ and q > ¼ is identified as Region 2 (Figure 5), where p > 2 q so L0 favors end more than focally labeled locations.^26 The estimated
(^25) The finding that there are no L0 subjects is consistent with the common finding that people systematically underestimate others' sophistication relative to their own (see for instance Weizsäcker (2003)). In Hide-and-Seek without payoff perturbations, our uniform errors are perfectly confounded with the equilibrium mixed-strategy probabilities. Thus our finding that ε = 0 also suggests the absence of an Equilibrium type. Further, we can reject explanations in which a not-too-large part of the population choose locations with given probabilities (like L0 ) and the rest play equilibrium in a game among themselves, taking the first part's behavior into account, like the Sophisticated type in Costa-Gomes, Crawford, and Broseta (2001), Crawford (2003), or Costa-Gomes and Crawford (2004). 26 The maximized log-likelihood in Region 1 is slightly lower, at -1567.9. In a sense the "best fit" is on the border between Regions 1 and 2, where p = 2 q , which yields log-likelihood -1563.8; but here the parameters are not identified
type frequencies are close to previous estimates, but somewhat more sophisticated than usual (Stahl and Wilson (1994); Costa-Gomes, Crawford, and Broseta (2001); Costa-Gomes and Crawford (2004); Camerer, Ho, and Chong (2004)).^27 This could be due either to more sophisticated subject pools, or to the simplicity and transparency of the strategic questions Hide-and-Seek poses. Given the flexibility of the equilibrium with unrestricted perturbations and level- k models' parameterizations, overfitting is a concern. We test for it by re-estimating the models separately for each of the six treatments and using each re-estimated model to "predict" the observed choice frequencies of the other five treatments. For the level- k model we restrict the estimates to Region 2 ( p > 2 q ) because the parameters are not identified in Region 1 ( p < 2 q ) without imposing r = ε = 0; and while the fit is slightly better in Region 1 for RTH-4 (log-likelihood -143.0 versus -144.2 in Region 2), RT-AABA-Treasure (-361.1 versus -363.6), and R-ABAA (-142.0 versus -142.1), Region 2 yields more sensible parameter estimates and better predictions. We evaluate goodness- of-fit by the mean squared deviation ("MSD") between predicted and observed choice frequencies. Tables IV-VII summarize the results of the overfitting test. Although the equilibrium with unrestricted perturbations model has a fit at least as good in each treatment (Table VII's diagonal), our favored specification of the level- k model has a modest advantage in prediction, with mean squared prediction error 18% lower and better predictions in 20 of 30 comparisons (Table VII).^28
7. Conclusion This paper has compared alternative explanations of the systematic patterns of deviation from equilibrium observed in RTH's experiments with two-person constant-sum Hide-and-Seek games with unique mixed-strategy equilibria and non-neutral framing of locations. We focus on two models, equilibrium with unrestricted payoff perturbations and a structural non-equilibrium model of initial responses based on boundedly rational "level- k " types. A third model, logit QRE with payoff perturbations, reduces when estimated to the equilibrium with perturbations model.
(in RTH's dataset) even if we restrict r = ε = 0. The parameters are identified if we restrict r = s = v = 0, allowing ε > 0, which yields the model with role-dependent L0 of the June 2004 preliminary draft of this paper. Due to these identification problems and the theoretical advantages of role-independent 27 L0 , we focus here on the interiors of regions. When the estimates are computed separately for RTH-4 (the treatment for which we found a marginally significant difference) and the other five treatments pooled, we find that RTH-4's likelihood (subject to p > ½ and q > ¼) slightly favors Region 1 ( p < 2 q ) over Region 2 (log-likelihood -143.0 versus -144.2), with very different type frequencies; while the pooled treatments' likelihood slightly favors Region 2 (log-likelihood -1415.0 versus -1417.8). 28 However, if we allow Region 1 estimates for RTH-4, RT-AABA-Treasure, and R-ABAA and assume r =ε=0 to avoid identification problems, the level- k and equilibrium with unrestricted perturbations models have the same overall MSD.