






Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
The second-ace puzzle A deck has four cards: the ace and deuce of hearts, and the ace and deuce of spades. After a fair shuffle of the deck, two cards are ...
Typology: Exams
1 / 11
This page cannot be seen from the preview
Don't miss anything!







The second-ace puzzle A deck has four cards: the ace and deuce of hearts, and the ace and deuce of spades. After a fair shuffle of the deck, two cards are dealt to Alice. It is easy to see that, at this point, there is a probability of 1/6 that Alice has both aces, a probability of 5/6 that Alice has at least one ace, a probability of 1/2 that Alice has the ace of spades, and a probability of 1/2 that Alice has the ace of hearts: of the six possible deals of two cards out of four, Alice has both aces in one of them, at least one ace in five of them, the ace of hearts in three of them, and the ace of spades in three of them. (For readers unfamiliar with probability, there is an introduction in Chapter 2.)
Alice then says, “I have an ace.” Conditioning on this information (by discarding the possibility that Alice was dealt no aces), Bob computes the probability that Alice holds both aces to be 1/5. This seems reasonable. The probability, according to Bob, of Alice having two aces goes up if he learns that she has an ace. Next, Alice says, “I have the ace of spades.” Conditioning on this new information, Bob now computes the probability that Alice holds both aces to be 1/3. Of the three deals in which Alice holds the ace of spades, she holds both aces in one of them. As a result of learning not only that Alice holds at least one ace, but that the ace is actually the ace of spades, the conditional probability that Alice holds both aces goes up from 1/5 to 1/3. But suppose that Alice had instead said, “I have the ace of hearts.” It seems that a similar argument again shows that the conditional probability that Alice holds both aces is 1/3. Is this reasonable? When Bob learns that Alice has an ace, he knows that she must have either the ace of hearts or the ace of spades. Why should finding out which particular ace it is raise the conditional probability of Alice having two aces? Put another way, if this probability goes up from 1/5 to 1/3 whichever ace Alice says she has, and Bob knows that she has an ace, then why isn’t it 1/3 all along?
The Monty Hall puzzle The Monty Hall puzzle is very similar to the second-ace puzzle. Suppose that you’re on a game show and given a choice of three doors. Behind one is a car; behind the others are goats. You pick door 1. Before opening door 1, host Monty Hall (who knows what is behind each door) opens door 3, which has a goat. He then asks you if you still want to take what’s behind door 1, or to take instead what’s behind door 2. Should you switch? Assuming that, initially, the car was equally likely to be behind each of the doors, naive conditioning suggests that, given that it is not behind door 3, it is equally likely to be behind door 1 and door 2, so there is no reason to switch. On the other hand, the car is equally likely to be behind each of the doors. If it is behind door 1, then you clearly should not switch; but if it is not behind door 1, then it must be behind door 2 (since it is obviously not behind door 3), and you should switch to door 2. Since the probability that it is behind door 1 is 1/3, it seems that, with probability 2/3, you should switch. But if this reasoning is correct, then why exactly is the original argument incorrect? The second-ace puzzle and the Monty Hall puzzle are the stuff of puzzle books. Nevertheless, understanding exactly why naive conditioning does not give reasonable answers in these cases turns out to have deep implications, not just for puzzles, but for important statistical problems.
holds an ace or that she holds the ace of hearts. This puzzle already suggests that there are subtleties involved with conditioning. Things get even more complicated if uncertainty is not represented using probability, or if the new information does not come in a nice package that allows conditioning. (Consider, e.g., information like “people with jaundice typically have hepatitis.”) Chapter 3 examines conditioning in the
It turns out that in order to represent the puzzle formally, it is important to describe the protocol used by Alice. The protocol determines the set of runs, or possible sequences of events that might happen. The key question here is what Alice’s protocol says to do after she has answered “yes” to Bob’s question as to whether she has an ace. Roughly speaking, if her protocol is “if I have the ace of spades, then I will say that, otherwise I will say nothing,” then 1/3 is indeed Bob’s probability that Alice has both aces. This is the conditional probability of Alice having both aces given that she has the ace of spades. On the other hand, suppose that her protocol is “I will tell Bob which ace I have; if I have both, I will choose at random between the ace of hearts and the ace of spades.” Then, in fact, Bob’s conditional probability should not go up to 1/3 but should stay at 1 /5. The different protocols determine different possible runs and so result in different probability spaces. In general, it is critical to make the protocol explicit in examples such as this one.
!NL1TobAGGo,1o1h1-]W1.o LNWoW>1oOQN,),AGA[oW>)WoAW->ogGGoLNWo,1o1h1-]W1.o;A_1Lo
W>)Wo!NL1TobAGGo,1o (>)Wo)TT]HOWANLTo)Q1oONTTA,G1o),N]WoW>1oOQN,),AGA[oW>)WoW>1o ;])Q.oW1GGToAW->oW>)Wo!NL1TobAGGo,1o1h1-]W1.o;A_1LoW>)WoAW->o1T-)O1To1h1-]WANLo
ҔěҔ*Ҕ0 Ҕ
82 Chapter 3 Updating Beliefs
now, I just consider how sets of probabilities can be used to deal with the three-prisoners puzzle.
Example 3.3.1 The three-prisoners puzzle is an old chestnut that is somewhat sim- ilar in spirit to the second-ace puzzle discussed in Chapter 1, although it illustrates somewhat different issues.
Of three prisoners a, b, and c, two are to be executed, but a does not know which. He therefore says to the jailer, “Since either b or c is certainly going to be executed, you will give me no information about my own chances if you give me the name of one man, either b or c, who is going to be executed.” Accepting this argument, the jailer truthfully replies, “b will be executed.” Thereupon a feels happier because before the jailer replied, his own chance of execution was 2/3, but afterward there are only two people, himself and c, who could be the one not executed, and so his chance of execution is 1/2.
Note that in order for a to believe that his own chance of execution was 2/3 before the jailer replied, he seems to be implicitly assuming the principle of indifference. A straightforward application of the principle of indifference also seems to lead to a’s believing that his chances of execution goes down to 1/2 after hearing the jailer’s statement. Yet it seems that the jailer did not give him any new relevant information. Is a justified in believing that his chances of avoiding execution have improved? If so, it seems that a would be equally justified in believing that his chances of avoiding execution would have improved if the jailer had said “c will be executed.” It seems that a’s prospects improve no matter what the jailer says! That does not seem quite right. The principle of indifference is implicitly being applied here to a space consisting of three worlds—say wa , wb , and w (^) c —where in world w (^) x , prisoner x is pardoned. But this representation of a world does not take into account what the jailer says. Perhaps a better representation of a possible situation is as a pair (x, y), where x, y ∈ {a, b, c}. Intuitively, a pair (x, y) represents a situation where x is pardoned and the jailer says that y will be executed in response to a’s question. Since the jailer answers truthfully, x ̸= y; since the jailer will never tell a directly that a will be executed, y ̸= a. Thus, the set of possible worlds is {(a, b), (a, c), (b, c), (c, b)}. The event lives - a—a lives—corresponds to the set {(a, b), (a, c)}. Similarly, the events lives - b and lives - c correspond to the sets {(b, c)} and {(c, b)}, respectively. Assume in accord with the principle of indifference that each prisoner is equally likely to be pardoned, so that each of these three events has probability 1/3. The event says-b—the jailer says b—corresponds to the set {(a, b), (c, b)}; the story does not give a probability for this event. To do standard probabilistic condition-
3.3 Conditioning with Sets of Probabilities 83
ing, this set must be measurable and have a probability. The event {(c, b)} ( lives-c ) has probability 1/3. But what is the probability of {(a, b)}? That depends on the jailer’s strategy in the one case where he has a choice, namely, when a lives. He gets to choose between saying b and c in that case. The probability of (a, b) depends on the probability that he says b if a lives; that is, μ(says-b | lives-a). If the jailer applies the principle of indifference in choosing between saying b and c if a is pardoned, so that μ(says-b | lives-a) = 1 /2, then μ({(a, b)}) = μ({(a, c)}) = 1 /6, and μ(says-b) = 1 /2. With this assumption,
μ(lives-a | says-b) = μ(lives-a ∩ says-b)/μ(says-b) = ( 1 / 6 )/( 1 / 2 ) = 1 /3.
Thus, if μ(says-b) = 1 /2, the jailer’s answer does not affect a’s probability. Suppose more generally that μα , 0 ≤ α ≤ 1, is the probability measure such that μα ( lives- a) = μα ( lives- b) = μα ( lives- c) = 1 /3 and μα (says-b | lives-a) = α. Then straightforward computations show that
μ (^) α ({(a, b)}) = μ (^) α (lives-a) × μα (says-b | lives-a) = α/3, μ (^) α (says-b) = μα ({(a, b)}) + μ (^) α ({(c, b)}) = (α + 1 )/3, and
μ (^) α (lives-a | says-b) =
α/ 3 (α + 1 )/ 3
= α/(α + 1 ).
Thus, μ (^1) / 2 = μ. Moreover, if α ̸= 1 /2 (i.e., if the jailer had a particular preference for answering either b or c when a was the one pardoned), then a’s probability of being executed would change, depending on the answer. For example, if α = 0, then if a is pardoned, the jailer will definitely say c. Thus, if the jailer actually says b, then a knows that he is definitely not pardoned, that is, μ 0 (lives-a | says-b) = 0. Similarly, if α = 1, then a knows that if either he or c is pardoned, then the jailer will say b, while if b is pardoned the jailer will say c. Given that the jailer says b, from a’s point of view the one pardoned is equally likely to be him or c; thus, μ 1 (lives-a | says-b) = 1 /2. In fact, it is easy to see that if PJ = {μα : α ∈ [0, 1]}, then (PJ |says-b)∗ (lives-a) = 0 and (PJ |says-b)∗^ (lives-a) = 1 /2. To summarize, the intuitive answer—that the jailer’s answer gives a no information—is correct if the jailer applies the principle of indifference in the one case where he has a choice in what to say, namely, when a is actually the one to live. If the jailer does not apply the principle of indifference in this case, then a may gain information. On the other hand, if a does not know what strategy the jailer is using to answer (and is not willing to place a probability on these strategies), then his prior point probability of 1/3 “diffuses” to an interval.