




























































































Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Various topics in probability and random processes, including de morgan's law, conditional probabilities, joint and marginal probability mass functions, geometric and binomial distributions, markov chains, birth-death processes, maximum a posteriori (map) estimation, linear minimum mean-squared error (lmmse) estimation, and likelihood ratio tests. The content is technical and mathematical, suitable for advanced students or researchers in fields like electrical engineering, computer science, or applied mathematics. Detailed derivations, formulas, and examples related to these probability and stochastic modeling concepts.
Typology: Lecture notes
1 / 133
This page cannot be seen from the preview
Don't miss anything!





























































































(last updated: 9/29/22)
Massachusetts Institute of Technology
WWW site for book information and orders http://www.athenasc.com
Solution to Problem 1.1. We have
so A ∪ B = { 2 , 4 , 5 , 6 }, and (A ∪ B)c^ = { 1 , 3 }.
On the other hand,
Ac^ ∩ Bc^ = { 1 , 3 , 5 } ∩ { 1 , 2 , 3 } = { 1 , 3 }.
Similarly, we have A ∩ B = { 4 , 6 }, and
(A ∩ B)c^ = { 1 , 2 , 3 , 5 }.
On the other hand,
Ac^ ∪ Bc^ = { 1 , 3 , 5 } ∪ { 1 , 2 , 3 } = { 1 , 2 , 3 , 5 }.
Solution to Problem 1.2. (a) By using a Venn diagram it can be seen that for any sets S and T , we have S = (S ∩ T ) ∪ (S ∩ T c).
(Alternatively, argue that any x must belong to either T or to T c, so x belongs to S if and only if it belongs to S ∩ T or to S ∩ T c.) Apply this equality with S = Ac^ and T = B, to obtain the first relation
Ac^ = (Ac^ ∩ B) ∪ (Ac^ ∩ Bc).
Interchange the roles of A and B to obtain the second relation.
(b) By De Morgan’s law, we have
(A ∩ B)c^ = Ac^ ∪ Bc,
and by using the equalities of part (a), we obtain
(A∩B)c^ =
(Ac^ ∩B)∪(Ac^ ∩Bc)
(A∩Bc)∪(Ac^ ∩Bc)
= (Ac^ ∩B)∪(Ac^ ∩Bc)∪(A∩Bc).
(c) We have A = { 1 , 3 , 5 } and B = { 1 , 2 , 3 }, so A ∩ B = { 1 , 3 }. Therefore,
(A ∩ B)c^ = { 2 , 4 , 5 , 6 },
The event A ∩ B can be written as the union of two disjoint events as follows:
A ∩ B = (A ∩ B ∩ C) ∪ (A ∩ B ∩ Cc),
so that P(A ∩ B) = P(A ∩ B ∩ C) + P(A ∩ B ∩ Cc). (2)
Similarly, P(A ∩ C) = P(A ∩ B ∩ C) + P(A ∩ Bc^ ∩ C). (3)
Combining Eqs. (1)-(3), we obtain the desired result.
Solution to Problem 1.10. Since the events A ∩ Bc^ and Ac^ ∩ B are disjoint, we have using the additivity axiom repeatedly,
P
(A∩Bc)∪(Ac^ ∩B)
= P(A∩Bc)+P(Ac^ ∩B) = P(A)−P(A∩B)+P(B)−P(A∩B).
Solution to Problem 1.14. (a) Each possible outcome has probability 1/36. There are 6 possible outcomes that are doubles, so the probability of doubles is 6/36 = 1/6.
(b) The conditioning event (sum is 4 or less) consists of the 6 outcomes
2 of which are doubles, so the conditional probability of doubles is 2/6 = 1/3.
(c) There are 11 possible outcomes with at least one 6, namely, (6, 6), (6, i), and (i, 6), for i = 1, 2 ,... , 5. Thus, the probability that at least one die is a 6 is 11/36.
(d) There are 30 possible outcomes where the dice land on different numbers. Out of these, there are 10 outcomes in which at least one of the rolls is a 6. Thus, the desired conditional probability is 10/30 = 1/3.
Solution to Problem 1.15. Let A be the event that the first toss is a head and let B be the event that the second toss is a head. We must compare the conditional probabilities P(A ∩ B | A) and P(A ∩ B | A ∪ B). We have
and
P(A ∩ B | A ∪ B) =
Since P(A ∪ B) ≥ P(A), the first conditional probability above is at least as large, so Alice is right, regardless of whether the coin is fair or not. In the case where the coin is fair, that is, if all four outcomes HH, HT , T H, T T are equally likely, we have
P(A ∩ B) P(A)
A generalization of Alice’s reasoning is that if A′, B′, and C′^ are events such that B′^ ⊂ C′^ and A′^ ∩ B′^ = A′^ ∩ C′^ (for example if A′^ ⊂ B′^ ⊂ C′), then the event
A′^ is at least as likely if we know that B′^ has occurred than if we know that C′^ has occurred. Alice’s reasoning corresponds to the special case where A′^ = A ∪ B, B′^ = A, and C′^ = A ∪ B.
Solution to Problem 1.16. In this problem, there is a tendency to reason that since the opposite face is either heads or tails, the desired probability is 1/2. This is, however, wrong, because given that heads came up, it is more likely that the two-headed coin was chosen. The correct reasoning is to calculate the conditional probability
p = P(two-headed coin was chosen | heads came up)
=
P(two-headed coin was chosen and heads came up) P(heads came up)
We have
P(two-headed coin was chosen and heads came up) =
P(heads came up) =^1 2
so by taking the ratio of the above two probabilities, we obtain p = 2/3. Thus, the probability that the opposite face is tails is 1 − p = 1/3.
Solution to Problem 1.17. Let A be the event that the batch will be accepted. Then A = A 1 ∩ A 2 ∩ A 3 ∩ A 4 , where Ai, i = 1,... , 4, is the event that the ith item is not defective. Using the multiplication rule, we have
Solution to Problem 1.18. Using the definition of conditional probabilities, we have
P(A ∩ B | B) =
Solution to Problem 1.19. Let A be the event that Alice does not find her paper in drawer i. Since the paper is in drawer i with probability pi, and her search is successful with probability di, the multiplication rule yields P(Ac) = pidi, so that P(A) = 1 − pidi. Let B be the event that the paper is in drawer j. If j 6 = i, then A ∩ B = B, P(A ∩ B) = P(B), and we have
pj 1 − pidi
Similarly, if i = j, we have
pi(1 − di) 1 − pidi
Solution to Problem 1.20. (a) Figure 1.1 provides a sequential description for the three different strategies. Here we assume 1 point for a win, 0 for a loss, and 1/2 point
The term pw pd corresponds to the win-draw outcome, the term pw (1 − pd)pw corre- sponds to the win-lose-win outcome, and the term (1 − pw )p^2 w corresponds to lose-win- win outcome.
(b) If pw < 1 /2, Boris has a greater probability of losing rather than winning any one game, regardless of the type of play he uses. Despite this, the probability of winning the match with strategy (iii) can be greater than 1/2, provided that pw is close enough to 1/2 and pd is close enough to 1. As an example, if pw = 0.45 and pd = 0.9, with strategy (iii) we have
P(Boris wins) = 0. 45 · 0 .9 + 0. 452 · (1 − 0 .9) + (1 − 0 .45) · 0. 452 ≈ 0. 54.
With strategies (i) and (ii), the corresponding probabilities of a win can be calculated to be approximately 0.43 and 0.36, respectively. What is happening here is that with strategy (iii), Boris is allowed to select a playing style after seeing the result of the first game, while his opponent is not. Thus, by being able to dictate the playing style in each game after receiving partial information about the match’s outcome, Boris gains an advantage.
Solution to Problem 1.21. Let p(m, k) be the probability that the starting player wins when the jar initially contains m white and k black balls. We have, using the total probability theorem,
p(m, k) =
m m + k
k m + k
1 − p(m, k − 1)
k m + k
p(m, k − 1).
The probabilities p(m, 1), p(m, 2),... , p(m, n) can be calculated sequentially using this formula, starting with the initial condition p(m, 0) = 1.
Solution to Problem 1.22. We derive a recursion for the probability pi that a white ball is chosen from the ith jar. We have, using the total probability theorem,
pi+1 =
m + 1 m + n + 1
pi +
m m + n + 1
(1 − pi) =
m + n + 1
pi +
m m + n + 1
starting with the initial condition p 1 = m/(m + n). Thus, we have
p 2 = 1 m + n + 1
· m m + n
= m m + n
More generally, this calculation shows that if pi− 1 = m/(m + n), then pi = m/(m + n). Thus, we obtain pi = m/(m + n) for all i.
Solution to Problem 1.23. Let pi,n−i(k) denote the probability that after k ex- changes, a jar will contain i balls that started in that jar and n − i balls that started in the other jar. We want to find pn, 0 (4). We argue recursively, using the total probability
theorem. We have
pn, 0 (4) =^1 n
n
· pn− 1 , 1 (3),
pn− 1 , 1 (3) = pn, 0 (2) + 2 ·
n − 1 n
n
· pn− 1 , 1 (2) +
n
n
· pn− 2 , 2 (2),
pn, 0 (2) =
n
n
· pn− 1 , 1 (1),
pn− 1 , 1 (2) = 2 ·
n − 1 n
n
· pn− 1 , 1 (1),
pn− 2 , 2 (2) =
n − 1 n
n − 1 n
· pn− 1 , 1 (1),
pn− 1 , 1 (1) = 1.
Combining these equations, we obtain
pn, 0 (4) =
n^2
n^2
4(n − 1)^2 n^4
4(n − 1)^2 n^4
n^2
n^2
8(n − 1)^2 n^4
Solution to Problem 1.24. Intuitively, there is something wrong with this rationale. The reason is that it is not based on a correctly specified probabilistic model. In particular, the event where both of the other prisoners are to be released is not properly accounted in the calculation of the posterior probability of release. To be precise, let A, B, and C be the prisoners, and let A be the one who considers asking the guard. Suppose that all prisoners are a priori equally likely to be released. Suppose also that if B and C are to be released, then the guard chooses B or C with equal probability to reveal to A. Then, there are four possible outcomes:
(1) A and B are to be released, and the guard says B (probability 1/3). (2) A and C are to be released, and the guard says C (probability 1/3). (3) B and C are to be released, and the guard says B (probability 1/6). (4) B and C are to be released, and the guard says C (probability 1/6).
Thus,
P(A is to be released | guard says B) =
P(A is to be released and guard says B) P(guard says B)
=
Similarly,
P(A is to be released | guard says C) =^2 3
Thus, regardless of the identity revealed by the guard, the probability that A is released is equal to 2/3, the a priori probability of being released.
Solution to Problem 1.25. Let m and m be the larger and the smaller of the two amounts, respectively. Consider the three events
A = {X < m), B = {m < X < m), C = {m < X).
Thus, the observation of a white cow makes the hypothesis “all cows are white” more likely to be true.
Solution to Problem 1.27. Since Bob tosses one more coin that Alice, it is im- possible that they toss both the same number of heads and the same number of tails. So Bob tosses either more heads than Alice or more tails than Alice (but not both). Since the coins are fair, these events are equally likely by symmetry, so both events have probability 1/2. An alternative solution is to argue that if Alice and Bob are tied after 2n tosses, they are equally likely to win. If they are not tied, then their scores differ by at least 2, and toss 2n+1 will not change the final outcome. This argument may also be expressed algebraically by using the total probability theorem. Let B be the event that Bob tosses more heads. Let X be the event that after each has tossed n of their coins, Bob has more heads than Alice, let Y be the event that under the same conditions, Alice has more heads than Bob, and let Z be the event that they have the same number of heads. Since the coins are fair, we have P(X) = P(Y ), and also P(Z) = 1 − P(X) − P(Y ). Furthermore, we see that
Now we have, using the total probability theorem,
as required.
Solution to Problem 1.30. Consider the sample space for the hunter’s strategy. The events that lead to the correct path are:
(1) Both dogs agree on the correct path (probability p^2 , by independence). (2) The dogs disagree, dog 1 chooses the correct path, and hunter follows dog 1 [probability p(1 − p)/2]. (3) The dogs disagree, dog 2 chooses the correct path, and hunter follows dog 2 [probability p(1 − p)/2].
The above events are disjoint, so we can add the probabilities to find that under the hunter’s strategy, the probability that he chooses the correct path is
p^2 +
p(1 − p) +
p(1 − p) = p.
On the other hand, if the hunter lets one dog choose the path, this dog will also choose the correct path with probability p. Thus, the two strategies are equally effective.
Solution to Problem 1.31. (a) Let A be the event that a 0 is transmitted. Using the total probability theorem, the desired probability is
P(A)(1 − 0 ) +
(1 − 1 ) = p(1 − 0 ) + (1 − p)(1 − 1 ).
(b) By independence, the probability that the string 1011 is received correctly is
(1 − 0 )(1 − 1 )^3.
(c) In order for a 0 to be decoded correctly, the received string must be 000, 001, 010, or 100. Given that the string transmitted was 000, the probability of receiving 000 is (1 − 0 )^3 , and the probability of each of the strings 001, 010, and 100 is 0 (1 − 0 )^2. Thus, the probability of correct decoding is
3 0 (1 − 0 )^2 + (1 − 0 )^3.
(d) When the symbol is 0, the probabilities of correct decoding with and without the scheme of part (c) are 3 0 (1 − 0 )^2 + (1 − 0 )^3 and 1 − 0 , respectively. Thus, the probability is improved with the scheme of part (c) if
3 0 (1 − 0 )^2 + (1 − 0 )^3 > (1 − 0 ),
or (1 − 0 )(1 + 2 0 ) > 1 ,
which is equivalent to 0 < 0 < 1 /2.
(e) Using Bayes’ rule, we have
The probabilities needed in the above formula are
P(0) = p, P(1) = 1 − p, P(101 | 0) = ^20 (1 − 0 ), P(101 | 1) = 1 (1 − 1 )^2.
Solution to Problem 1.32. The answer to this problem is not unique and depends on the assumptions we make on the reproductive strategy of the king’s parents. Suppose that the king’s parents had decided to have exactly two children and then stopped. There are four possible and equally likely outcomes, namely BB, GG, BG, and GB (B stands for “boy” and G stands for “girl”). Given that at least one child was a boy (the king), the outcome GG is eliminated and we are left with three equally likely outcomes (BB, BG, and GB). The probability that the sibling is male (the conditional probability of BB) is 1/. Suppose on the other hand that the king’s parents had decided to have children until they would have a male child. In that case, the king is the second child, and the sibling is female, with certainty.
Here, (1 − pi)
j 6 =i pj^ is the probability that^ n^ −^ 1 plants have failed and plant^ i^ is the one that has not failed.
Solution to Problem 1.37. The probability that k 1 voice users and k 2 data users simultaneously need to be connected is p 1 (k 1 )p 2 (k 2 ), where p 1 (k 1 ) and p 2 (k 2 ) are the corresponding binomial probabilities, given by
pi(ki) =
ni ki
pk i i(1 − pi)ni−ki^ , i = 1, 2.
The probability that more users want to use the system than the system can accommodate is the sum of all products p 1 (k 1 )p 2 (k 2 ) as k 1 and k 2 range over all possible values whose total bit rate requirement k 1 r 1 +k 2 r 2 exceeds the capacity c of the system. Thus, the desired probability is
{(k 1 ,k 2 ) | k 1 r 1 +k 2 r 2 >c, k 1 ≤n 1 , k 2 ≤n 2 }
p 1 (k 1 )p 2 (k 2 ).
Solution to Problem 1.38. We have
pT = P(at least 6 out of the 8 remaining holes are won by Telis),
pW = P(at least 4 out of the 8 remaining holes are won by Wendy).
Using the binomial formulas,
pT =
k=
k
pk^ (1 − p)^8 −k^ , pW =
k=
k
(1 − p)k^ p^8 −k^.
The amount of money that Telis should get is 10 · pT /(pT + pW ) dollars.
Solution to Problem 1.39. Let the event A be the event that the professor teaches her class, and let B be the event that the weather is bad. We have
P(A) = P(B)P(A | B) + P(Bc)P(A | Bc),
and
P(A | B) =
i=k
n i
pib(1 − pb)n−i,
P(A | Bc) =
i=k
n i
pig (1 − pg )n−i.
Therefore,
i=k
n i
pib(1 − pb)n−i^ +
i=k
n i
pig (1 − pg )n−i.
Solution to Problem 1.40. Let A be the event that the first n − 1 tosses produce an even number of heads, and let E be the event that the nth toss is a head. We can obtain an even number of heads in n tosses in two distinct ways: 1) there is an even number of heads in the first n − 1 tosses, and the nth toss results in tails: this is the event A ∩ Ec; 2) there is an odd number of heads in the first n − 1 tosses, and the nth toss results in heads: this is the event Ac^ ∩ E. Using also the independence of A and E, qn = P
(A ∩ Ec) ∪ (Ac^ ∩ E)
= P(A ∩ Ec) + P(Ac^ ∩ E) = P(A)P(Ec) + P(Ac)P(E) = (1 − p)qn− 1 + p(1 − qn− 1 ). We now use induction. For n = 0, we have q 0 = 1, which agrees with the given formula for qn. Assume, that the formula holds with n replaced by n − 1, i.e.,
qn− 1 =
1 + (1 − 2 p)n−^1 2
Using this equation, we have
qn = p(1 − qn− 1 ) + (1 − p)qn− 1 = p + (1 − 2 p)qn− 1
= p + (1 − 2 p)
1 + (1 − 2 p)n−^1 2 = 1 + (1^ −^2 p)
n 2
so the given formula holds for all n.
Solution to Problem 1.41. We have
P(N = n) = P(A 1 ,n− 1 ∩ An,n) = P(A 1 ,n− 1 )P(An,n | A 1 ,n− 1 ),
where for i ≤ j, Ai,j is the event that contestant i’s number is the smallest of the numbers of contestants 1,... , j. We also have
P(A 1 ,n− 1 ) = 1 n − 1
We claim that P(An,n | A 1 ,n− 1 ) = P(An,n) =
n
The reason is that by symmetry, we have
P(An,n | Ai,n− 1 ) = P(An,n | A 1 ,n− 1 ), i = 1,... , n − 1 ,
while by the total probability theorem,
P(An,n) =
i=
P(Ai,n− 1 )P(An,n | Ai,n− 1 )
= P(An,n | A 1 ,n− 1 )
i=
P(Ai,n− 1 )
= P(An,n | A 1 ,n− 1 ).
Another possible sample space consists of all the possible ordered color pairs, i.e., {RR, RW, W R, W W }. We then have to calculate the probability of the event {RW, W R}. We consider a sequential description of the experiment, i.e., we first select the first ball and then the second. In the first stage, the probability of a red ball is m/(m+n). In the second stage, the probability of a red ball is either m/(m+n−1) or (m−1)/(m+n−1) depending on whether the first ball was white or red, respectively. Therefore, using the multiplication rule, we have
m m + n
m − 1 m − 1 + n
m m + n
n m − 1 + n
n m + n
m m + n − 1
n m + n
n − 1 m + n − 1
The desired probability is
P
m m + n
n m − 1 + n
n m + n
m m + n − 1
=
2 mn (m + n)(m + n − 1)
(b) We calculate the conditional probability of all balls being red, given any of the possible values of k. We have P(R | k = 1) = m/(m + n) and, as found in part (a), P(RR | k = 2) = m(m − 1)/(m + n)(m − 1 + n). Arguing sequentially as in part (a), we also have P(RRR | k = 3) = m(m − 1)(m − 2)/(m + n)(m − 1 + n)(m − 2 + n). According to the total probability theorem, the desired answer is
1 3
m m + n
m(m − 1) (m + n)(m − 1 + n)
m(m − 1)(m − 2) (m + n)(m − 1 + n)(m − 2 + n)
Solution to Problem 1.52. The probability that the 13th card is the first king to be dealt is the probability that out of the first 13 cards to be dealt, exactly one was a king, and that the king was dealt last. Now, given that exactly one king was dealt in the first 13 cards, the probability that the king was dealt last is just 1/13, since each “position” is equally likely. Thus, it remains to calculate the probability that there was exactly one king in the first 13 cards dealt. To calculate this probability we count the “favorable” outcomes and divide by the total number of possible outcomes. We first count the favorable outcomes, namely those with exactly one king in the first 13 cards dealt. We can choose a particular king in 4 ways, and we can choose the other 12 cards in
12
ways, therefore there are 4 ·
12
favorable outcomes. There are
13
total outcomes, so the desired probability is
For an alternative solution, we argue as in Example 1.10. The probability that the first card is not a king is 48/52. Given that, the probability that the second is
not a king is 47/51. We continue similarly until the 12th card. The probability that the 12th card is not a king, given that none of the preceding 11 was a king, is 37/41. (There are 52 − 11 = 41 cards left, and 48 − 11 = 37 of them are not kings.) Finally, the conditional probability that the 13th card is a king is 4/40. The desired probability is
48 · 47 · · · 37 · 4 52 · 51 · · · 41 · 40
Solution to Problem 1.53. Suppose we label the classes A, B, and C. The proba- bility that Joe and Jane will both be in class A is the number of possible combinations for class A that involve both Joe and Jane, divided by the total number of combinations for class A. Therefore, this probability is
Since there are three classes, the probability that Joe and Jane end up in the same class is
A much simpler solution is as follows. We place Joe in one class. Regarding Jane, there are 89 possible “slots”, and only 29 of them place her in the same class as Joe. Thus, the answer is 29/89, which turns out to agree with the answer obtained earlier.
Solution to Problem 1.54. (a) Since the cars are all distinct, there are 20! ways to line them up.
(b) To find the probability that the cars will be parked so that they alternate, we count the number of “favorable” outcomes, and divide by the total number of possible outcomes found in part (a). We count in the following manner. We first arrange the US cars in an ordered sequence (permutation). We can do this in 10! ways, since there are 10 distinct cars. Similarly, arrange the foreign cars in an ordered sequence, which can also be done in 10! ways. Finally, interleave the two sequences. This can be done in two different ways, since we can let the first car be either US-made or foreign. Thus, we have a total of 2 · 10! · 10! possibilities, and the desired probability is
2 · 10! · 10! 20!
Note that we could have solved the second part of the problem by neglecting the fact that the cars are distinct. Suppose the foreign cars are indistinguishable, and also that the US cars are indistinguishable. Out of the 20 available spaces, we need to choose 10 spaces in which to place the US cars, and thus there are
10
possible outcomes. Out of these outcomes, there are only two in which the cars alternate, depending on
from a l-letter alphabet is equal to
l!
l − 1 w − 1
Solution to Problem 1.58. (a) The sample space consists of all ways of drawing 7 elements out of a 52-element set, so it contains
7
possible outcomes. Let us count those outcomes that involve exactly 3 aces. We are free to select any 3 out of the 4 aces, and any 4 out of the 48 remaining cards, for a total of
3
4
choices. Thus,
P(7 cards include exactly 3 aces) =
(b) Proceeding similar to part (a), we obtain
P(7 cards include exactly 2 kings) =
(c) If A and B stand for the events in parts (a) and (b), respectively, we are looking for P(A ∪ B) = P(A) + P(B) − P(A ∩ B). The event A ∩ B (having exactly 3 aces and exactly 2 kings) can occur by choosing 3 out of the 4 available aces, 2 out of the 4
4 3
2
2
distinct outcomes. Hence,
P(7 cards include 3 aces and/or 2 kings) =
Solution to Problem 1.59. Clearly if n > m, or n > k, or m − n > 100 − k, the probability must be zero. If n ≤ m, n ≤ k, and m − n ≤ 100 − k, then we can find the probability that the test drive found n of the 100 cars defective by counting the total number of size m subsets, and then the number of size m subsets that contain n lemons. Clearly, there are
m
different subsets of size m. To count the number of size m subsets with n lemons, we first choose n lemons from the k available lemons, and then choose m − n good cars from the 100 − k available good cars. Thus, the number of ways to choose a subset of size m from 100 cars, and get n lemons, is
k n
100 − k m − n
k n
100 − k m − n
m
Solution to Problem 1.60. The size of the sample space is the number of different ways that 52 objects can be divided in 4 groups of 13, and is given by the multinomial formula 52! 13! 13! 13! 13!
There are 4! different ways of distributing the 4 aces to the 4 players, and there are
48! 12! 12! 12! 12!
different ways of dividing the remaining 48 cards into 4 groups of 12. Thus, the desired probability is
4!
An alternative solution can be obtained by considering a different, but proba- bilistically equivalent method of dealing the cards. Each player has 13 slots, each one of which is to receive one card. Instead of shuffling the deck, we place the 4 aces at the top, and start dealing the cards one at a time, with each free slot being equally likely to receive the next card. For the event of interest to occur, the first ace can go anywhere; the second can go to any one of the 39 slots (out of the 51 available) that correspond to players that do not yet have an ace; the third can go to any one of the 26 slots (out of the 50 available) that correspond to the two players that do not yet have an ace; and finally, the fourth, can go to any one of the 13 slots (out of the 49 available) that correspond to the only player who does not yet have an ace. Thus, the desired probability is 39 · 26 · 13 51 · 50 · 49
By simplifying our previous answer, it can be checked that it is the same as the one obtained here, thus corroborating the intuitive fact that the two different ways of dealing the cards are probabilistically equivalent.