Probability Theory: Expected Winnings and Distributions, Exams of Probability and Statistics

Solutions to the final exam questions of a university course on probability theory. The topics covered include expected winnings from a game, the chi-squared distribution, joint probability density functions, and conditional probabilities. Students can use this document as study notes, summaries, or as a reference for understanding these concepts.

Typology: Exams

2012/2013

Uploaded on 02/21/2013

rahas
rahas 🇮🇳

4.3

(15)

86 documents

1 / 6

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Math/Stats 425, Sec. 1, Fall ’04: Introduction to Probability
Final Exam: Solutions
1. In a game, a contestant is shown two identical envelopes containing money. The
contestant does not know how much money is in either envelope. The contestant can
choose an envelope at random (equally likely), see how much money is in it, and then
either keep this amount or exchange the chosen envelope for the other one, keeping the
money in the second envelope. Now suppose that one envelope has $1000, and the other
has $2000. A strategy to play this game is for the contestant to fix a number x, and if
the first envelope is revealed to have $x, then one switches; if it has more than $x,
one keeps the first envelope. What is the expected amount of money the contestant gets
following this strategy if x= 500? What is the expected amount won if the strategy is
used with x= 1500? Finally, what is the expected amount won if the strategy is used
with x= 2500?
[25]
This is a simple expectation calculation. Let Ybe the RV “the winnings”. The two
possibilities are according to whether the $1000 envelope or the $2000 envelope is chosen,
and these are equally likely. In the first strategy, x= 500, if one were to draw $ 1000,
one would hold it, and if one drew $2000, one would hold that, too. So, the expectation
under this strategy is E(Y|x= 500) = 1000 ·1
2+ 2000 ·1
2= 1500.When x= 1500, if one
drew $1000, one would exchange it for the $2000 envelope, and if one drew $2000, one
keeps it. Therefore, E(Y|x= 1500) = 2000 ·1
2+ 2000 ·1
2= 2000.Finally, if one used the
strategy for x= 2500, then if one drew $1000, one would turn it back for the envelope
with $2000, and if one drew $2000, one would exchange it for the envelope with $1000,
so E(Y|x= 2500) = 2000 ·1
2+ 1000 ·1
2= 1500 again. In fact,
E(Y|x) = 1500,if x < 1000,or x2000,and,
2000,if 1000 x < 2000.
Remark: Since you don’t know beforehand what the amounts in the envelopes will be,
one could randomize one’s choice of the strategy x, i.e., replace xby a non-negative random
variable Xso that we could now compute the expected winnings with this strategy as
E(Y) = E(E(Y|X)),
by the conditional expectation formula, and this last is always 1500, and if P(1000
X<2000) >0,then E(Y)>1500.It pays to guess!
2. Let Zbe the unit normal random variable, i.e., a normal R.V. with mean 0 and
variance 1. Let Ybe the random variable Z2. [Yis the χ2(chi-squared) distribution with
one degree of freedom. The χ2-distributions are very important in statistics.]
(i) What are the possible values of Y?
Since Zcan take on any real value, Ycan take on any non-negative real value.
(ii) What is the formula for the probability density function fY(y)?
1
pf3
pf4
pf5

Partial preview of the text

Download Probability Theory: Expected Winnings and Distributions and more Exams Probability and Statistics in PDF only on Docsity!

Math/Stats 425, Sec. 1, Fall ’04: Introduction to Probability

Final Exam: Solutions

  1. In a game, a contestant is shown two identical envelopes containing money. The contestant does not know how much money is in either envelope. The contestant can choose an envelope at random (equally likely), see how much money is in it, and then either keep this amount or exchange the chosen envelope for the other one, keeping the money in the second envelope. Now suppose that one envelope has $1000, and the other has $2000. A strategy to play this game is for the contestant to fix a number x, and if the first envelope is revealed to have ≤ $x, then one switches; if it has more than $x, one keeps the first envelope. What is the expected amount of money the contestant gets following this strategy if x = 500? What is the expected amount won if the strategy is used with x = 1500? Finally, what is the expected amount won if the strategy is used with x = 2500?

[25]

  • This is a simple expectation calculation. Let Y be the RV “the winnings”. The two possibilities are according to whether the $1000 envelope or the $2000 envelope is chosen, and these are equally likely. In the first strategy, x = 500, if one were to draw $ 1000, one would hold it, and if one drew $2000, one would hold that, too. So, the expectation under this strategy is E(Y|x = 500) = 1000 · 12 + 2000 · 12 = 1500. When x = 1500, if one drew $1000, one would exchange it for the $2000 envelope, and if one drew $2000, one keeps it. Therefore, E(Y|x = 1500) = 2000 · 12 + 2000 · 12 = 2000. Finally, if one used the strategy for x = 2500, then if one drew $1000, one would turn it back for the envelope with $2000, and if one drew $2000, one would exchange it for the envelope with $1000, so E(Y|x = 2500) = 2000 · 12 + 1000 · 12 = 1500 again. In fact,

E(Y|x) =

1500 , if x < 1000 , or x ≥ 2000 , and, 2000 , if 1000 ≤ x < 2000.

Remark: Since you don’t know beforehand what the amounts in the envelopes will be, one could randomize one’s choice of the strategy x, i.e., replace x by a non-negative random variable X so that we could now compute the expected winnings with this strategy as

E(Y) = E(E(Y|X)),

by the conditional expectation formula, and this last is always ≥ 1500, and if P (1000 ≤ X < 2000) > 0 , then E(Y) > 1500. It pays to guess!

  1. Let Z be the unit normal random variable, i.e., a normal R.V. with mean 0 and variance 1. Let Y be the random variable Z^2. [Y is the χ^2 (chi-squared) distribution with one degree of freedom. The χ^2 -distributions are very important in statistics.]

(i) What are the possible values of Y?

  • Since Z can take on any real value, Y can take on any non-negative real value.

(ii) What is the formula for the probability density function fY(y)?

  • We use the usual technique: fY(y) = (^) dyd FY(y), and

FY(y) = P (Y ≤ y) = P (−

y ≤ Z ≤

y) =

2 π

∫ √y

−√y

e−^

t 22 dt,

for y ≥ 0. Taking the derivative of this integral with respect to y we get

d dy

FY(y) =

2 π

y

e−y/^2 −

y

e−y/^2 },

where the first term inside the {}’s comes from differentiating the upper end point of the integral and the second term comes from differentiating the lower endpoint of the integral, using the fundamental theorem of calculus and the chain rule. Altogether we have

fY(y) =

2 πy

e−y/^2 , for 0 < y.

(iii) Y has the same distribution as one of the Gamma distributions. For which parameters (s, λ) is the Gamma distribution the same as that of Y?

  • The Gamma[λ, s] distrbution has pdf, for 0 ≤ y, given by λe

−λs(λy)s− 1 Γ(s). Comparing the exponents on the e’s above, we see that −y 2 should match −λy, which gives λ = 12. Now comparing the exponents on the y’s in both expressions we get y−^1 /^2 = ys−^1 , which gives s = 12. All that is left to match is the constant Γ(s), and the law of total probability then tells us that Γ(^12 ) =

π. Note that this last part is not a part of the question: that is, once you have matched the s and λ, and have only constant factors left to compare, the law of total probability tells you that they have to match, and it isn’t necessary to show this to know that the probability density functions are the same.

[30]

  1. Suppose X, Y are two jointly distributed RV’s with joint probability density function

fX,Y(x, y) =

12 xy(1 − x) 0 < x < 1 , 0 < y < 1 , 0 otherwise

(i) Check that fX,Y(x, y) is indeed a probability density function.

  • This just means checking two things: first, that fX,Y(x, y) is non-negative for (x, y) in the square S = { 0 < x < 1 , 0 < y < 1 }, which is obvious, and that

S fX,Y(x, y)^ dx dy^ = 1, which is true and is an elementary calculus exercise to check.

(ii) What is the probability density function of X? What is the p.d.f. of Y?

  • To find the marginals, we just have to integrate out the y or the x according to the case. So,

fX(x) =

0

12 xy(1 − x)dy = 12x(1 − x)

0

ydy = 6x(1 − x), 0 < x < 1.

and

fY(y) = 12y

0

x(1 − x)dx = 2y, 0 < y < 1.

then what can you say about the RV’s U 1 , U 2?

  • In this case, under technical assumptions which we will ignore here, the two distributions have to be the same: see the textbook, section 7.1, p. 366, just above example 6e, and examples 6e through 6h.

Now let X, Y be two independent RV’s both ∼ Exponential[λ]. Recall that these may be thought of as waiting time distributions.

(ii) Use moment generating functions to show that X + Y ∼ Gamma[2, λ]. What is the interpretation of this in terms of waiting times?

  • Use the principle in part (i):

MX+Y(t) = MX(t) · MY(t) =

λ λ − t

λ λ − t

λ λ − t

)^2 ,

where we have used the tables for moment generating functions. Again using the table for the m.g.f. for the Gamma[s, λ] distribution, we see that the m.g.f. is ( (^) λλ−t )s, and so we have matched a Gamma distribution with s = 2 and λ as given.

In terms of waiting tiimes, it says that we can interpret Gamma[2,λ] as the waiting time distribution, waiting for the arrival of first one, then another event according to independent Exp[λ] distributions.

(iii) If X + Y = T > 0, what are the possible values of X? What is the conditional distribution for X, given that X + Y = T? Interpret this in terms of waiting times.

  • In this conditioned situation, 0 ≤ X ≤ T , since Y ≥ 0. Then the conditional distribution function is given, for 0 ≤ x ≤ T , by:

fX|X+Y=T (x) =

fX,Y(x, T − x) fX+Y(T )

λ^2 e−λx^ · e−λ(T^ −x) λe−λT^ (λT )(2−1)^

where the last numerator comes from the Gamma[2, λ] distribution. Doing a little algebra to cancel stuff top and bottom simplifies things to:

fX|X+Y=T (x) =

T

, for 0 ≤ x ≤ T,

i.e., X is unformly distributed over the interval [0, T ]. This means , in terms of waiting times, that if we know the second of these two arrivals occurs at time T , then the first arrival was a random time anywhere prior to time T , i.e., uniformly distributed on the interval [0, T ].

Now, I will accept an argument like the above, but strictly speaking, you should make a change of variables first. I will show you the argument in detail, and point out why you get the right answer either way in this case (something = 1, so you can’t see the possible difference).

Let us change (random) variables from X, Y, to X, S = X + Y. At the level of possible values, this amounts to (x, s) = g(x, y) = (x, x + y) = (g 1 (x, y), g 2 (x, y)). The change of variables formula gives us

fX,S(x, s) = fX,Y(g−^1 (x, s)) · | det(Dg(g−^1 (x, s))|,

where

Dg(x, y) =

∂g 1 ∂x

∂g 1 ∂y

∂g 2 ∂x

∂g 2 ∂y

and so, det(Dg(g−^1 (x, s)) = 1, and

fX,S(x, s) = fX,Y(x, s − x) = λe−λ xλe−λ(s−x)^ = λ^2 e−λs, 0 ≤ x ≤ s.

We also have fS(s) = λe−λs(λs)^2 −^1 = λ^2 se−λs.

Now,

fX|X+Y=T (x) = fX|S=T (x) =

fX,S(x, T ) fS (T )

λ^2 e−λT λ^2 T e−λT^

T

, 0 ≤ x ≤ T,

as found above.

[30]

  1. A certain component is necessary for the operation of a computer server system and must be replaced immediately upon failure for the server to continue operating. The lifetime of such a component is a random variable X with mean lifetime 100 hours and standard deviation 30 hours. Estimate the number of such components which must be in stock in order to be 95% sure that the system can run continuously for the next 2000 hours? (We ignore any other possible failure for the server!)
  • Let us denote Xi a sequence of i.i.d.’s, all with the same distribution as X, and let SN = X 1 +... + XN be the total time one can run the server if one has N copies of the critical component. The question asks you to estimate an N for which P (SN ≥ 2000) ≥ 0 .95. We can use the Central Limit Theorem to do this. Let’s check the expectation of SN :

E(SN ) = N E(X 1 ) = 100N.

To estimate this, we are thinking of N so large that 2000 is an outlier on the short side, so we are probably looking for an N significantly > 20. In terms of the CLT, we want:

  1. 95 ≤ P (SN ≥ 2000) = P (

SN − 100 N

N

2000 − 100 N

N

) ≈ P (Z ≤

100 N − 2000

N

and since the cumulative distribution function FZ(z) = Φ(z), we have to choose N at least large enough to make 10030 N√^ −N^2000 ≥ 1 .645, since Φ(1.645) = 0. 95. That is, we want to guarantee that 100 N − 1. 645 · 30

N − 2000 ≥ 0.

Now replacing

N by t ( and N by t^2 ), we want to guarantee that

100 t^2 − 1. 645 · 30 t − 2000 ≥ 0.

Now you can either do the next step by trial and error or a calculator, or just say what should happen at this point. I would try to solve these equations for the first value of t > 0 where the parabola crosses the x-axis, that is, solve

100 t^2 − 1. 645 · 30 t − 2000 = 0.

The positive root is t = 4.726, or N ≥ 22 .332. So, according to this estimate, we have to have at least 23 components to get the components working at least 2000 hours, with probability ≥ 95%.