




Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
A collection of problems and exercises related to probability theory and random variables, including topics such as probability spaces, random variables, expected values, and measure-theoretic framework. The exercises cover various concepts, such as estimating the average number of raisins per cookie, describing the sigma-algebra generated by a random variable, determining whether a sequence of real numbers is a random variable, and proving that a probability measure cannot be extended to a sigma-field. The document also includes exercises on generating random numbers with specific probability densities and assessing the quality of random number generation.
Typology: Assignments
1 / 8
This page cannot be seen from the preview
Don't miss anything!





This homework has 155 points for MATP 6960 (200 points for MATP 4960) plus 45 bonus points available but, as always, homeworks are graded out of 100 points. Full credit will generally be awarded for a solution only if it is both correctly and efficiently presented using the techniques covered in the lecture and readings, and if the reasoning is properly explained. If you used software or simulations in solving a problem, be sure to include your code, simulation results, and/or worksheets documenting your work. If you score more than 100 points, the extra points do count toward your homework total.
Suppose in a group of 8 people, a committee of three members is selected in the first year, a committee of four members is selected in the second year, and a committee of five members is selected in the third year. Each of these committees is supposed to be chosen at random with no favoritism, and membership on a committee one year does not alter the probability for being chosen to serve on a committee in a future year. One of the group of 8 people was chosen for none of the committees and claims that there must be some deliberate manipulation of the random selection process. In an ensuing civil court case, a lawyer hires you as a mathematical consultant to provide some hard numbers.
a. (5 points) If the committee selection process were truly fair, what would be the probability that the particular person in question would be chosen for none of the committees in the three years?
b. (10 points) If the committee selection process were truly fair, what would be the probability that each person in the group would be selected for a committee at least once in the three years?
Suppose you are running a snack factory and are designing a new line of raisin cookies. Assuming the raisins just get mixed in a big batch of dough before being separated into cookie pieces, estimate the average number of raisins per cookie you would need so that at least 99% of the cookies have raisins. Explain your reasoning.
Suppose bits of information, with values 0 or 1 are sent over a transmission line, but are subject to distortion, which we model by adding a normally distributed random variable with mean 0 and standard deviation σ to the value of 0 or 1 that was intended to be transmitted. (Of course some measurable quantity like signal amplitude or frequency is altered rather than a literal value of 0 or 1 in the line, but the idea is the same.) Suppose that the receiver assumes the value 0 was transmitted if he receives a value less than 1/2, and assumes the value 1 was transmitted if he receives a value greater than or equal to 1/2. What is the probability that the receiver makes an error in interpreting the transmitted bit? Express your answer in terms of σ and as always, explain your reasoning, but you do not need to simpilfy your answer.
Suppose a random variable X is uniformly distributed on the real interval [a, b]. Show that |X| has an absolutely continuous probability distribution and derive its probability density.
for all n = 1, 2 ,.. ..
a. (15 bonus points) Prove that P cannot be extended to a probability measure on σ(C), the σ-field generated by C.
b. (5 bonus points) Explain carefully and precisely why this result is not in contradiction with the probability extension theorem stated in, for example, Grigoriu, Stochastic Calculus, Sec. 2.3.3.
Consider a probability space (Ω, F, P ) where Ω = S^1 = {x ∈ R^3 ||x| = 1} is the unit sphere in three dimensions, F = B(S^1 ), and P is the uniform probability measure. Define random variables Θ and Φ on this probability space as the values of the usual (poloidal and azimuthal) spherical angles, that is, so that the random variables corresponding to the Cartesian coordinates X 1 , X 2 , and X 3 are expressed:
X 1 = sin Θ cos Φ, X 2 = sin Θ sin Φ, X 3 = cos Θ.
a. (5 points) Derive the CDF for Φ, and if the probability distribution is abso- lutely continuous, provide the probability density.
b. (8 points) Derive the CDF for Θ, and if the probability distribution is abso- lutely continuous, provide the probability density.
c. (7 points) Determine whether or not Φ and Θ are independent random vari- ables, and justify your conclusion.
Consider a cell with N chromosomes (need not be human!) which is subjected to radiation which causes some of the chromosomes to break and recombine with each other. In particular, we define a chromosomal interchange to be an event in which two chromosomes break, exchange one of their pieces with each other, and each recombine (the wrongly matched pieces). Suppose r chromosomal interchanges occur, and that each chromosome is equally likely to be involved in a chromosomal interchange. What is the probability that exactly m of the N chromosomes will be corrupted due to an interchange? You may ignore the possibility of a chromosomal interchange being “undone” by just the right rebreaking and recombination.
Suppose a parallel computing algorithm delegates three tasks to three separate pro- cessors, to which we will refer as A, B, and C. Each processor starts its task at the same time and performs its computation in parallel, except near the end of its com- putation, processor B requires the result from processor A, and similarly, near the end of its computation, processor C requires the result from processor B. Therefore, the parallel algorithm will proceed with minimal waste of processor time if the pro- cessors finish their main computations in the order: A then B then C. By “main computation” we mean the computations up to the point that a processor needs a result from another processor. The times required by each processor for their main computation are rather uncertain, however, due to variability in the computational task to be performed and the load on the processors due to other computations. Suppose that, from some statistical analysis, we model processor A as requiring a uniformly distributed amount of time varying between 1 and 2 seconds for its main computation, and processor B as requiring an exponentially distributed amount of time with mean 2 seconds for its main computation. Processor C’s main computation is actually two subtasks which are performed in succession (one right after the other), each of which requires a time which is exponentially distributed with mean 1.5 sec- onds. Under such a probability model, what fraction of the time will the processors complete their main computations in an order other than the desired one (A then B then C)?
In electronic and mechanical control systems such as shock absorbers in vehicles, it is sometimes desirable to introduce a critically damped oscillator. A linear mechanical oscillator can be characterized by mass m, force constant k, and damping coefficient γ; and its response to disturbances characterized by its eigenvalues which are the roots of the equation mλ^2 + γλ + k = 0.
(Similar mathematical equations apply for electronic circuits, though different sym- bols are typically used.) The system is critically damped when the eigenvalues have negative real part and no imaginary part, which happens for example when the sys- tem is designed with m = k = 1 and γ = 2 (in suitably nondimensionalized units). However, in the manufacturing of the circuit, these parameters have uncertainties. For simplicity, suppose m is uniformly distributed between 1 − δm and 1 + δm, k is uniformly distributed between 1 − δk and 1 + δk, and γ is uniformly distributed between 2 − δγ and 2 + δγ , where δk, δm, and δγ are certain nondimensionalized un- certainty parameters. Suppose that undesirable ringing occurs when an eigenvalue has an imaginary part which is greater in magnitude than some critical value Ω. Un-
simulated. Plot the “theoretical” and “simulated” binned probability distributions on the same graph. How do they compare?
2.9.3 Comparison Against Correct Mean and Variance (10 points)
To make the comparison more quantitative, compute the sample mean
μ ˆ =
j=
Xj
and sample variance
σ ˆ^2 =
j=
(Xj − μˆ)^2
where N is the number of random variables you simulate and Xj are the realizations of the random variable. Compare these sample means and variances to the correct theoretical values. Be sure to explain how you obtain the theoretical values.
You may be puzzled by why the sample variance has N − 1 rather than N in the denominator. The reason is statisical: dividing by N would produce an underestima- tion bias of the variance. That’s because you’re computing the fluctuations from your sample mean, but your sample mean is itself a random variable with variance σ^2 /N. To correct for this (since you’re trying to estimate the mean-square fluctuations with respect to the true mean μ), the denominator for ˆσ^2 is made to be N − 1.
Sometimes it is helpful to consider a single continuous state space (such as S = Rd) where both discretely and continously distributed random variables can live. One does this by associating to each random variable X a measure PX so that
Prob (X ∈ B) = PX(B)
for all Borel sets B ⊆ S. Then averages involving the random variable X are expressed as
〈f (X)〉 =
S
f (x) dPX(x) (1)
with the integral interpreted in the sense of Lebesgue (as a limit where f is approxi- mated by “simple functions”). See Billingsley’s Probability and Measure or Folland’s Real Analysis for a decent discussion and other texts for not so nice discussions.
We discussed in class the case of an (absolutely) continuously distributed random variable, for which there exists an (integrable) probability density pX so that
B
pX(x) dx. (2)
Then
〈f (X)〉 =
S
f (x) dPX(x) =
S
f (x)p(x) dx
can be interpreted in the less intimidating Riemann-Stieltjes sense (limit of a fine discretization of the independent variable x) if f and pX are piecewise continuous.
In this problem, consider a random variable X which only takes a finite or count- ably infinite set of values {an}n∈I with
Prob {X = an} = pn
and
n∈I pn^ = 1. The twist is that rather than taking the state space to be^ {an}n∈I^ , we take it to be S = Rd. You are asked to show this can be done within the general measure-theoretic framework.
To do this, define the measure
PX =
n∈I
pnδan
where the Dirac measure δx is defined:
δx(B) =
1 if x ∈ B, 0 if x 6 ∈ B.
for Borel sets B ⊆ S. Note that the measure PX is certainly not absolutely continuous (with respect to Lebesgue measure), so that one cannot write an expression like (2) for an ordinary function pX (x).
Prove that the general measure-theoretic definition of the statistical average (1), interpreted as a Lebesgue integral, recovers the familiar result from discrete proba- bility:
〈f (X)〉 =
S
f (x) dPX(x) =
n∈I
pnf (an),
provided f is a continuous function.
What can go wrong if f is discontinuous?