Continuous Probability Distributions - Assignment | STAT 541, Assignments of Biostatistics

Material Type: Assignment; Class: Introduction to Biostatistics; Subject: STATISTICS; University: University of Wisconsin - Madison; Term: Unknown 1989;

Typology: Assignments

Pre 2010

Uploaded on 09/02/2009

koofers-user-ix2-1
koofers-user-ix2-1 🇺🇸

10 documents

1 / 4

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
1
Continuous Probability Distributions
Recall that we have contrasted continuous variables from discrete variables
by noting that while there are gaps between possible values of a discrete
variable, a continuous variable can take any value in some interval of the
number line.
Suppose Xis a discrete random variable taking on the values 1,2,...,n
with probability distribution function fX(x). Then
P[2 X4] = fX(2) + fX(3) + fX(4)
Suppose Xis a continuous random variable with probability density
function fX(x). Then
P[a < X < b] = Zb
af(x)dx (1)
2
Suppose Xis a continuous random variable taking on any value x[c,d].
Then the probability density function satisfies
fX(x)>0 if x[c, d]
= 0 otherwise
Zd
cfX(u)du = 1,
The probability P(a < X < b) equals
Zb
afX(u)du
3
A random variable Xis said to follow a Gaussian (Normal)
distribution with population parameters µand σif its probability
density function is given by
fX(x) = 1
2πσ2e(xµ)2
2σ2
A random variable Xis said to follow the standard Normal
distribution if its population parameters µand σequal 0 and 1,
respectively. In this case,
fX(x) = 1
2πex2
2
4
There are a number of reasons why the normal distribution is one of
the most useful of all probability densities:
It is simple. It is completely characterized by two population
parameters, µand σ.
It reasonably approximates many distributions found in nature
(“it works”).
It is a physical law that random variables generated from sums of
other random variables tend to become normally distributed as
the number of variables summed gets larger (the famous Central
Limit Theorem).
pf3
pf4

Partial preview of the text

Download Continuous Probability Distributions - Assignment | STAT 541 and more Assignments Biostatistics in PDF only on Docsity!

Continuous Probability Distributions

Recall that we have contrasted continuous variables from discrete variables by noting that while there are gaps between possible values of a discrete variable, a continuous variable can take any value in some interval of the number line.

Suppose X is a discrete random variable taking on the values 1, 2 ,... , n with probability distribution function fX (x). Then

P [2 ≤ X ≤ 4] = fX (2) + fX (3) + fX (4)

Suppose X is a continuous random variable with probability density function fX (x). Then

P [a < X < b] = ∫ (^) b a f^ (x)dx^ (1)

Suppose X is a continuous random variable taking on any value x ∈ [c, d]. Then the probability density function satisfies

fX (x) > 0 if x ∈ [c, d] = 0 otherwise ∫ (^) d c fX^ (u)du^ = 1, The probability P (a < X < b) equals ∫ (^) b a fX^ (u)du

3

A random variable X is said to follow a Gaussian (Normal) distribution with population parameters μ and σ if its probability density function is given by

fX (x) = √ 21 πσ 2 e−^ (x^ −^ μ)

2 2 σ^2

A random variable X is said to follow the standard Normal distribution if its population parameters μ and σ equal 0 and 1, respectively. In this case,

fX (x) = √^12 π e−x

2 2

4 There are a number of reasons why the normal distribution is one of the most useful of all probability densities:

  • It is simple. It is completely characterized by two population parameters, μ and σ.
  • It reasonably approximates many distributions found in nature (“it works”).
  • It is a physical law that random variables generated from sums of other random variables tend to become normally distributed as the number of variables summed gets larger (the famous Central Limit Theorem).

Example ( Normal distribution)

Suppose X represents blood pressure and suppose that the population mean μ = 129 and the population standard deviation σ = 19.8.

P [X > 150] =

∫ (^) ∞ 150

√^1

2 πσ^2 e−^ (x^ −^ μ)

2 2 σ^2 = ∫ (^) ∞ 150

√^1

2 π(19.8)^2 e−^ (x^ −^ (129))

2 2(19.8)^2

The probability that a Normal random variable takes on a value within an interval is equal to the area under the part of the normal density which lies above the interval.

Unfortunately, there is no simple formula for calculating this area, so we need to use a table.

Fortunately, we only need a table for the standard normal density with mean μ = 0 and standard deviation σ = 1.

7

If we want to calculate probabilities for a general normal random variable X with mean μ and SD σ, we need to construct a new random variable, called the standardized score of X,

Z = X^ − σ μ.

Given values for μ and σ, we can actually go back and forth between the “X scale” and the “Z scale:”

X = μ + σZ.

This is similar to how we go back and forth between Celsius (C) and Fahrenheit (F ):

C = F^1 −. 8 32 , and

F = 32 + 1. 8 × C.

8 Two simple rules can be very helpful in calculating normal probabilities:

  • Since the total area under any density is 1, P [Z > z] = 1 − P [Z ≤ z].
  • Since the normal density is symmetric about 0, P [Z < z] = P [Z > −z] = 1 − P [Z ≤ −z];

Suppose we want to know the limits within which 95% of the population lies. From Table A.3, we get that P [Z > 1 .96] = 0. 025

so that P [− 1. 96 ≤ Z ≤ 1 .96] = 0.95. Now,

P [− 1. 96 ≤ Z ≤ 1 .96]

= P [30 × (− 1 .96) + 175 ≤ 30 × Z + 175 ≤ 30 × 1 .96 + 175]

=^. P [116 ≤ X ≤ 234].

We usually write these limits as (116,234) or [116,234].

Forced vital capacity (FVC), a standard measure of pulmonary function, is the volume of air a person can expel in 6 seconds. Current research looks at potential risk factors, such as smoking, air pollution, indoor allergies, that may affect FVC in grade-school children. One problem is that sex, age, and height affect pulmonary function, and these variables must be corrected for before looking at other risk factors. One way to make these adjustments for a particular child is to find the mean μ and standard deviation σ for children of the same age (in 1-year groups), sex, and height (in 2-in. height groups) from large surveys and compute a standardized FVC. The standardized FVC then approximately follows a N(0,1) distribution, provided the original FVC values were approximately normal. Suppose a child is in poor pulmonary health if his or her standardized FVC is less than -1.5. What percentage of children are in poor pulmonary health?

15

Suppose a child is considered to have normal lung growth if his or her standardized FVC is within 1.5 standard deviations of the mean. What proportion of children are within this normal range? What proportion of children are within 2 standard deviations of the mean (how does this match up with the empirical rule)?

16 What proportion of children have standardized FVC’s between -1. and 0.8?