






Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
In the following Lecture Notes of Business Management, the Lecturer has illustrated these points in detail : The Normal Distribution, Introduction, Formula For the Pdf, Mean, Median, Mode, Notation, Normal Approximation, Probability Calculations, Standard Normal Distribution
Typology: Study notes
1 / 11
This page cannot be seen from the preview
Don't miss anything!







The normal distribution is possibly the best known and most used continuous probability distribu- tion. It provides a good model for data in very many different applications, for example, the yields of crops, the heights of people, students’ marks. The outcomes of many production processes also follow normal distributions and hence it is used widely in industry.
The normal distribution has two parameters: the mean μ and the variance σ^2. The standard devia- tion σ =
σ^2 but we usually use the variance to specify the parameters. The probability density function of a normal distribution is often said to be “bell shaped” :
The formula for the pdf is
f (x) =
2 πσ^2
exp
(x − μ)^2 2 σ^2
There is no simple formula for calculating probabilities. However, they can be determined using tables or statistical packages such as Minitab.
There are four important characteristics of the normal distribution:
Below is a plot of the pdf of normal distributions for different values of μ and σ.
Note that the mean μ locates the distribution on the x–axis and the standard deviation σ affects the spread of the distribution, with larger values giving flatter and wider curves.
If a random variable X has a normal distribution with mean μ and variance σ^2 , then we write
X ∼ N
μ, σ^2
For example, a random variable X which follows a normal distribution with mean 10 and variance 25 is written as X ∼ N (10, 25) or X ∼ N (10, 52 ). It is important to note that the second parameter in this notation is the variance and not the standard deviation.
An important property concerns addition (or subtraction) of normal random variables. If X 1 and X 2 both have normal distributions and Y = X 1 + X 2 then Y also has a normal distribution. If Z = X 1 − X 2 then Z also has a normal distribution. If X 1 ∼ N (μ 1 , σ 12 ) and X 2 ∼ N (μ 2 , σ 22 ) then Y has mean μy = μ 1 + μ 2 and Z has mean μz = μ 1 − μ 2. If X 1 and X 2 are independent then Y and Z both have variance σ^2 y = σ z^2 = σ^21 + σ^22.
For example, if wagonloads of material have weights which are normally distributed with mean 3 tonnes and standard deviation 0.5 tonnes then the total weight from two wagonloads is normally distributed with mean 3 + 3 = 6 tonnes and standard deviation
If X ∼ N (μ, σ^2 ) and a and b are fixed numbers and W = a + bX then W ∼ N (a + bμ, k^2 σ^2. For example, if the weight of a load of material has a normal distribution with mean 3 tonnes and
(e) The probability that the random variable Z lies between than − 1. 2 and 1. 5 is
P (− 1. 2 < Z < 1 .5) = P (Z < 1 .5) − P (Z ≤ − 1 .2) = 0. 9332 − 0. 1151 = 0. 8181.
We now consider how to determine probabilities for general normal distributions.
(a) The probability that an 18-20 year old has an IQ less than 85 is P (X < 85). Using the formula P (X ≤ x) = P
x − μ σ
we need to calculate z =
85 − μ σ
and from tables we obtain P (Z < −1) = 0. 1587. Therefore
P (X < 85) = 0. 1587.
(b) The probability that an 18-20 year old has an IQ greater than 142 is P (X > 142). Now P (X > 142) = 1 − P (X ≤ 142) and
z =
142 − μ σ
Using tables, we see that P (Z ≤ 2 .8) = 0. 9974 and so
P (X > 142) = 1 − 0 .9974 = 0. 0026.
(a) The probability that the tin has less than 25 mg of vitamin C is P (X < 25). Now
z =
25 − μ σ
and from tables we obtain P (Z < 1 .25) = 0. 8944. Therefore
P (X < 25) = 0. 8944.
(b) The probability that the tin has more than 25 mg of vitamin C is P (X > 25). Now P (X > 25) = 1 − P (X ≤ 25) and so
P (X > 25) = 1 − 0 .8944 = 0. 1056.
(c) The probability that the tin has between 18 mg and 25 mg of vitamin C is
P r(18 < X < 25) = P (X < 25) − P (X ≤ 18).
We can determine P (X ≤ 18) from tables using
z =
18 − μ σ
giving P (X ≤ 18) = P (Z < − 0 .5) = 0. 3085. Therefore
P r(18 < X < 25) = 0. 8944 − 0 .3085 = 0. 5859.
We can also use the tables in reverse. For example, we might want to know below what value are 95% of the population. This is equivalent to determining the value of z that satisfies P (Z < z) = 0. 95. From tables, we can see that P (Z < 1 .64) = 0. 9495 and P (Z < 1 .65) = 0. 9505. Therefore the value we want for z lies between 1.64 and 1.65. If a more accurate value is needed we can interpolate between these values: 0.95 is half-way between 0.9495 and 0.9505 and so we take z = 1. 645. This is a more accurate answer and sufficient in most cases. However, the exact value for z can be found from more detailed tables or via a computer package such as Minitab. Here are some more examples.
P (Z < − 1 .28) = 0. 1003 and P (Z < − 1 .29) = 0. 0985
and so we take
z = − 1 .29 +
In other words P (Z < − 1 .2817) = 0. 1 and so 10% of the standard normal population falls below − 1. 2817.
P (X ≤ x) = P
x − μ σ
and so we need x so that P
x − 100 15
We know (from earlier) that P (Z < − 1 .2817) = 0. 1 and therefore we solve
x − 100 15
Minitab is very helpful with calculating normal probabilities. The following commands will cal- culate probabilities P (X < x) and also values of x that satisfy P (X < x) = p:
opens up dialogue box
The normal distribution can be used as an approximation to the binomial distribution Bin(n, p) for large n and medium p. (Say if both np and n(1 − p) are greater than about 7). To approximate
a Bin(n, p) distribution we use a normal distribution with the same mean and variance. That is μ = np and σ^2 = np(1 − p).
Example. We plan to do a market research survey in which people will be asked whether or not they would buy a new product. A random sample of 600 people will be asked. Suppose that the true proportion of people in the population who would buy the product is 40%, i.e. p = 0. 4. Find the probability that, in our survey, between 220 and 260 (inclusive) answer “Yes.”
In this case n = 600 and p = 0. 4 so np = 240 and n(1 − p) = 360. Therefore we can use the approximation. We use the normal distribution with mean μ = np = 240 and variance σ^2 = np(1 − p) = 144. That is N (240, 144). The standard deviation is σ =
If X ∼ N (240, 144) then
Pr(X < 220) = Pr
= Pr(Z < − 1 .67) = 0. 0475.
So Pr( fewer than 220 “Yes” ) ≈ 0. 0475.
We can sometimes obtain a better approximation by using a continuity correction. Since B(n, p) is a discrete distribution, Pr( number of “Yes” = 220) > 0. Using the continuity correction, we count as “220” everything between 219.5 and 220.5 so, for Pr( number of “Yes” < 220) we would use
Pr(X < 219 .5) = Pr
= Pr(Z < − 1 .71) ≈ 0. 0436.
Here the continuity correction makes a noticeable difference. Sometimes it does not.
In the same way
Pr(X < 260 .5) = Pr
= Pr(Z < 1 .71) ≈ 0 .9564 = 1 − 0. 0436.
So the probability that, in our survey, between 220 and 260 (inclusive) people say “Yes” is approx- imately 0. 9564 − 0 .0436 = 0. 9128 ≈ 0. 91.
(a) Find the probability that a bag is underweight. (b) Assuming that the weights of bags are independent, find an approximate value for the probability that, in a batch of 100 bags, more than 15 are underweight.
- z -0.09 -0.08 -0.07 -0.06 -0.05 -0.04 -0.03 -0.02 -0.01 0. The table contains values of P r(Z < z), where Z ∼ N (0, 1).