Statistics Notes: Mean and Median, Variance and Standard Deviation - Prof. Patrick S. Murp, Study notes of Data Analysis & Statistical Methods

An explanation of the concepts of mean and median as measures of central tendency in statistics, as well as an introduction to variance and standard deviation as measures of dispersion. It also includes instructions on how to calculate these statistics using a calculator.

Typology: Study notes

Pre 2010

Uploaded on 08/03/2009

koofers-user-fwm-2
koofers-user-fwm-2 🇺🇸

10 documents

1 / 1

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Mean/Median NOTES
n
= number of individuals in the data set.
The symbol
represents sum.
It is the capital letter S in the Greek alphabet and is pronounced “sigma.”
CENTER
MEAN: the average of a data set. The sample mean is denoted by
x
and is pronounced “x bar”.
== x
n
n
x
x1
MEDIAN: the 50
th
percentile of a data set.
This says that 50% of the individuals have data values less than or equal to the median.
CASE:
n
is odd.
The median is the middle number in the ordered list.
CASE:
n
is even.
The median is the average of the two middle numbers in the ordered list.
The mean is affected by outliers and strong skewness. The median is more resistant (not affected)
to outliers and skewness. However, the mean provides more information than the median because it looks
at the value of every data. We prefer to use the mean as the measure of center except when we have
outliers or definite skewness. When that happens we use the median as the measure of center.
FIVE-NUMBER SUMMARY: minX,
1
Q
, Median,
3
Q
, maxX.
VARIANCE
Loosely speaking, the variance is the average distance the data values are distributed about the mean.
(
)
1
2
=
n
xx
VAR
The formula squares the distances of the data from the mean so that we are adding positive numbers
together in the sum, otherwise we get
(
)
0=
xx
. We divide by
1
n
(this is the “degrees of
freedom” as we shall see later on in inferential statistics) because if you have
1
n
of the differences then
the last one, or
th
n
one, is just the additive inverse (since
(
)
0=
xx
).
STANDARD DEVIATION: the square root of the variance.
The sample standard deviation is denoted by
x
S
.
Since we are summing up squares in the variance, we take the square root of the VAR in order that the
units of measure (meters, gallons, $, etc.) are not squared. This gives us the standard deviation of data
about the mean.
To get the values of
x
,
x
S
and the Five-Number Summary on your calculator, press STAT go to the
CALC menu and press 1 for 1-VAR-STATS. State the list you are using and then press ENTER.
( )
1
2
=
n
xx
S
x
( )
2
x
x
SVAR
VARS
=
=

Partial preview of the text

Download Statistics Notes: Mean and Median, Variance and Standard Deviation - Prof. Patrick S. Murp and more Study notes Data Analysis & Statistical Methods in PDF only on Docsity!

Mean/Median NOTES

n = number of individuals in the data set.

The symbol ∑ represents sum.

It is the capital letter S in the Greek alphabet and is pronounced “sigma.”

CENTER

MEAN: the average of a data set. The sample mean is denoted by x and is pronounced “x bar”.

= = x n n

x x

MEDIAN: the 50th^ percentile of a data set.

This says that 50% of the individuals have data values less than or equal to the median.

CASE: n is odd. The median is the middle number in the ordered list.

CASE: n is even. The median is the average of the two middle numbers in the ordered list.

The mean is affected by outliers and strong skewness. The median is more resistant (not affected)

to outliers and skewness. However, the mean provides more information than the median because it looks

at the value of every data. We prefer to use the mean as the measure of center except when we have

outliers or definite skewness. When that happens we use the median as the measure of center.

FIVE-NUMBER SUMMARY: minX, 1

Q , Median, Q 3 , maxX.

VARIANCE

Loosely speaking, the variance is the average distance the data values are distributed about the mean.

2

n

x x VAR

The formula squares the distances of the data from the mean so that we are adding positive numbers

together in the sum, otherwise we get ∑ ( x − x ) = 0. We divide by n − 1 (this is the “degrees of

freedom” as we shall see later on in inferential statistics) because if you have n − 1 of the differences then

the last one, or

th

n one, is just the additive inverse (since^ ∑ (^ x − x )^ = 0 ).

STANDARD DEVIATION: the square root of the variance.

The sample standard deviation is denoted by S (^) x.

Since we are summing up squares in the variance, we take the square root of the VAR in order that the

units of measure (meters, gallons, $, etc.) are not squared. This gives us the standard deviation of data

about the mean.

To get the values of (^) x , S (^) x and the Five-Number Summary on your calculator, press STAT go to the

CALC menu and press 1 for 1-VAR-STATS. State the list you are using and then press ENTER.

2

n

x x Sx

2 x

x

VAR S

S VAR