Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Log in Sign up

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Discrete and Continuous Random Variables: A Comprehensive Overview, Slides of Discrete Mathematics

Rice University Discrete Mathematics

A continuous random variable is characterized by its probability density function, a graph which has a total area of 1 beneath it: The probability of the random ...

Typology: Slides

2022/2023

Uploaded on 03/01/2023

leonpan 🇺🇸

4

(12)

286 documents

1 / 6

This page cannot be seen from the preview

Don't miss anything!

Discrete Random Variables

A dichotomous random variable takes only the values 0 and 1. Let X be such a random

variable, with Pr(X=1) = p and Pr(X=0) = 1-p . Then E[X] = p, and Var[X] = p(1-p) .

Consider a sequence of n independent experiments, each of which has probability p of “being

a success.” Let Xk = 1 if the k-th experiment is a success, and 0 otherwise. Then the total

number of successes in n trials is X = X1 +...+ Xn ; X is a binomial random variable, and

Pr(X=k) = n

kp(1 - p ) .

kn-k

⎛

⎝

⎜⎞

⎠

⎟⋅

E[X] = np , and Var[X] = np(1-p) . (These results follow from the properties of the expected

value and variance of sums of independent random variables.)

Next, consider a sequence of independent experiments, and let Y be the number of trials up to

(and including) the first success. Y is a geometric random variable, and

Pr(Y = k) = (1- p)p .

k-1

E[Y] = 1/p , and Var[Y] = (1-p)/p2 . (These results follow from the evaluation of infinite sums.)

A hypergeometric random variable Z results from drawing a sample of size n from a

population of size N containing g “good” members, and then counting the number of “good”

members in the sample:

.

n

N

zn

gN

z

g

= z)=Pr(Z

⎟

⎠

⎞

⎜

⎝

⎛

⎟

⎠

⎞

⎜

⎝

⎛

−

⋅

⎟

⎠

⎞

⎜

⎝

⎛

(This formula was used to compute the relevant probabilities in the “Bag R vs. Bag B” example.)

Discover Slides of Discrete Mathematics Rice University

Partial preview of the text

Download Discrete and Continuous Random Variables: A Comprehensive Overview and more Slides Discrete Mathematics in PDF only on Docsity!

Discrete Random Variables

A dichotomous random variable takes only the values 0 and 1. Let X be such a random

variable, with Pr(X=1) = p and Pr(X=0) = 1-p. Then E[X] = p, and Var[X] = p(1-p).

Consider a sequence of n independent experiments, each of which has probability p of “being

a success.” Let Xk = 1 if the k-th experiment is a success, and 0 otherwise. Then the total

number of successes in n trials is X = X 1 +...+ Xn ; X is a binomial random variable , and

Pr(X = k) =

n

k

⎛ p (1 - p )k n- k.

⎝⎜^

E[X] = np , and Var[X] = np(1-p). (These results follow from the properties of the expected

value and variance of sums of independent random variables.)

Next, consider a sequence of independent experiments, and let Y be the number of trials up to

(and including) the first success. Y is a geometric random variable , and

Pr(Y = k) = (1- p ) k-1p.

E[Y] = 1/p , and Var[Y] = (1-p)/p^2. (These results follow from the evaluation of infinite sums.)

A hypergeometric random variable Z results from drawing a sample of size n from a

population of size N containing g “good” members, and then counting the number of “good”

members in the sample:

n

N

n z

N g

z

g

Pr(Z=z)=

(This formula was used to compute the relevant probabilities in the “Bag R vs. Bag B” example.)

Continuous Random Variables

A continuous random variable is a random variable which can take any value in some interval. A

continuous random variable is characterized by its probability density function , a graph which has

a total area of 1 beneath it: The probability of the random variable taking values in any interval

is simply the area under the curve over that interval.

The normal distribution : This most-familiar of continuous probability distributions has the

classic “bell” shape (see the left-hand graph below). The peak occurs at the mean of the

distribution, i.e., at the expected value of the normally-distributed random variable with this

distribution, and the standard deviation (the square root of the variance) indicates the spread of

the bell, with roughly 68% of the area within 1 standard deviation of the peak.

The normal distribution arises so frequently in applications due to an amazing fact: If you take a

bunch of independent random variables (with comparable variances) and average them, the result

will be roughly normally distributed, no matter what the distributions of the separate variables

might be. (This is known as the “Central Limit Theorem”.) Many interesting quantities (ranging

from IQ scores, to demand for a retail product, to lengths of shoelaces) are actually a composite

of many separate random variables, and hence are roughly normally distributed.

If X is normal, and Y = aX+b, then Y is also normal, with E[Y] = a⋅E[X] + b and

StdDev[Y] = a⋅StdDev[X]. If X and Y are normal (independent or not), then X+Y and X-Y =

X+(-Y) are also normal (intuition: the sum of two bunches is a bunch). Any normally-

distributed random variable can be transformed into a “standard” normal random variable (with

mean 0 and standard deviation 1) by subtracting off its mean and dividing by its standard

deviation. Hence, a single tabulation of the cumulative distribution for a standard normal random

variable (attached) can be used to do probabilistic calculations for any normally-distributed

random variable.

The exponential distribution : Consider the time between successive incoming calls at a

switchboard, or between successive patrons entering a store. These “interarrival” times are

typically exponentially distributed. If the mean interarrival time is 1/λ (so λ is the mean arrival

rate per unit time), then the variance will be 1/λ^2 (and the standard deviation will be 1/λ ). The

right-hand graph above displays the graph of the exponential density function when λ = 1.

Generally, if X is exponentially distributed, then Pr(s < s < X ≤ t) = e-λs^ - e-λt^ (where

e ≈ 2.71828).

The main use of the beta distribution is to “fit” it to observed data when building a model of a

real-world phenomenon: The beta distribution can take a wide variety of shapes, as seen in the

graphs below.

The lefthand graph arises when α = 5 and β = 3 , the center when α = 1.5 and β = 3, and the

right when α = 0.5 and β = 0.5. Generally, if α > 2, the density has a slope of 0 at x = 0, if

2 > α > 1, the density is near-vertical near x = 0, and if 1 > α > 0, the density rises as x

approaches 0. (Analogous properties involving β hold when x is near 1.) When α = β = 1,

the beta distribution is uniform on [0,1].

Right-Tail Probabilities of the Normal Distribution

+0.01 +0.02 +0.03 +0.04 +0.05 +0.06 +0.07 +0.08 +0.09 +0.
0.0 0.5000 0.4960 0.4920 0.4880 0.4840 0.4801 0.4761 0.4721 0.4681 0.4641 0.
0.1 0.4602 0.4562 0.4522 0.4483 0.4443 0.4404 0.4364 0.4325 0.4286 0.4247 0.
0.2 0.4207 0.4168 0.4129 0.4090 0.4052 0.4013 0.3974 0.3936 0.3897 0.3859 0.
0.3 0.3821 0.3783 0.3745 0.3707 0.3669 0.3632 0.3594 0.3557 0.3520 0.3483 0.
0.4 0.3446 0.3409 0.3372 0.3336 0.3300 0.3264 0.3228 0.3192 0.3156 0.3121 0.
0.5 0.3085 0.3050 0.3015 0.2981 0.2946 0.2912 0.2877 0.2843 0.2810 0.2776 0.
0.6 0.2743 0.2709 0.2676 0.2643 0.2611 0.2578 0.2546 0.2514 0.2483 0.2451 0.
0.7 0.2420 0.2389 0.2358 0.2327 0.2296 0.2266 0.2236 0.2206 0.2177 0.2148 0.
0.8 0.2119 0.2090 0.2061 0.2033 0.2005 0.1977 0.1949 0.1922 0.1894 0.1867 0.
0.9 0.1841 0.1814 0.1788 0.1762 0.1736 0.1711 0.1685 0.1660 0.1635 0.1611 0.
1.0 0.1587 0.1562 0.1539 0.1515 0.1492 0.1469 0.1446 0.1423 0.1401 0.1379 0.
1.1 0.1357 0.1335 0.1314 0.1292 0.1271 0.1251 0.1230 0.1210 0.1190 0.1170 0.
1.2 0.1151 0.1131 0.1112 0.1093 0.1075 0.1056 0.1038 0.1020 0.1003 0.0985 0.
1.3 0.0968 0.0951 0.0934 0.0918 0.0901 0.0885 0.0869 0.0853 0.0838 0.0823 0.
1.4 0.0808 0.0793 0.0778 0.0764 0.0749 0.0735 0.0721 0.0708 0.0694 0.0681 0.
1.5 0.0668 0.0655 0.0643 0.0630 0.0618 0.0606 0.0594 0.0582 0.0571 0.0559 0.
1.6 0.0548 0.0537 0.0526 0.0516 0.0505 0.0495 0.0485 0.0475 0.0465 0.0455 0.
1.7 0.0446 0.0436 0.0427 0.0418 0.0409 0.0401 0.0392 0.0384 0.0375 0.0367 0.
1.8 0.0359 0.0351 0.0344 0.0336 0.0329 0.0322 0.0314 0.0307 0.0301 0.0294 0.
1.9 0.0287 0.0281 0.0274 0.0268 0.0262 0.0256 0.0250 0.0244 0.0239 0.0233 0.
2.0 0.0228 0.0222 0.0217 0.0212 0.0207 0.0202 0.0197 0.0192 0.0188 0.0183 0.
2.1 0.0179 0.0174 0.0170 0.0166 0.0162 0.0158 0.0154 0.0150 0.0146 0.0143 0.
2.2 0.0139 0.0136 0.0132 0.0129 0.0125 0.0122 0.0119 0.0116 0.0113 0.0110 0.
2.3 0.0107 0.0104 0.0102 0.0099 0.0096 0.0094 0.0091 0.0089 0.0087 0.0084 0.
2.4 0.0082 0.0080 0.0078 0.0075 0.0073 0.0071 0.0069 0.0068 0.0066 0.0064 0.
2.5 0.0062 0.0060 0.0059 0.0057 0.0055 0.0054 0.0052 0.0051 0.0049 0.0048 0.
2.6 0.0047 0.0045 0.0044 0.0043 0.0041 0.0040 0.0039 0.0038 0.0037 0.0036 0.
2.7 0.0035 0.0034 0.0033 0.0032 0.0031 0.0030 0.0029 0.0028 0.0027 0.0026 0.
2.8 0.0026 0.0025 0.0024 0.0023 0.0023 0.0022 0.0021 0.0021 0.0020 0.0019 0.
2.9 0.0019 0.0018 0.0018 0.0017 0.0016 0.0016 0.0015 0.0015 0.0014 0.0014 0.

Discrete and Continuous Random Variables: A Comprehensive Overview, Slides of Discrete Mathematics

Related documents

Partial preview of the text

Download Discrete and Continuous Random Variables: A Comprehensive Overview and more Slides Discrete Mathematics in PDF only on Docsity!

Discrete Random Variables

A dichotomous random variable takes only the values 0 and 1. Let X be such a random

variable, with Pr(X=1) = p and Pr(X=0) = 1-p. Then E[X] = p, and Var[X] = p(1-p).

Consider a sequence of n independent experiments, each of which has probability p of “being

a success.” Let Xk = 1 if the k-th experiment is a success, and 0 otherwise. Then the total

number of successes in n trials is X = X 1 +...+ Xn ; X is a binomial random variable , and

Pr(X = k) =

n

k

⎛ p (1 - p )k n- k.

⎝⎜^

E[X] = np , and Var[X] = np(1-p). (These results follow from the properties of the expected

value and variance of sums of independent random variables.)

Next, consider a sequence of independent experiments, and let Y be the number of trials up to

(and including) the first success. Y is a geometric random variable , and

Pr(Y = k) = (1- p ) k-1p.

E[Y] = 1/p , and Var[Y] = (1-p)/p^2. (These results follow from the evaluation of infinite sums.)

A hypergeometric random variable Z results from drawing a sample of size n from a

population of size N containing g “good” members, and then counting the number of “good”

members in the sample:

n

N

n z

N g

z

g

Pr(Z=z)=

(This formula was used to compute the relevant probabilities in the “Bag R vs. Bag B” example.)

Continuous Random Variables

A continuous random variable is a random variable which can take any value in some interval. A

continuous random variable is characterized by its probability density function , a graph which has

a total area of 1 beneath it: The probability of the random variable taking values in any interval

is simply the area under the curve over that interval.

The normal distribution : This most-familiar of continuous probability distributions has the

classic “bell” shape (see the left-hand graph below). The peak occurs at the mean of the

distribution, i.e., at the expected value of the normally-distributed random variable with this

distribution, and the standard deviation (the square root of the variance) indicates the spread of

the bell, with roughly 68% of the area within 1 standard deviation of the peak.

The normal distribution arises so frequently in applications due to an amazing fact: If you take a

bunch of independent random variables (with comparable variances) and average them, the result

will be roughly normally distributed, no matter what the distributions of the separate variables

might be. (This is known as the “Central Limit Theorem”.) Many interesting quantities (ranging

from IQ scores, to demand for a retail product, to lengths of shoelaces) are actually a composite

of many separate random variables, and hence are roughly normally distributed.

If X is normal, and Y = aX+b, then Y is also normal, with E[Y] = a⋅E[X] + b and

StdDev[Y] = a⋅StdDev[X]. If X and Y are normal (independent or not), then X+Y and X-Y =

X+(-Y) are also normal (intuition: the sum of two bunches is a bunch). Any normally-

distributed random variable can be transformed into a “standard” normal random variable (with

mean 0 and standard deviation 1) by subtracting off its mean and dividing by its standard

deviation. Hence, a single tabulation of the cumulative distribution for a standard normal random

variable (attached) can be used to do probabilistic calculations for any normally-distributed

random variable.

The exponential distribution : Consider the time between successive incoming calls at a

switchboard, or between successive patrons entering a store. These “interarrival” times are

typically exponentially distributed. If the mean interarrival time is 1/λ (so λ is the mean arrival

rate per unit time), then the variance will be 1/λ^2 (and the standard deviation will be 1/λ ). The

right-hand graph above displays the graph of the exponential density function when λ = 1.

Generally, if X is exponentially distributed, then Pr(s < s < X ≤ t) = e-λs^ - e-λt^ (where

e ≈ 2.71828).

The main use of the beta distribution is to “fit” it to observed data when building a model of a

real-world phenomenon: The beta distribution can take a wide variety of shapes, as seen in the

graphs below.

The lefthand graph arises when α = 5 and β = 3 , the center when α = 1.5 and β = 3, and the

right when α = 0.5 and β = 0.5. Generally, if α > 2, the density has a slope of 0 at x = 0, if

2 > α > 1, the density is near-vertical near x = 0, and if 1 > α > 0, the density rises as x

approaches 0. (Analogous properties involving β hold when x is near 1.) When α = β = 1,

the beta distribution is uniform on [0,1].

Right-Tail Probabilities of the Normal Distribution