

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Cheatsheet for Probability Distributions
Typology: Cheat Sheet
1 / 2
This page cannot be seen from the preview
Don't miss anything!


notation U [a, b]
cdf
x − a b − a
for x ∈ [a, b]
b − a
for x ∈ [a, b]
expectation
(a + b)
variance
(b − a)^2
mgf
etb^ − eta t (b − a) story: all intervals of the same length on the distribution’s support are equally probable.
notation Gamma (k, θ)
θk^ xk−^1 e−θx Γ (k)
Ix> 0
Γ (k) =
0
xk−^1 e−xdx
expectation kθ
variance kθ^2
mgf (1 − θt)−k^ for t <
θ
ind. sum
∑^ n
i=
Xi ∼ Gamma
( (^) n ∑
i=
ki, θ
story: the sum of k independent exponentially distributed random variables, each of which has a mean of θ (which is equivalent to a rate parameter of θ−^1 ).
notation G (p)
cdf 1 − (1 − p)k^ for k ∈ N
pmf (1 − p)k−^1 p for k ∈ N
expectation
p
variance
1 − p p^2
mgf
pet 1 − (1 − p) et story: the number X of Bernoulli trials needed to get one success. Memoryless.
notation P oisson (λ)
cdf e−λ
∑k
i=
λi i!
pmf
λk k!
· e−λ^ for k ∈ N
expectation λ
variance λ
mgf exp
λ
et^ − 1
ind. sum
∑^ n
i=
Xi ∼ P oisson
( (^) n ∑
i=
λi
story: the probability of a number of events occurring in a fixed period of time if these events occur with a known average rate and independently of the time since the last event.
notation N
μ, σ^2
2 πσ^2
e−(x−μ)
(^2) /( 2 σ (^2) )
expectation μ
variance σ^2
mgf exp
μt +
σ^2 t^2
ind. sum
∑^ n
i=
Xi ∼ N
( (^) n ∑
i=
μi,
∑^ n
i=
σ^2 i
story: describes data that cluster around the mean.
notation N (0, 1)
cdf Φ(x) =
2 π
∫ (^) x
−∞
e−t
(^2) / 2 dt
2 π
e−x
(^2) / 2
expectation
λ variance
λ^2 mgf exp
t^2 2
story: normal distribution with μ = 0 and σ = 1.
notation exp (λ)
cdf 1 − e−λx^ for x ≥ 0
pdf λe−λx^ for x ≥ 0
expectation
λ
variance
λ^2 mgf
λ λ − t
ind. sum
∑^ k
i=
Xi ∼ Gamma (k, λ)
minimum ∼ exp
( (^) k ∑
i=
λi
story: the amount of time until some specific event occurs, starting from now, being memoryless.
notation Bin(n, p)
cdf
∑^ k
i=
(n
i
pi^ (1 − p)n−i
pmf
(n
i
pi^ (1 − p)n−i
expectation np
variance np (1 − p)
mgf
1 − p + pet
)n
story: the discrete probability distribution of the number of successes in a sequence of n independent yes/no experiments, each of which yields success with probability p.
FX (x) = P (X ≤ x)
FX (x) =
−∞
fX (t) dt ∫ (^) ∞
−∞
fX (t) dt = 1
fX (x) =
d dx
FX (x)
The function X∗^ : [0, 1] → R for which for any p ∈ [0, 1], FX
X∗^ (p)−
≤ p ≤ FX (X∗^ (p))
FX∗^ = FX
E (X∗) = E (X)
0
X∗(p)dp
−∞
FX (t) dt +
0
(1 − FX (t)) dt
−∞
xfX xdx
E (g (X)) =
−∞
g (x) fX xdx
E (aX + b) = aE (X) + b
Var (X) = E
Var (X) = E
Var (aX + b) = a^2 Var (X)
σ (X) =
Var (X)
Cov (X, Y ) = E (XY ) − E (X) E (Y )
Cov (X, Y ) = E ((X − E (x)) (Y − E (Y )))
Var (X + Y ) = Var (X) + Var (Y ) + 2Cov (X, Y )
ρX,Y =
Cov (X, Y ) σX , σY
MX (t) = E
etX^
E (Xn) = M (^) X(n )(0)
MaX+b (t) = etbMaX (t)
FX,Y (x, y) = P (X ≤ x, Y ≤ y)
B
fX,Y (s, t) dsdt
FX,Y (x, y) =
∫ (^) x
−∞
∫ (^) y
−∞
fX,Y (s, t) dtds ∫ (^) ∞
−∞
−∞
fX,Y (s, t) dsdt = 1
FX (a) =
∫ (^) a
−∞
−∞
fX,Y (s, t) dtds
FY (b) =
∫ (^) b
−∞
−∞
fX,Y (s, t) dsdt
fX (s) =
−∞
fX,Y (s, t)dt
fY (t) =
−∞
fX,Y (s, t)ds
E (ϕ (X, Y )) =
R^2
ϕ (x, y) fX,Y (x, y) dxdy
P (X ≤ x, Y ≤ y) = P (X ≤ x) P (Y ≤ y)
FX,Y (x, y) = FX (x) FY (y) fX,Y (s, t) = fX (s) fY (t) E (XY ) = E (X) E (Y )
Var (X + Y ) = Var (X) + Var (Y ) Independent events: P (A ∩ B) = P (A) P (B)
bayes P (A | B) =
fX|Y =y (x) =
fX,Y (x, y) fY (y)
fX|Y =n (x) =
fX (x) P (Y = n | X = x) P (Y = n)
FX|Y =y =
∫ (^) x
−∞
fX|Y =y (t) dt
E (X | Y = y) =
−∞
xfX|Y =y (x) dx
E (E (X | Y )) = E (X) P (Y = n) = E (IY =n) = E (E (IY =n | X))
lim sup An = {An i.o.} =
m=
n=m
An
lim inf An = {An eventually} =
m=
n=m
An
lim inf An ⊆ lim sup An (lim sup An)c^ = lim inf Acn (lim inf An)c^ = lim sup Acn
P (lim sup An) = lim n→∞
n=m
An
P (lim inf An) = lim n→∞
n=m
An
n=
P (An) < ∞ ⇒ P (lim sup An) = 0
And if An are independent: ∑^ ∞
n=
P (An) = ∞ ⇒ P (lim sup An) = 1
notation Xn
p −→ X
meaning lim n→∞
P (|Xn − X| > ε) = 0
notation Xn
D −→ X
meaning lim n→∞
Fn (x) = F (x)
notation Xn −a.s.−−→ X
meaning P
lim n→∞
Xn = X
n=
P (|Xn − X| > ε) < ∞ (by B.C.)
notation Xn
Lp −−→ X
meaning lim n→∞
E (|Xn − X|p) = 0
Lq −−→ ⇒ q>p≥ 1
Lp −−→
−^ a.s.−−→ ⇒ −−p→ ⇒ −−D→
If Xn
D −→ c then Xn
p −→ c If Xn
p −→ X then there exists a subsequence nk s.t. Xnk −^ a.s.−−→ X
If Xi are i.i.d. r.v.,
weak law Xn
p −→ E (X 1 )
strong law Xn
a.s. −−−→ E (X 1 )
Sn − nμ σ
n
D −→ N (0, 1)
If tn → t, then
P
Sn − nμ σ
n
≤ tn
→ Φ (t)
P (|X| ≥ t) ≤
t
P (|X − E (X)| ≥ ε) ≤
Var (X) ε^2
Let X ∼ Bin(n, p); then: P (X − E (X) > tσ (X)) < e−t
(^2) / 2
Simpler result; for every X: P (X ≥ a) ≤ MX (t) e−ta
for ϕ a convex function, ϕ (E (X)) ≤ E (ϕ (X))
n=
P (Y > n) < ∞ (Y ≥ 0)
n=
P (X > n) (X ∈ N)
X ∼ U (0, 1) ⇐⇒ − ln X ∼ exp (1)
For ind. X, Y , Z = X + Y :
fZ (z) =
−∞
fX (s) fY (z − s) ds
If A is in the tail σ-algebra Ft, then P (A) = 0 or P (A) = 1
cdf of Gamma distribution:∫ t
0
θk^ xk−^1 e−θk (k − 1)!
dx
This cheatsheet was made by Peleg Michaeli in January 2010, using LATEX. version: 1. comments: [email protected]