






















Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Material Type: Notes; Class: Introduction to Image Processing and Analysis; Subject: Computer Science and Engineering; University: Arizona State University - Tempe; Term: Fall 2003;
Typology: Study notes
1 / 30
This page cannot be seen from the preview
Don't miss anything!























Chitta Baral
Department of Computer Science and Engineering
Arizona State University
Tempe, AZ 85287-5406 USA
http://www.public.asu.edu/
∼ cbaral/cse571-f99/
October 12, 2003
Probability, Bayes nets and Causality
means: belief in
under the assumption that
is known with
absolute certainty.
and
are independent.
and
are conditionally independent
given
Dawid’s notation: (
P than that of joint events.Bayesian philosophers see the conditional relationship as more basic
(^) (
A
Probability, Bayes nets and Causality
Goal:
to provide convenient means of expressing substantive assumptions
to facilitate economical representations of joint probability functions
to facilitate efficient inferences from observations
relationshipIdea: Directed acyclic graphs is used to represent causal or temporal
Basic decomposition scheme
x
1 , x
2 , x
3 ) =
x 1 ∧ x 2 ∧ x 3
x
1 | x
2 , x
3 ) P
(^) ( x
2
∧
x
3 ) =
x
1 | x
2 , x
3 ) P
(^) ( x
2 | x
3 ) P
(^) ( x
3 )
Probability, Bayes nets and Causality
Prediction and abduction
x
y
Need to compute
y
| x
).
y
| x
) =
p ( y, x
p ( x ) = ∑ s P
y, x, s
∑ y,s
y, x, s
An example:
The Network ∗
tampering
f ire
Directed Edges: (
tampering, alarm
f ire, alarm
f ire, smoke
alarm, leaving
leaving, report
Probability, Bayes nets and Causality
P local probability distributions:
(^) ( alarm
f ire, tampering
alarm
f ire,
tampering
alarm
f ire, tampering
alarm
f ire,
tampering
smoke
f ire
smoke,
f ire
leaving
alarm
leaving
alarm
report
leaving
report
leaving
Different kinds of inferences ∗
Diagnostic inferences:
f ire
report
Causal inferences (prediction):
leaving
tampering
Intercausal inferences:
f ire
alarm, tampering
Mixed inferences:
alarm
report, f ire
P An illustration:
(^) ( tampering
report, smoke
P (^) ( tampering,report,smoke
)
P
(^) ( report,smoke
)
Probability, Bayes nets and Causality
Similarly, we can also compute
f 1 ( alarm
T, tampering
f 1 ( alarm
F, tampering
(^) ) and
f 1 ( alarm
F, tampering
∑We can now write the denominator as: tampering,leaving,alarm
tampering
leaving
alarm
report
leaving
f 1 ( alarm, tampering
∑
tampering,leaving
tampering
report
leaving
∑
alarm
leaving
alarm
f 1 ( alarm, tampering
Let us denote
∑ alarm
leaving
alarm
f 1 ( alarm, tampering
by
f 2 ( leaving, tampering
). We can compute it as we compute
f 1
= The denominator can now be written as:
∑
tampering,leaving
tampering
report
leaving
f 2 ( leaving, tampering
∑
tampering
tampering
∑
leaving
report
leaving
f 2 ( leaving, tampering
Let us denote
∑
leaving
report
leaving
Probability, Bayes nets and Causality
f 2 ( leaving, tampering
) by
f 3 ( tampering
) and compute it like
the other
f i s.
∑The denominator can now be written as: tampering
tampering
f 3 ( tampering
Main Issues and challenges
Computing the conditional probabilities efficiently
Inference in general networks in NP-hard
(say for trees).Many efficient algorithms are defined for particular kind of networks ∗
Algorithm based on message passing architecture for trees.
Join-tree propagation
Cutset conditioning
Hybrid combinations of the above two
Approximation methods: stochastic simulation.
Probability, Bayes nets and Causality
distributions can not.)Causal networks can predict the effect of actions. (Simple joint
Stability and autonomy
the network without changing the others.Autonomy: It is possible to change one parent child relationship in
minimum of extra information.Stability: One can predict the effect of external interventions with
of autonomy, the change is local.merely the immediate changes implied by the intervention. Becausefunction for each of the many possible interventions, we specifyAutonomy and intervention: Instead of specifying a new probability
LetDefinition: Causal Bayesian network
v
) be a probability distribution on a set
of variables, and let
x ( v
) denote the distribution resulting from the intervention
do
x
) which sets any subset
of variables to constants
x
.
Denote by
the set of all interventional distributions
x ( v ),
12
Probability, Bayes nets and Causality
including
v
) which represents no intervention. A DAG
is said to
be a
causal Bayesian network
compatible with
iff the following
three conditions hold for every
x
x ( v
) is Markov relative to
x ( v i ) = 1, for all
i
∈
, whenever
v i
is consistent with
x
.
x ( v i | pa
i ) =
v i | pa
i ) for all
i
∈
, whenever
pa
i
is consistent
with
x
.
Properties:
for all
v
consistent with
x
:
x ( v
) =
∏
{ i | V i ∈ X
}
P
(^) ( v i | pa
i )
For all
i ,
P
(^) ( v i | pa
i ) =
pa
i ( v i )
parents, corresponds to causal effects.)(The above ensures, conditional probabilities with respect to
Probability, Bayes nets and Causality
1
remains true regardless of what we learn or know about the
season or the pavement.
Falling barometer predicts rain, does not explain it.
Probability, Bayes nets and Causality
Two views of non-determinism
due to our ignorance of the underlying boundary condition.Nature’s laws are deterministic, and randomness surfaces merelyLaplace’s (1814) conception of natural phenomena:
All relationships are inherently stochastic.Modern (quantum mechanical) conception of physics:
Why Pearl’s book uses Laplace’s conception of causality
sciencesbesides the fact that it is used in genetics, econometrics and social
It is more general. ∗
round;relationships (with stochastic inputs), but not the other wayEvery stochastic model can be emulated by many functional
Probability, Bayes nets and Causality
is a set of functions
f 1 ,... , f
n }
giving rise to a set of structural
equations of the form:
x
i
=
f i ( pa
i , u
i ),
i
= 1
,... , n
Types of queries that can be answered using functional causal models
: Would the pavement be slippery if we
find
the
sprinkler off?
: Would the pavement be slippery if we
make sure
that the sprinkler is off?
: Would the pavement be slippery
had
the
and the sprinkler is on?sprinkler been off, given that the pavement is in fact not slippery
Prediction using Markovian causal models:
member ofCausal diagram: A graph obtained by having edges from each
i
to
i .
called semi-Markovian.If the causal diagram is acyclic then the corresponding model is
Probability, Bayes nets and Causality
the values of
variables will be uniquely determined by the
variables.
The joint distribution
x
1 ,... , x
n ) is determined uniquely by
the distribution
u
) of the error variables.
is calledIf in addition the error terms are mutually independent, the model
Markovian
Theorem (Pearl and Verma): Every Markovian causal model
induces a distribution
x
1 ,... , x
n ) that satisfies the Markov
condition relative to the causal diagram
associated with
(^) , that
is each variable
i
is independent on all its non-descendants, given
its parents
i
in
Theorem (Drudgel and Simon): For every Bayesian network
characterized by a distribution
(^) , there exists a function model that
generates a distribution identical to
over the probabilistic specificationAdvantages of doing prediction using causal-functional specification ∗
When organizing knowledge using Markov causal models reliable