Error-Correcting Codes Lab: Proof of Markov Property and Probability Computation, Slides of Applied Mathematics

A part of the lecture notes for a university course on error-correcting codes. It includes a proof of the markov property for a given set of variables and an explanation of how to compute the required probabilities using the given lemmas. The document also introduces some notation for simplifying the probability computations.

Typology: Slides

2010/2011

Uploaded on 11/09/2011

mjforever
mjforever 🇺🇸

4.8

(25)

254 documents

1 / 2

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
18.413: Error-Correcting Codes Lab February 26, 2004
Lecture 7
Lecturer: Daniel A. Spielman
To begin, let me point out that there was a typo in the lecture notes from last lecture. Lemma 6.4.1
should have said:
Lemma 7.0.1.
Pext [X1=a1|Y2Y3=b2b3] = X
a2:(a1,a2)∈C12
P[X2=a2|X1=a1]Pext [X2=a2|Y2=b2]Pext [X2=a2|Y3=b3].
7.1 Markov Property
Last lecture, we considered three variables X1,X2and X3chosen uniformly from those that satisfy
(X1, X2) C12 A1×A2and and (X2, X3) C23 A2×A3.
We claimed that the variables (X1, X2, X3) then satisfy what the book calls the “Markov” property.
That is, for all a1, a2, a3,
P [X1X3=a1a3|X2=a2] = P [X1=a1|X2=a2] P [X3=a3|X2=a2].
I’ll now sketch a proof. It basically follows by by recalling the definition of the probability of an
even conditioned on X2=a2. We first note that the set of choices for (X1, X2, X3) given that
X2=a2is
Sa2
def
={(X1, a2, X3) : (X1, a2) C12 and (a2, X3) C23}.
Conditioning on X2=a2, we obtain a sample chosen uniformly from Sa2. Thus, for (a1,a2, a3)
Sa2,
P [(X1, X2, X3) = (a1, a2, a3)|X2=a2] = 1
|Sa2|.
Note that
|Sa2|=|{a1: (a1, a2) C12}| |{a3: (a2, a3) C23}|
The claim now follows from observing that
P [(X1, X2) = (a1, a2)|X2=a2] = |{a3: (a2, a3) C23}|
|Sa2|=1
|{a1: (a1, a2) C12}|
and
P [(X2, X3) = (a2, a3)|X2=a2] = |{a1: (a1, a2) C12}|
|Sa2|=1
|{a3: (a2, a3) C23}| .
7-1
pf2

Partial preview of the text

Download Error-Correcting Codes Lab: Proof of Markov Property and Probability Computation and more Slides Applied Mathematics in PDF only on Docsity!

18.413: Error-Correcting Codes Lab February 26, 2004

Lecture 7

Lecturer: Daniel A. Spielman

To begin, let me point out that there was a typo in the lecture notes from last lecture. Lemma 6.4.

should have said:

Lemma 7.0.1.

P

ext

[X 1

= a 1

|Y

2

Y

3

= b 2

b 3

] =

a 2 :(a 1 ,a 2 )∈C 12

P [X

2

= a 2

|X

1

= a 1

] P

ext

[X 2

= a 2

|Y

2

= b 2

] P

ext

[X 2

= a 2

|Y

3

= b 3

].

7.1 Markov Property

Last lecture, we considered three variables X 1

, X

2

and X 3

chosen uniformly from those that satisfy

(X

1

, X

2

) ∈ C

12

⊆ A

1

× A

2

and and (X 2

, X

3

) ∈ C

23

⊆ A

2

× A

3

We claimed that the variables (X 1

, X

2

, X

3

) then satisfy what the book calls the “Markov” property.

That is, for all a 1 , a 2 , a 3 ,

P [X

1

X

3

= a 1

a 3

|X

2

= a 2

] = P [X

1

= a 1

|X

2

= a 2

] P [X

3

= a 3

|X

2

= a 2

].

I’ll now sketch a proof. It basically follows by by recalling the definition of the probability of an

even conditioned on X 2 = a 2. We first note that the set of choices for (X 1 , X 2 , X 3 ) given that

X

2

= a 2

is

Sa 2

def

= {(X 1 , a 2 , X 3 ) : (X 1 , a 2 ) ∈ C 12 and (a 2 , X 3 ) ∈ C 23 }.

Conditioning on X 2 = a 2 , we obtain a sample chosen uniformly from Sa 2

. Thus, for (a 1 , a 2 , a 3 ) ∈

S

a 2

P [(X

1

, X

2

, X

3

) = (a 1

, a 2

, a 3

)|X

2

= a 2

] =

|Sa 2

Note that

|S

a 2

| = |{a 1

: (a 1

, a 2

) ∈ C

12

}| |{a 3

: (a 2

, a 3

) ∈ C

23

The claim now follows from observing that

P [(X 1 , X 2 ) = (a 1 , a 2 )|X 2 = a 2 ] =

|{a 3

: (a 2

, a 3

) ∈ C

23

|S

a 2

|{a 1

: (a 1

, a 2

) ∈ C

12

and

P [(X

2

, X

3

) = (a 2

, a 3

)|X

2

= a 2

] =

|{a 1 : (a 1 , a 2 ) ∈ C 12 }|

|Sa 2

|{a 3 : (a 2 , a 3 ) ∈ C 23 }|

Lecture 7: February 26, 2004 7-

7.2 Simplifying Probability Computation

First, lets establish that the fundamental quantities we are interested in have the form

P [X

i

= a i

|E] ,

where E is some event, usually a union of the observed variables. We will typically want these

values for all a i

, so we really want a vector

(P [Xi = a 1 |E] , P [Xi = a 2 |E] ,... , P [Xi = an|E] , ) ,

where a 1 ,... , an are the symbols in the alphabet Ai. We will denote such a vector by

P [X

i

|E].

Using this notation, and letting denote componentwise product ((a, b) (c, d) = (ac, bd)), we

have

P

post

[Xi|E] =

P

prior

[Xi]

P

ext

[Xi|E]

Before returning to our probability computations for (X 1

, X

2

, X

3

), I’ll also introduce the simpler

notation P

ext

[Xi = ai|Yi] for P

ext

[Xi = ai|Yi = bi]. We will use this notation whenever bi is fixed

throughout our computation, which it generally is as it is what was received.

We then have, from Lemma 6.2.1,

P

post

[X

1

|Y

1

Y

2

Y

3

] =

P

prior

[X

1

]

P

ext

[X

1

|Y

1

]

P

ext

[X

1

|Y

2

Y

3

] ,

and, from Lemma 7.0.1,

P

ext

[X 1

= a 1

|Y

2

Y

3

] =

a 2 :(a 1 ,a 2 )∈C 12

P [X

2

= a 2

|X

1

= a 1

] P

ext

[X 2

= a 2

|Y

2

] P

ext

[X 2

= a 2

|Y

3

].

Using these formulas, we go through the following steps to compute

P

post

[X 1 |Y 1 Y 2 Y 3 ].

  1. Compute

P

ext

[X 2 |Y 3 ]. This computation only depends upon Y 3 , and comes from the formula:

P

ext [X 2

= a 2

|Y

3

] ∼

a 3 :(a 2 ,a 3 )∈C 23

P [X

3

= a 3

|X

2

= a 2

] P [Y

3

|X

3

= a 3

].

  1. Compute

P

ext

[X

2

= |Y

2

], and compute

P

ext

[X 2 = |Y 2 ] P

ext

[X 2 = a 2 |Y 3 ].

  1. For each a 1

, compute

P [X 2 |X 1 = a 1 ]

P

ext

[X 2 |Y 2 ]

P

ext

[X 2 |Y 3 ] ,

and then sum the resulting vector.

  1. Take the output of the previous step, and product it with

P

prior

[X 1 ]

P

ext

[X 1 |Y 1 ].

If you look at the flow of this computation, it can be understood as a vector being passed from X 3

to X 2 between steps 1 and 2, and a vector begin passed from X 2 to X 1 between steps 3 and 4.