Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Log in Sign up

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Markov Chains: Contraction Approach and Limiting Probability Vector, Study notes of Operational Research

Columbia University in the City of New York Operational Research

Lecture notes from a university course on markov chains, focusing on the contraction approach to find the limiting probability vector π. The theorem stating that for an aperiodic irreducible finite-state markov chain, the probability vector at time n converges to π as n goes to infinity. The proof is based on the banach fixed-point theorem, and the document discusses the conditions for irreducibility and aperiodicity. The document also includes examples and lemmas to illustrate the concepts.

Typology: Study notes

2010/2011

Uploaded on 10/12/2011

juhy 🇺🇸

4.3

(6)

246 documents

1 / 5

This page cannot be seen from the preview

Don't miss anything!

IEOR 4106, Spring 2011, Professor Whitt

Class Lecture Notes: Thursday, February 10.

Markov Chains

The Contraction approach to π=πP

The limit for aperiodic irreducible finite-state DTMC’s.

There is a nice simple limit for aperiodic irreducible finite-state Markov chains. For any

initial probability vector u≡(u1, . . . , um), the probability vector at time nis

P(Xn=j) = (uP n)j=

m

X

i=1

uiPn

i,j .

The key limiting result is

Theorem 0.1 If Pis the transition matrix of an aperiodic irreducible finite-state Markov

chain with transition matrix P, then, for any initial probability vector u,

uP n→πas n→ ∞ ,

where the limiting probability vector πis the unique stationary probability vector, i.e., the

unique solution to the fixed-point equation

π=πP or πj=

m

X

i=1

πiPi,j for all j ,

where πj≥0for all jand Pjπj= 1.

Note the conditions: Of course, irreducibility is essential. And aperiodicity is essential to get

full convergence, as opposed to convergence of averages, or convergence through appropriate

subsequences. The method of proof here is designed to apply to finite-state chains. The

proof extends to infinite-state chains under the condition that there is some state jsuch that

Pi,j ≥² > 0 for all states i, or Pk

i,j ≥² > 0 for some k. This is a strong extra condition saying

that there is a state jsuch that there is a probability of at least ² > 0 of going to jin one step

(or in ksteps, as a weaker version of the same condition), from any other state. With that

extra condition, we not only get convergence, we get convergence quickly, geometrically fast.

We actually provide a proof without this condition, but we do not get such quick convergence

unless the condition holds.

The Contraction Proof.

One way to prove this result and others is to apply renewal theory. An alternative way to

prove the theorem is to consider the transition matrix Pas an operator on the space of all

probability vectors, here taken to be of dimension m, corresponding to there being mstates.

An operator on a space maps the space into itself. If uis a probability vector, then Pmaps

Discover Study notes of Operational Research Columbia University in the City of New York

Partial preview of the text

Download Markov Chains: Contraction Approach and Limiting Probability Vector and more Study notes Operational Research in PDF only on Docsity!

IEOR 4106, Spring 2011, Professor Whitt

Class Lecture Notes: Thursday, February 10.

Markov Chains

The Contraction approach to π = πP

The limit for aperiodic irreducible finite-state DTMC’s. There is a nice simple limit for aperiodic irreducible finite-state Markov chains. For any initial probability vector u ≡ (u 1 ,... , um), the probability vector at time n is

P (Xn = j) = (uP n)j =

∑^ m

i=

uiP (^) i,jn.

The key limiting result is

Theorem 0.1 If P is the transition matrix of an aperiodic irreducible finite-state Markov chain with transition matrix P , then, for any initial probability vector u,

uP n^ → π as n → ∞ ,

where the limiting probability vector π is the unique stationary probability vector, i.e., the unique solution to the fixed-point equation

π = πP or πj =

∑^ m

i=

πiPi,j for all j ,

where πj ≥ 0 for all j and

j πj^ = 1. Note the conditions: Of course, irreducibility is essential. And aperiodicity is essential to get full convergence, as opposed to convergence of averages, or convergence through appropriate subsequences. The method of proof here is designed to apply to finite-state chains. The proof extends to infinite-state chains under the condition that there is some state j such that Pi,j ≥ ≤ > 0 for all states i, or P (^) i,jk ≥ ≤ > 0 for some k. This is a strong extra condition saying that there is a state j such that there is a probability of at least ≤ > 0 of going to j in one step (or in k steps, as a weaker version of the same condition), from any other state. With that extra condition, we not only get convergence, we get convergence quickly, geometrically fast. We actually provide a proof without this condition, but we do not get such quick convergence unless the condition holds.

The Contraction Proof.

One way to prove this result and others is to apply renewal theory. An alternative way to prove the theorem is to consider the transition matrix P as an operator on the space of all probability vectors, here taken to be of dimension m, corresponding to there being m states. An operator on a space maps the space into itself. If u is a probability vector, then P maps

u into the probability vector uP , corresponding to the probability vector starting with u and then taking one step according to P , i.e.,

(uP )j =

∑^ m

i=

uiPi,j for all j.

We want the underlying space to be a complete metric space and the operator to be a contrac- tion map. Then we can apply the Banach fixed-point theorem, also called the Banach-Picard fixed-point theorem or the contraction fixed-point theorem; see pages 220-221 from the blue Rudin book, Principles of Mathematical Analysis, posted on line. The proof can be done in two steps:

step 1. In the first step, you prove that some power of P has all positive entries (using the assump- tion that P is an m × m transition matrix of an irreducible aperiodic Markov chain).

Example 0.1 We remark at the outset that the worst case has P 1 , 2 = P 2 , 3 = · · · = Pm− 1 ,m = 1, while Pm, 1 > 0, Pm, 2 > 0 and Pm,j = 0 for all j ≥ 3. Note that, for this example, P 1 k, 1 > 0 for k = m, k = 2m, k = 2m−1, k = 3m, k = 3m−1, k = 3m−2, k = 4m, k = 4m−1, k = 4m−2, k = 4m − 3, and so forth. In this example, we have P 1 k, 1 > 0 for k = (m − 2)m, but P 1 k, 1 = 0 for k = (m − 2)m + 1, but then we have P 1 k, 1 > 0 for all k ≥ (m − 1)m − (m − 2) = m^2 − 2 m + 2.

Lemma 0.1 For any states i and j 6 = i, P (^) i,jk > 0 for some k, with 1 ≤ k ≤ m − 1.

Proof. We use the fact that the chain is irreducible. Let Si,k be the set of states reachable from state i in at most k steps, with Si, 0 ≡ {i}. By the irreducibility, the sets Si,k have to be strictly increasing in k for every k until Si,k = { 1 , 2 ,... , m}, the full state space. Otherwise, the DTMC would not be irreducible. Since there are only m − 1 other states, all these other states have to be reached in at most m − 1 steps. If the increase is by more than a single state, then the number k will be strictly less than m − 1. Hence, indeed, for any states i and j 6 = i, P (^) i,jk > 0 for some k, with 1 ≤ k ≤ m − 1. But, in general, the value k depends on the states i and j. As a corollary to the last conclusion, we deduce the following:

Corollary 0.1 For any state i, there necessarily is a k ≤ m such that P (^) i,ik > 0.

Proof. To start, there must be some j such that Pi,j > 0. Then, by the reasoning above, P (^) j,il > 0 for some l ≤ m − 1. But then P (^) i,il+1 > Pi,j P (^) j,il > 0. Finally, since l ≤ m − 1, necessarily l + 1 ≤ m. Now we apply the aperiodicity to obtain a further result.

Lemma 0.2 There exists a constant n 0 such that P (^) i,ik > 0 for all k ≥ n 0.

Proof. We have shown above that P (^) i,ik > 0 for some k with 1 ≤ k ≤ m. Since the chain is aperiodic, there necessarily exist constants k 1 and k 2 such that the greatest common de- nominator of k 1 and k 2 is 1, and P (^) i,ik^1 > 0 and P (^) i,ik^2 > 0. But then P (^) i,ik > 0 for all k ≥ k 1 k 2.

We can do better in this last step, but the reasoning is somewhat complicated. We have the following sharper result from Section 2.4 of Seneta, Non-negative Matrices and Markov Chains, second edition, Springer, 1981, in particular from Theorem 2.9 on page 58.

and vP = (0. 01 , 0. 95 , 0. 01 , 0. 01 , 0. 01 , 0 .01). There are three natural norms to consider on Rm: the l∞, l 2 and l 1 norms:

||x||∞ ≡ max {|xi|} ,

||x|| 2 ≡

( (^) m ∑

i=

|xi|^2

||x|| 1 ≡

∑^ m

i=

|xi|. (1)

It is not difficult to see that

||u − v||∞ = 1 / 3 , ||u − v|| 2 =

||u − v|| 1 = 2 (2)

and It is not difficult to see that

||uP − vP ||∞ = 0. 94 , ||uP − vP || 2 =

2 × (0.94)^2 = 1. 329 ,

||uP − vP || 1 = 2 × 0 .94 = 1. 88. (3)

By this example, we prove that the norms ||x||∞ and ||x|| 2 do not work. However, it turns out that the norm ||x|| 1 does work.

Theorem 0.3 Let P be a m×m Markov-chain transition matrix associated with an irreducible Markov chain. Assume that Pi, 1 ≥ ≤ > 0 for all i, 1 ≤ i ≤ m. Then

||uP − vP || 1 ≤ (1 − ≤)||u − v|| 1.

Proof. Note that

||uP − vP || 1 =

∑^ m

j=

∑^ m

i=

uiPi,j −

∑^ m

i=

viPi,j |.

Now write Pi, 1 ≡ ≤ + Qi, 1 and Pi,j ≡ Qi,j for j 6 = i , for all i.

Then Q is a nonnegative m × m matrix with row sums 1 − ≤. Now observe that

∑^ m

j=

∑^ m

i=

uiPi,j −

∑^ m

i=

viPi,j | = |

∑^ m

i=

ui(≤ + Qi, 1 ) −

∑^ m

i=

vi(≤ + Qi, 1 )| +

∑^ m

j=

∑^ m

i=

uiQi,j −

∑^ m

i=

viQi,j |

∑^ m

i=

uiQi, 1 −

∑^ m

i=

viQi, 1 | +

∑^ m

j=

∑^ m

i=

uiQi,j −

∑^ m

i=

viQi,j |

∑^ m

j=

∑^ m

i=

uiQi,j −

∑^ m

i=

viQi,j |

∑^ m

j=

∑^ m

i=

|ui − vi|Qi,j =

∑^ m

i=

∑^ m

j=

|ui − vi|Qi,j = (1 − ≤)||u − v|| 1 .(4)

with the equality in the second line holding because the epsilon terms can be dropped out, using ∑m

i=

(ui − vi) = 0 ,

because u and v are probability vectors, summing to 1. We remark that we would obtain a stronger contraction property if we applied the above reasoning to all m columns. We would strengthen the assumption in the theorem to: Assume that, for each column j, Pi,j ≥ ≤j ≥ 0 for all i, 1 ≤ i ≤ m, with ≤j > 0 for at least one j. We would then use the reasoning above to obtain the stronger conclusion:

||uP − vP || 1 ≤ c||u − v|| 1 ,

where

c = (1 −

∑^ m

j=

≤j ) < 1.

It is not difficult to see that we must have

∑m j=1 ≤j^ <^ 1.

Markov Chains: Contraction Approach and Limiting Probability Vector, Study notes of Operational Research

Related documents

Partial preview of the text

Download Markov Chains: Contraction Approach and Limiting Probability Vector and more Study notes Operational Research in PDF only on Docsity!

IEOR 4106, Spring 2011, Professor Whitt

Class Lecture Notes: Thursday, February 10.

Markov Chains

2 × (0.94)^2 = 1. 329 ,