


















Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Material Type: Notes; Professor: Subramanian; Class: AN INTEGRATED INTRODUCTION TO COMPUTATIONAL AND PROBLEM SOLVING; Subject: Computer Science; University: Rice University; Term: Fall 2008;
Typology: Study notes
1 / 26
This page cannot be seen from the preview
Don't miss anything!



















1 Lecture derived from Ullman and Rajaraman
Link analysis algorithms
Simple recursive formulation
An example y = y /2 + a / a = y /2 + m m = a / Yahoo Amazon M’soft a m a/ a/ y/ m y y/
Matrix formulation of flow problem Matrix M has N rows and N columns, one for each page in set of pages to be ranked. (^) If page j has n outlinks (^) if j points to i, M ij=1/n (^) else M ij= 0 (^) Let r be a vector of length N (^) r i is the importance score of page (^) |r| = 1
Power iteration
0 = [1/N,….,1/N] T
= Mr
Stop when | r k+
Power iteration example Yahoo Amazon M’soft y 1/2 1/2 0 a 1/2 0 1 m 0 1/2 0 y a m y a = m
The stationary distribution (^) Where is the surfer at time t+1? (^) Follows a link uniformly at random (^) p(t+1) = Mp(t) (^) Suppose the random walk reaches a state such that p(t+1) = Mp(t) = p(t) (^) Then p(t) is called a stationary distribution for the random walk (^) Our rank vector r satisfies r = Mr (^) So it is a stationary distribution for the random surfer
Existence and uniqueness of solution (^) For graphs that satisfy certain conditions, the stationary distribution is unique and eventually will be reached no matter what the initial probability distribution at time t = 0. (^) The r vector is the scaled page rank score.
Implementation (contd.)
(^) r i+1 = Mri
Defining hubs and authorities A good hub links to many good authorities (^) A good authority is linked from many good hubs (^) Model using two scores for each node Hub score and Authority score Represented as vectors h and a
Defining link structure using transition matrix A
T
T