Lecture Notes on Distributed Systems: Replication and Replica Management

2002 M.T. Harandi and J. Hou

Student Notes Pages





2002, M. T. Harandi and J. Hou (modified: I. Gupta)

Lecture 23-1

Computer Science 425

Distributed Systems

Lecture 23

Replication Control

Reading: Section 15.1-15.3






Replication



Enhancing Services by replicating data



Load Balancing



Example: Workload is shared between the servers by binding all the server IP addresses to the service’s DNS name. A DNS lookup of the site returns one of the servers’ IP addresses, chosen in round-robin fashion.



Fault Tolerance



Under the fail-stop model, if up to f of f+1 servers crash, at least one remains to supply the service.



Increased Availability



Service may not be available when servers fail or when the network is partitioned.

P: probability that one server fails. 1 – P = availability of the service. E.g., P = 5% => service is available 95% of the time.

P^n: probability that all n servers fail (assuming independent failures). 1 – P^n = availability of the service. E.g., P = 5%, n = 3 => service is available 99.9875% of the time.
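The availability arithmetic above can be checked with a few lines of Python. This is a minimal sketch that assumes replicas fail independently with the same probability; the function name `availability` and its parameters are illustrative, not part of the lecture.

```python
def availability(p_fail: float, n: int) -> float:
    """Probability that at least one of n replicas is up.

    Assumes each replica fails independently with probability p_fail.
    """
    if not 0.0 <= p_fail <= 1.0:
        raise ValueError("p_fail must be a probability in [0, 1]")
    if n < 1:
        raise ValueError("n must be at least 1")
    # The service is down only when all n replicas are down at once.
    return 1.0 - p_fail ** n


for n in (1, 2, 3):
    print(f"n={n}: available {availability(0.05, n):.4%} of the time")
```

With P = 5%, one server gives 95% availability and three servers give 99.9875%, since 0.05^3 = 0.000125.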






Basic Mode of Replication



Replication Transparency

User/client need not know that multiple physical copies of data exist.



Replication Consistency

Data is consistent on all of the replicas (or is in the process

of becoming consistent)

[Figure: a client reaches the service through a front end (FE); the service is provided by replica managers (RMs) running on servers.]






Replication Management



Request Communication



Requests can be made to a single RM or to multiple RMs



Coordination: The RMs decide



whether the request is to be applied



the order of requests



FIFO ordering: If an FE issues r then r’, then any correct RM handles r and then r’.

Causal ordering: If the issue of r “happened before” the issue of r’, then any correct RM handles r and then r’.

Total ordering: If a correct RM handles r and then r’, then any correct RM handles r and then r’.



Execution: The RMs execute the request

tentatively.
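As an illustration of the coordination step, FIFO ordering is commonly enforced with per-sender sequence numbers and a hold-back queue. The sketch below assumes each front end numbers its requests 0, 1, 2, ...; the class `FifoReplicaManager` and its fields are hypothetical names for this example, and causal or total ordering would need more machinery than shown here.

```python
from collections import defaultdict


class FifoReplicaManager:
    """Handles each front end's requests in the order that front end issued them."""

    def __init__(self):
        self.next_seq = defaultdict(int)    # next sequence number expected per FE
        self.held_back = defaultdict(dict)  # FE id -> {seq: request} that arrived early
        self.handled = []                   # (FE id, request) in handling order

    def receive(self, fe_id, seq, request):
        """Accept a request that the network may deliver out of order."""
        if seq < self.next_seq[fe_id] or seq in self.held_back[fe_id]:
            return  # duplicate: already handled or already queued
        self.held_back[fe_id][seq] = request
        # Handle every request from this FE that is now in sequence.
        while self.next_seq[fe_id] in self.held_back[fe_id]:
            ready = self.held_back[fe_id].pop(self.next_seq[fe_id])
            self.handled.append((fe_id, ready))
            self.next_seq[fe_id] += 1


rm = FifoReplicaManager()
rm.receive("fe1", 1, "r'")  # arrives early, so it is held back
rm.receive("fe1", 0, "r")   # releases r, and then r'
print(rm.handled)
```

Even though r’ arrives first, the RM handles r and then r’, which is what FIFO ordering requires.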






Replication Management



Agreement: The RMs attempt to reach

consensus on the effect of the request.



E.g., two-phase commit through a coordinator



If this succeeds, the effect of the request is made permanent



Response



One or more RMs respond to the front end.



In the case of the fail-stop model, the FE returns the first response to arrive.
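The agreement step can be sketched as a toy two-phase commit. This is only an in-memory illustration under strong assumptions: no crashes, no timeouts, no logging, and a coordinator that calls the RMs directly. The names `ReplicaManager`, `prepare`, `commit`, `abort`, and `two_phase_commit` are hypothetical.

```python
class ReplicaManager:
    """Hypothetical RM that holds a tentative update until the coordinator decides."""

    def __init__(self, name, can_commit=True):
        self.name = name
        self.can_commit = can_commit  # whether this RM will vote yes
        self.tentative = None
        self.committed = []

    def prepare(self, update):
        # Phase 1: execute the request tentatively and vote.
        if self.can_commit:
            self.tentative = update
        return self.can_commit

    def commit(self):
        # Phase 2, all voted yes: make the tentative effect permanent.
        self.committed.append(self.tentative)
        self.tentative = None

    def abort(self):
        # Phase 2, some RM voted no: discard the tentative effect.
        self.tentative = None


def two_phase_commit(rms, update):
    """Coordinator: commit only if every RM votes yes, otherwise abort everywhere."""
    votes = [rm.prepare(update) for rm in rms]
    if all(votes):
        for rm in rms:
            rm.commit()
        return True
    for rm in rms:
        rm.abort()
    return False


rms = [ReplicaManager("A"), ReplicaManager("B")]
ok = two_phase_commit(rms, "x=1")
bad = two_phase_commit(rms + [ReplicaManager("C", can_commit=False)], "x=2")
print(ok, bad, rms[0].committed)
```

The second request is aborted at every RM because one participant votes no, so only the first update becomes permanent.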






Group Communication



“Member”= process (e.g., RM)



Static Groups: group membership is pre-defined



Dynamic Groups: Members may join and leave, as

necessary

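[Figure: group communication. A send to the group passes through group address expansion and multicast communication; join, leave, and fail events are handled by group membership management.]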


Views

A group membership service maintains group

views, which are lists of current group members.

This is NOT a list maintained by one member, but…

Each member maintains its own local view

A view V_p(g) is process p’s understanding of its group (list of members)

A new group view is disseminated throughout the group whenever a member joins or leaves.

Member detecting failure of another member reliable multicasts a

Views

An event is said to occur in a view v_{p,i}(g) if the event occurs at

Messages sent out in a view i need to be delivered in that view

Requirements for view delivery

Order: If p delivers v_i(g) and then v_{i+1}(g), then no other process q

Integrity: If p delivers v_i(g), then p is in v_i(g).

Non-triviality: if process q joins a group and becomes reachable

Exception: partitioning of group. Solutions to partitioning:

Ignore partitions for the rest of the lecture.
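To make the idea of a local view concrete, here is a minimal sketch of a member that keeps its own view and checks the integrity requirement when a new view is delivered. It assumes views carry increasing integer identifiers, which is one simple way to keep every member's delivery order consistent; the class `Member` and the method `deliver_view` are hypothetical names.

```python
class Member:
    """A process p keeping its own local view of group g (illustrative sketch)."""

    def __init__(self, pid):
        self.pid = pid
        self.view_id = -1           # no view delivered yet
        self.members = frozenset()
        self.history = []           # views in the order p delivered them

    def deliver_view(self, view_id, members):
        members = frozenset(members)
        # Integrity: p only delivers views that contain p.
        if self.pid not in members:
            raise ValueError(f"{self.pid} is not in view {view_id}")
        # Assumption: view ids only increase, so p never goes back to an older view.
        if view_id <= self.view_id:
            raise ValueError("stale view")
        self.view_id, self.members = view_id, members
        self.history.append((view_id, members))


p, q = Member("p"), Member("q")
for m in (p, q):
    m.deliver_view(0, {"p", "q"})
    m.deliver_view(1, {"p", "q", "r"})  # r joined
p.deliver_view(2, {"p", "r"})           # q left, so q does not deliver this view
print(p.view_id, sorted(p.members), q.view_id)
```

After q leaves, p moves on to view 2 while q's last delivered view stays at 1.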

View Synchronous Communication

View Synchronous Communication = Group Membership

The following guarantees are provided for multicast

Integrity: If p delivered message m, p will not deliver m again.

Validity: Correct processes always deliver all messages. That is,

Agreement: Correct processes deliver the same set of messages

All View Delivery conditions (Order, Integrity and Non-triviality

“What happens in the View, stays in the View”
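The local checks behind "what happens in the view, stays in the view" might look like the sketch below: a member delivers each message at most once, and only in the view in which it was sent. This is an assumption-laden illustration, not the lecture's protocol; a real implementation would typically flush outstanding messages before installing a new view rather than simply dropping late ones. `ViewSyncMember`, `install_view`, and `receive` are hypothetical names.

```python
class ViewSyncMember:
    """Delivers each multicast at most once, and only in the view it was sent in."""

    def __init__(self, pid, view_id=0):
        self.pid = pid
        self.view_id = view_id
        self.seen = set()      # ids of messages already delivered
        self.delivered = []    # (view_id, payload) in delivery order

    def install_view(self, view_id):
        self.view_id = view_id

    def receive(self, msg_id, sent_in_view, payload):
        if msg_id in self.seen:
            return False       # integrity: never deliver m twice
        if sent_in_view != self.view_id:
            return False       # message belongs to a different view; drop it
        self.seen.add(msg_id)
        self.delivered.append((self.view_id, payload))
        return True


m = ViewSyncMember("p")
first = m.receive("m1", 0, "hello")
dup = m.receive("m1", 0, "hello")   # duplicate of m1
m.install_view(1)
late = m.receive("m2", 0, "late")   # sent in view 0, arrives in view 1
print(first, dup, late, m.delivered)
```

Only the first copy of m1 is delivered; the duplicate and the message from the old view are both rejected.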

Example: View Synchronous Communication

View Synchrony

needed (at view delivery point) to bring it up to

date