Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Replica Placement and Update Propagation in Distributed Systems, Study notes of Computer Science

University of Pittsburgh (Pitt) - Medical Center-Health System Computer Science

The placement and organization of replicas in distributed systems, including server-initiated and client-initiated replicas. It also covers update propagation methods such as lazy propagation, pushing updates, and pulling updates, as well as the use of leases. The document also touches upon epidemic algorithms for maintaining consistency and gossiping for update propagation.

Typology: Study notes

Pre 2010

Uploaded on 09/02/2009

koofers-user-kw5-1 🇺🇸

10 documents

1 / 11

This page cannot be seen from the preview

Don't miss anything!

Replica Placement

Model: We consider objects (and don’t worry whether they contain

just data or code, or both)

Distinguish different processes: A process is capable of hosting a

replica of an object or data:

•Permanent replicas: Process/machine always having a replica

•Server-initiated replica: Process that can dynamically host a

replica on request of another server in the data store

•Client-initiated replica: Process that can dynamically host a

replica on request of a client (client cache)

Replica Placement

• The logical organization of different kinds of copies of a data

store into three concentric rings.

• Examples: web servers file servers multicast trees

Discover Study notes of Computer Science University of Pittsburgh (Pitt) - Medical Center-Health System

Partial preview of the text

Download Replica Placement and Update Propagation in Distributed Systems and more Study notes Computer Science in PDF only on Docsity!

Replica Placement Model: We consider objects (and don’t worry whether they contain just data or code, or both) Distinguish different processes: A process is capable of hosting a replica of an object or data:

Permanent replicas: Process/machine always having a replica
Server-initiated replica: Process that can dynamically host a replica on request of another server in the data store
Client-initiated replica: Process that can dynamically host a replica on request of a client (client cache) Replica Placement
The logical organization of different kinds of copies of a data store into three concentric rings.
Examples: web servers file servers multicast trees

Server-Initiated Replicas

Keep track of access counts per file, aggregated by considering server closest to requesting clients
Number of accesses drops below threshold D => drop file
Number of accesses exceeds threshold R => replicate file
Number of access between D and R => migrate file Client-Initiated Replicas
More like a client cache
Keep it on disk?
Keep it in memory?
How much space to use?
How long to keep copy/replica?
How to detect data is stale?
Read-only files work best
Sharing data among client processes may be good. Sharing space is essential

Update Propagation ( 3 / 3 ) Observation: We can dynamically switch between pulling and pushing using leases : A contract in which the server promises to push updates to the client until the lease expires. Issue: Make lease expiration time dependent on system’s behavior (adaptive leases):

Age-based leases: An object that hasn’t changed for a long time, will not change in the near future, so provide a long-lasting lease
Renewal-frequency based leases: The more often a client requests a specific object, the longer the expiration time for that client (for that object) will be
State-based leases: The more loaded a server is, the shorter the expiration times become Question: Why are we doing all this? Epidemic Algorithms Basic idea: Assume there are no write–write conflicts:
Update operations are initially performed at one or only a few replicas
A replica passes its updated state to a limited number of neighbors
Update propagation is lazy, i.e., not immediate
Eventually, each update should reach every replica Anti-entropy: Each replica regularly chooses another replica at random, and exchanges state differences, leading to identical states at both afterwards Gossiping: A replica which has just been updated (i.e., has been contaminated ), tells a number of other replicas about its update (contaminating them as well).

System Model

We consider a collection servers, each storing a number of objects
Each object O has a primary server at which updates for O are always initiated (avoiding write-write conflicts)
An update of object O at server S is always time-stamped; the value of O at S is denoted VAL ( O,S )
T( O,S ) denotes the timestamp of the value of object O at server S Anti-Entropy
Basic issue: When a server S contacts another server S* to exchange state information, three different strategies can be followed:
Push : S only forwards all its updates to S*: if T(O,S*) < T(O,S) then VAL(O,S*) <= VAL(O,S)
Pull : S only fetched updates from S*: if T(O,S*) > T(O,S) then VAL(O,S*) <= VAL(O,S)
Push-Pull : S and S* exchange their updates by pushing and pulling values.
Observation: if each server periodically randomly chooses another server for exchanging updates, an update is propagated in O(log(N)) time units. Question : why is pushing alone not efficient when many servers have already been updated?

Consistency Protocols Consistency protocol: describes the implementation of a specific consistency model. We will concentrate only on sequential consistency.

Primary-based protocols
Replicated-write protocols
Cache-coherence protocols Primary-Based Protocols ( 1 / 4 )
All read and write operations go to server
Example: used in traditional client-server systems that do not support replication.

Primary-Based Protocols ( 2 / 4 ) Primary-backup protocol : writes are typically forwarded to server Example : Traditionally applied in distributed databases and file systems that require a high degree of fault tolerance. Replicas are often placed on same LAN. Primary-Based Protocols ( 3 / 4 ) Example : Establishes only a fully distributed, non-replicated data store. Useful when writes are expected to come in series from the same client (e.g., mobile computing without replication. Primary-based, local-write protocol : migrate the data, do not replace it.

Replicated-Write Protocols ( 2 / 2 ) Replicated invocations: “Centralized” Solution Assign a coordinator on each side (client and server), which ensures that only one invocation (a), and one reply is send (b). Triple Modular Redundancy

Simple to implement
Vote on all three results
Majority (50% + 1) wins (^) Request is replicated to all servers. Request A1 A2 A Voter A Result

Quorum-Based Protocols Quorum-based protocols: Ensure that each operation is carried out in such a way that a majority vote is established: distinguish read quorum and write quorum : Example: Lazy Replication

Basic model: number of replica servers jointly implement a causal-consistent data store.
Clients normally talk to front ends which maintain data to ensure causal consistency.

Replica Placement and Update Propagation in Distributed Systems, Study notes of Computer Science

Related documents

Partial preview of the text

Download Replica Placement and Update Propagation in Distributed Systems and more Study notes Computer Science in PDF only on Docsity!