





A set of exam questions and answers related to load balancing in distributed systems. It covers topics such as static and dynamic load balancing, round-robin, randomized, and multiple-choice load balancing. The document also discusses challenges and solutions related to load balancing, such as herd behavior and g-bounded errors. It is useful for students studying distributed systems and network traffic management, offering insights into the practical application and theoretical underpinnings of load balancing techniques. The content is presented in a question-and-answer format, making it easy to review and understand key concepts.
Typology: Exams






What is load balancing?
Evenly balancing the load over servers
Minimise the maximum load on any server
or something along those lines, but compare notes with people on this one in particular (set up study sessions?).
Note: this card was made from assumptions, not from the slides.

What are some challenges with the balls-in-bins method?
Complexity of the procedure
Outdated/inaccurate information
Fault tolerance
Scalability (a dynamic system may change size)
Heterogeneous hardware (some servers have more capacity or higher speeds)
Note: he didn't mention it, so I might be wrong, but I think balls can also take different amounts of time to process, so the load is hard to predict. Maybe that falls under "inaccurate information".

What is static load balancing?
The load balancer does NOT consider system state
Hash function
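
Hash-based dispatch is one way a static balancer can choose a server without looking at system state. A minimal sketch, assuming requests carry some string key (for example a client ID); the function name `pick_server` and the choice of SHA-256 are illustrative assumptions, not something from the slides:

```python
import hashlib


def pick_server(request_key: str, servers: list[str]) -> str:
    """Static choice: depends only on the key, never on current server load."""
    # Hash the key to a large integer (SHA-256 is an arbitrary choice here).
    digest = hashlib.sha256(request_key.encode("utf-8")).digest()
    index = int.from_bytes(digest[:8], "big") % len(servers)
    return servers[index]


if __name__ == "__main__":
    backends = ["server-a", "server-b", "server-c"]  # hypothetical names
    # The same key always maps to the same server, whatever the queues look like.
    print(pick_server("client-17", backends))
    print(pick_server("client-17", backends))
```

Because the mapping ignores queue lengths, it is cheap and needs no load queries, but it cannot react when one server falls behind.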
Similar to least load, but we also consider server speed
Computes how long a request would take at every backend server, given current speeds and queues
Note: works great for heterogeneous hardware and is used by Nginx and other web servers

What is the problem with a centralised load balancer?
It is a potential bottleneck
It's a single point of failure
We can use multiple balancers to fix this (multiple dispatchers)

What is a problem with multiple dispatchers?
As they work in parallel on the same information, they can all send their requests to the same server (e.g. the one with the lowest load)
This herd behaviour can have the opposite effect: the least-loaded server becomes overloaded

What are the solutions for herd behaviour with multiple dispatchers?
Randomised load balancing
OR
A round-based algorithm in which dispatchers allocate jobs to bins according to their probabilities in the current round. To compute the probabilities, some information is needed about server queues and processing rates as well as job arrival rates
Note: if you need this I will be disgusted

What is multiple-choice randomised load balancing?
Dynamic randomised load balancing
Instead of choosing 1 random bin (or server), choose multiple random bins
Allocate the ball to the bin with the lowest load among these choices

What is the d-choice protocol (greedy[d])?
Greedy[d]
Allocate m balls into n bins
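
The multiple-choice idea above can be tried out with a small simulation. This is only a sketch: it assumes each ball samples d bins uniformly at random (with replacement, so a bin may be sampled twice) and goes to the least loaded of them; the function name `greedy_d` and the tie-breaking rule (first sampled bin wins) are assumptions for illustration.

```python
import random


def greedy_d(m: int, n: int, d: int, rng: random.Random) -> list[int]:
    """Throw m balls into n bins; each ball goes to the least loaded of d random bins."""
    loads = [0] * n
    for _ in range(m):
        # Sample d candidate bins uniformly at random (with replacement).
        choices = [rng.randrange(n) for _ in range(d)]
        # Pick the candidate with the lowest current load (ties: first sampled).
        target = min(choices, key=lambda b: loads[b])
        loads[target] += 1
    return loads


if __name__ == "__main__":
    n = 10_000
    for d in (1, 2, 3):
        loads = greedy_d(n, n, d, random.Random(42))
        print(f"d={d}: maximum load = {max(loads)}")
```

With d = 1 this is plain randomised (1-choice) balancing, so running it for several values of d gives a rough feel for how much the extra choices reduce the maximum load.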
What is a g-bounded error?
Balls are allocated using 2 choices, but if the load difference is g or less, the process makes a mistake and chooses the higher-loaded bin
This models scenarios where load queries are inaccurate

What is the result of a process being g-bounded?
Do I need to know this? Hope not
This is called saving load queries, but the slide doesn't make much sense right now. Make a flashcard later
Note: stored at tmp

In a 1-choice load balancer, what is the probability that a bin is empty?
Assume m = n
Probability of a particular ball going into a particular bin: 1/n
Probability of a particular ball not falling into a particular bin: 1 - (1/n)
Probability that none of the m = n balls falls into a particular bin, for n → ∞: (1 - 1/n)^n → 1/e ≈ 0.37
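
Multiplying the per-ball probability 1 - 1/n over all m independent balls gives (1 - 1/n)^m, and for m = n this tends to 1/e as n grows. A quick numeric check, as a sketch (the function name `prob_bin_empty` is made up for illustration):

```python
import math
from typing import Optional


def prob_bin_empty(n: int, m: Optional[int] = None) -> float:
    """Probability that one particular bin stays empty after m uniform throws into n bins."""
    if m is None:
        m = n  # the flashcard's assumption: as many balls as bins
    return (1 - 1 / n) ** m


if __name__ == "__main__":
    for n in (10, 100, 10_000, 1_000_000):
        print(f"n={n}: P(empty) = {prob_bin_empty(n):.6f}")
    print(f"1/e        = {1 / math.e:.6f}")
```

So with as many balls as bins, roughly 37% of the bins are expected to stay empty.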
A collision is 2 balls falling into the same bin
Three balls in a bin count as 3 collisions (1-2, 1-3, 2-3)
Expected number of collisions: (m choose 2) · 1/n
Note: hopefully don't have to understand the formula, but check

In a 1-choice load balancer, what is the probability of a particular bin having exactly k balls?
More nonsense I hope I don't have to know (but check)

In a 1-choice load balancer, what is the probability of a particular bin having at least k balls?
Ew

In a 1-choice load balancer, what is the maximum load?
Θ(log n / log log n) for m = n, with probability 1 - o(1)
He has 2 proof slides in the PowerPoint, but I don't think we will have to know or even understand them. They are confusing

What is a distributed hash table? Give an example
A set of nodes that store data items
When a data item is added, a hash function is used to determine which node stores it
E.g. Chord

How can we add a new (actual) node x in Chord?
Let y be the successor of x
Move the items at y that belong to node x to x
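
The join step can be sketched in a few lines. This is a simplified toy, not the real protocol: it assumes a single object with a global view of the ring (real Chord nodes only know part of it), a small 16-bit identifier space, and that an item belongs to the first node whose ID is at or after the item's hashed ID, wrapping around. The names `Ring`, `ring_id` and `add_node` are hypothetical.

```python
import bisect
import hashlib

RING_BITS = 16  # assumption: small identifier space to keep the numbers readable


def ring_id(name: str) -> int:
    """Hash a key onto the identifier ring."""
    digest = hashlib.sha1(name.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % (2 ** RING_BITS)


class Ring:
    def __init__(self) -> None:
        self.node_ids: list[int] = []               # node IDs, kept sorted
        self.items: dict[int, dict[str, str]] = {}  # node ID -> items stored there

    def successor(self, ident: int) -> int:
        """First node at or after ident, wrapping around (needs at least one node)."""
        i = bisect.bisect_left(self.node_ids, ident)
        return self.node_ids[i % len(self.node_ids)]

    def put(self, key: str, value: str) -> None:
        self.items[self.successor(ring_id(key))][key] = value

    def get(self, key: str):
        return self.items[self.successor(ring_id(key))].get(key)

    def add_node(self, x: int) -> None:
        if x in self.items:
            return  # node already present
        if not self.node_ids:
            self.node_ids.append(x)
            self.items[x] = {}
            return
        y = self.successor(x)            # let y be the successor of x
        bisect.insort(self.node_ids, x)
        self.items[x] = {}
        # Move the items at y that now belong to x over to x.
        for key in list(self.items[y]):
            if self.successor(ring_id(key)) == x:
                self.items[x][key] = self.items[y].pop(key)


if __name__ == "__main__":
    ring = Ring()
    ring.add_node(100)
    ring.add_node(40_000)
    for i in range(20):
        ring.put(f"item{i}", f"value{i}")
    ring.add_node(20_000)  # only the successor of 20000 hands over items
    print({node: len(stored) for node, stored in ring.items.items()})
```

Only y is affected by the join, which is the point of the scheme: all other nodes keep their items.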