Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Log in Sign up

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Randomized Rounding and Concentration Bounds in Set Cover Problem, Study notes of Advanced Algorithms

University of Wisconsin (UW) - Madison Advanced Algorithms

The advanced algorithms lecture on randomized rounding and concentration bounds applied to the set cover problem. The lecture covers the ilp formulation of set cover, relaxing it to an lp problem, deterministic rounding, and randomized rounding. The document also introduces concentration bounds, markov's inequality, chebyshev's inequality, and chernoff's bound to analyze the performance of randomized techniques.

Typology: Study notes

2011/2012

Uploaded on 02/14/2012

alexey 🇺🇸

4.7

(20)

325 documents

1 / 6

This page cannot be seen from the preview

Don't miss anything!

CS787: Advanced Algorithms

Scribe: Archit Gupta, Pavan Kuppili Lecturer: Shuchi Chawla

Topic: Randomized rounding, concentration bounds Date: 10/10/2007

12.1 Set Cover

Let E={e1, e2, .., en}be a set of nelements. Let S1, S2, .., Smbe subsets of Ewith associated

costs c1, .., cm. As explained in a previous lecture, the problem of set cover is to choose 1 or more

of the msubsets such that the union of them cover every element in E, and the objective is to

find the set cover with the minimum cost. In a previous lecture, we have seen a greedy solution to

the set cover which gave a log n approximation. In this lecture, we will see a linear programming

solution with randomized rounding to achieve an O(logn) approximation.

12.1.1 ILP formulation of set cover

Let Xibe a random variable associated with each subset Si.

Xi= 1, if Siis in the solution, and 0 otherwise.

The ILP formulation:

Minimize Pm

i=1 ciXis.t

∀e∈E, Pi:e∈SiXi≥1 - (1)

∀Xi, Xi∈ {0,1}- (2)

The constraints in (1) ensure that every element is present in atleast one of the chosen subsets.

The constraints in (2) just mean that every subset is either chosen or not. The objective function

chooses a feasible solution with the minimum cost.

It can be noted that this problem formulation is a generalization of the vertex cover. If we map

every edge to an element in E, and every vertex to one of the the subsets of E, the vertex cover is

the same as the set cover problem with the additional constraint that every element eappears in

exactly 2 subsets (corresponding to the 2 vertices it is incident on).

12.1.2 Relaxing to an LP problem

Instead of Xi∈ {0,1}, relax the constraint so that Xiis in the range [0, 1]. The LP problem can be

solved optimally in polynomial time. Now we need to round the solution back to an ILP solution.

12.1.3 Deterministic rounding

First, let us see the approximation we get by using a deterministic rounding scheme analogous to

the one we used in the vertex cover problem.

Theorem 12.1.1 We have a deterministic rounding scheme which gives us an F-approximation

1

Discover Study notes of Advanced Algorithms University of Wisconsin (UW) - Madison

Partial preview of the text

Download Randomized Rounding and Concentration Bounds in Set Cover Problem and more Study notes Advanced Algorithms in PDF only on Docsity!

CS787: Advanced Algorithms Scribe: Archit Gupta, Pavan Kuppili Lecturer: Shuchi Chawla Topic: Randomized rounding, concentration bounds Date: 10/10/

12.1 Set Cover

Let E = {e 1 , e 2 , .., en} be a set of n elements. Let S 1 , S 2 , .., Sm be subsets of E with associated costs c 1 , .., cm. As explained in a previous lecture, the problem of set cover is to choose 1 or more of the m subsets such that the union of them cover every element in E, and the objective is to find the set cover with the minimum cost. In a previous lecture, we have seen a greedy solution to the set cover which gave a log n approximation. In this lecture, we will see a linear programming solution with randomized rounding to achieve an O(logn) approximation.

12.1.1 ILP formulation of set cover

Let Xi be a random variable associated with each subset Si. Xi = 1, if Si is in the solution, and 0 otherwise.

The ILP formulation: Minimize

∑m i=1 ciXi^ s.t ∀e ∈ E,

i:e∈Si Xi^ ≥^ 1 - (1) ∀Xi, Xi ∈ { 0 , 1 } - (2)

The constraints in (1) ensure that every element is present in atleast one of the chosen subsets. The constraints in (2) just mean that every subset is either chosen or not. The objective function chooses a feasible solution with the minimum cost.

It can be noted that this problem formulation is a generalization of the vertex cover. If we map every edge to an element in E, and every vertex to one of the the subsets of E, the vertex cover is the same as the set cover problem with the additional constraint that every element e appears in exactly 2 subsets (corresponding to the 2 vertices it is incident on).

12.1.2 Relaxing to an LP problem

Instead of Xi ∈ { 0 , 1 }, relax the constraint so that Xi is in the range [0, 1]. The LP problem can be solved optimally in polynomial time. Now we need to round the solution back to an ILP solution.

12.1.3 Deterministic rounding

First, let us see the approximation we get by using a deterministic rounding scheme analogous to the one we used in the vertex cover problem.

Theorem 12.1.1 We have a deterministic rounding scheme which gives us an F-approximation

to the solution, where F is the maximum frequency of an element.

Proof: Let F be the maximum frequency of an element, or the maximum number of subsets an element appears in. Pick a threshold t = (^) F^1. If x 1 , .., xm is the solution to the LP, the ILP solution is all subsets for which xi ≥ (^) F^1.

This results in a feasible solution because in any of the constraints in (1) we have atmost F variables on the LHS, and atleast one of them should be ≥ (^) F^1 , so we will set atleast one of those variables to 1 during the rounding. So, every element will be covered. It can be noted here that the vertex cover has F = 2, and hence we used a threshold of 12 during the previous lecture. If the values of X 1 , .., Xm are x ′ 1 , .., x

′ m after rounding,^ ∀i, x

′ i ≤^ F xi^ [since if^ xi^ ≥^

1 F we round it to 1, and 0 otherwise] So the cost of the ILP solution ≤ F(cost of the LP solution). So, this deterministic rounding gives us an F approximation to the optimal solution.

12.1.4 Randomized rounding

Algorithm: Step 1. Solve the LP. Step 2 (Randomized rounding). ∀ i, pick Si independently with probability xi (where xi is the value of Xi in the LP solution). Step 3. Repeat step 2 until all elements are covered.

The intution behind the algorithm is that higher the value of xi in the LP solution, the higher the probability of Si being picked during the random rounding.

Theorem 12.1.2 With a probability (1 − (^) n^1 ), we get a 2logn approximation.

Proof:

Lemma 12.1.3 The expected cost in each iteration of step 2 is the cost of the LP solution.

Proof: Let Yi be a random variable. Yi = ci if Si is picked, and 0 otherwise. Let Y =

∑n i=1 Yi, E[Y] =

∑n i=1 E[Yi] [by linearity of expectation] =

∑n i=1 cixi = cost of the LP solution. This is a nice result as the expected cost of the ILP solution is exactly equal to the cost of the LP solution.

Lemma 12.1.4 The number of iterations of step 2 is 2logn with a high probability (whp).

holds:

Pr[X ≥ λ] = Pr[f (X) ≥ f (λ)] ≤

E[f (X)] f (λ)

12.2.2 Chebyshev’s Inequality

We now study a tighter bound called the Chebyshev’s bound. Let f (X) = (X − E[X])^2. Note that f is an increasing function if X > E[X]. We have,

Pr[|X − E[X]| ≤ λ] = Pr

[

(X − E[X])^2 ≤ λ^2

]

E

[

(X − E[X])^2

]

λ^2

σ^2 (X) λ^2

That is, the deviation of X from E[X] is a function of its variance(σ(X)). If the variance is small, then we have a tight bound.

Also not that with probability p, X ∈ E[X] ±

1 p σ(X)

12.2.3 Chernoff’s bound

We present Chernoff’s bound which is tighter compared to the Chebyshev and Markov bounds.

Let random variables X 1 , X 2 , ..., Xn be independent and identically distributed random vari- ables, where Xi ∈ [0, 1]. Note that the class of indicator random variables lie in this category.

Let X =

∑n i=1 Xi. Also^ E[X] =^

∑n i=1 E[Xi] =^ μ

For any δ > 0,

Pr[X ≥ (1 + δ)μ] ≤

eδ (1 + δ)1+δ

)μ (12.2.6)

It can also be shown for δ ≥ 0,

Pr[X ≥ (1 + δ)μ] ≤ e −

“ (^) δ 2 2+δ

” μ (12.2.7)

Note that the above probability resembles a gaussian/bell curve. When 0 ≤ δ ≤ 1

Pr[X ≥ (1 + δ)μ] ≤ e−^

δ^2 3 μ^ (12.2.8)

Pr[X ≤ (1 − δ)μ] ≤ e−^ δ 22 μ (12.2.9)

Example: Coin tossing to show utility of bounds. We toss n independent unbiased coins. We want to find a t such that: Pr[No. of times we get heads in n tosses ≥ t] ≤ 1 /n.

We use indicator random variables to solve this: If the ith coin toss is heads, let Xi = 1, oth- erwise Xi = 0. Let X =

∑n i=1 Xi. Here^ X^ is the total number of heads we get in^ n^ independent tosses. Note that μ = E[X] =

∑n i=1 E[Xi] =^ n/2, since^ E[Xi] = 1/2 (The probability of getting a heads).

To get bounds on Pr[X ≥ t], we first apply Markov’s inequality:

Pr[X ≥ t] ≤

E[X]

t

n/ 2 t

Pr[X ≥ t] = 1/n ⇒

n/ 2 t = 1/n ⇒ t = n^2 / 2

But, we don’t do any more than n coin tosses, so this bound is not useful. Note that Markov’s bound is weak.

Applying Chebyshev’s inequality:

Pr[X ≥ t] ≤ Pr[|X − μ| ≥ t − μ] ≤ σ^2 (X) (t − μ)^2

Evaluating σ(X) where Xi ∈ { 0 , 1 }, E[Xi] = 1/ 2

σ^2 (Xi) = E

[

(Xi − E[Xi])^2

]

Since this is a sum of independent random variables, the variances can be summed together:

σ^2 (X) =

∑^ n

i=

σ^2 (Xi) = n/ 4

Evaluating t,

Pr[X ≥ t] ≤ σ^2 (X) (t − μ)^2

= 1/n

⇒ t = μ + n/2 = n

Again this bound is quite weak.

Applying Chernoff’s bound:

Pr[X ≥ (1 + δ)μ] ≤ e−^

δ 32 μ = e−^

δ 62 n

Taking,

e−^

δ^2 6 n^ = 1/n

Randomized Rounding and Concentration Bounds in Set Cover Problem, Study notes of Advanced Algorithms

Related documents

Partial preview of the text

Download Randomized Rounding and Concentration Bounds in Set Cover Problem and more Study notes Advanced Algorithms in PDF only on Docsity!

12.1 Set Cover

12.1.1 ILP formulation of set cover

12.1.2 Relaxing to an LP problem

12.1.3 Deterministic rounding

12.1.4 Randomized rounding

12.2.2 Chebyshev’s Inequality

[

]

E

[

(X − E[X])^2

]

12.2.3 Chernoff’s bound

E[X]

[

]