VC-Dimension and Empirical Risk Minimization: An Assignment in Machine Learning, Assignments of Machine Learning

assignment 3 of csce machine learning

Typology: Assignments

2021/2022

Uploaded on 10/10/2022

huixin-zhang-1103
huixin-zhang-1103 ๐Ÿ‡บ๐Ÿ‡ธ

5

(1)

4 documents

1 / 3

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Assignment)3)
1. Let&X&=&R,&the&real&line.&Let&H&=&{(a,&+โˆž):&a&is&a&real&number}.&That&is,&H&is&the&set&of&semi-
infinite&intervals&which&are&unbounded&on&the&right.&Determine&VC-dim(H).&
VC-dim(H)&=&1.&
First,&H&can&shatter&the&empty&set&{ฯ•}.&
Second,&for&x1&=&{x},&H&can&shatter&all&subsets.&
Then,&as&for&two&points&set&x2&=&{x1,&x2},&one&point&cannot&be&extracted&without&affecting&the&
other&one.&
&
2. Let&X&=&R,&the&real&line.&Let&H&=&{(a1,&a2):&a1&and&a2&are&real&numbers}.&That&is,&H&is&the&set&
of&bounded&intervals.&Determine&VC-dim(H).&
VC-dim(H)&=&2.&
First,&H&can&shatter&the&empty&set&{ฯ•}.&
Second,&for&x1&=&{x},&H&can&shatter&all&subsets&{{ฯ•},&{x}}.&
Then,&for&x2&=&{x1,&x2},&H&can&shatter&all&subsets&{{ฯ•},&{x1},&{x2},&{x1,&x2}}.&
However,& when& the& number& of& points& is& 3,& the& subsets& which& contain& {x1,& x3}& cannot& be&
shattered&without&affecting&{x2}.&
&
3. As&below:&
(i)&Let&X&=&R2,&the&two-dimensional&plane.&Let&R(a1,'a2)&:=&{(x1,&x2):&x1&โ‰ฅ&a1,&x2&โ‰ฅ&a2}&denote&
the&two-dimensional&semi-infinite&rectangle&with&โ€œsouthwestโ€&corner&at&the&point&(a1,&a2).&
Let& H& consist& of& all& such& semi-infinite& rectangles,& i.e.,& H& =& {R(a1,'a2):& a1&and& a2&are& real&
numbers}.&Determine&VC-dim(H).&
(ii) Generalize&the&above&result&to&the&higher&dimensional&case&where&X&=&Rn,&R(a1,'a 2,'...' ,'
an)&:=&{(x1,&x2,&...&,&xn):&x1&โ‰ฅ&a1,&x2&โ‰ฅ&a2,&...&,&xn&โ‰ฅ&an},&and&H&=&{R(a1,'a2,'...','an):&a1,&a2,&...&,&an&are&real&
numbers}.&
1) VC-dim(H)&=&2.&
First,&H&can&easily&shatter&the&empty&set&{ฯ•}.&
Second,&for&x1&=&{x},&H&can&shatter&all&subsets&{{ฯ•},&{x}}.&
Then,&for&x2&=&{x1,&x2},&H&can&shatter&all&subsets&{{ฯ•},&{x1},&{x2},&{x1,&x2}}.&
However,&when&the&number&of&points&is&3,&the&subsets&which&contain&{x1,&x2}&cannot&be&
shattered&without&affecting&{x3}.&
2) VC-dim(H)&=&n.&
pf3

Partial preview of the text

Download VC-Dimension and Empirical Risk Minimization: An Assignment in Machine Learning and more Assignments Machine Learning in PDF only on Docsity!

Assignment 3

  1. Let X = R, the real line. Let H = {(a, +โˆž): a is a real number}. That is, H is the set of semi-

infinite intervals which are unbounded on the right. Determine VC-dim(H).

VC-dim(H) = 1.

First, H can shatter the empty set {ฯ•}.

Second, for x

1

= {x}, H can shatter all subsets.

Then, as for two points set x

2

= {x 1

, x 2

}, one point cannot be extracted without affecting the

other one.

  1. Let X = R, the real line. Let H = {(a 1

, a 2

): a 1

and a 2

are real numbers}. That is, H is the set

of bounded intervals. Determine VC-dim(H).

VC-dim(H) = 2.

First, H can shatter the empty set {ฯ•}.

Second, for x

1

= {x}, H can shatter all subsets {{ฯ•}, {x}}.

Then, for x

2

= {x 1

, x 2

}, H can shatter all subsets {{ฯ•}, {x 1

}, {x 2

}, {x 1

, x 2

However, when the number of points is 3, the subsets which contain {x 1

, x 3

} cannot be

shattered without affecting {x 2

  1. As below:

(i) Let X = R

2

, the two-dimensional plane. Let R (a1, a2)

:= {(x 1

, x 2

): x 1

โ‰ฅ a 1

, x 2

โ‰ฅ a 2

} denote

the two-dimensional semi-infinite rectangle with โ€œsouthwestโ€ corner at the point (a 1

, a 2

Let H consist of all such semi-infinite rectangles, i.e., H = {R(a1, a2): a 1 and a 2 are real

numbers}. Determine VC-dim(H).

(ii) Generalize the above result to the higher dimensional case where X = R

n

, R

(a1, a2, ... ,

an)

:= {(x 1

, x 2

, ... , x n

): x 1

โ‰ฅ a 1

, x 2

โ‰ฅ a 2

, ... , x n

โ‰ฅ a n

}, and H = {R (a1, a2, ... , an)

: a 1

, a 2

, ... , a n

are real

numbers}.

  1. VC-dim(H) = 2.

First, H can easily shatter the empty set {ฯ•}.

Second, for x

1

= {x}, H can shatter all subsets {{ฯ•}, {x}}.

Then, for x

2

= {x 1

, x 2

}, H can shatter all subsets {{ฯ•}, {x 1

}, {x 2

}, {x 1

, x 2

However, when the number of points is 3, the subsets which contain {x 1

, x 2

} cannot be

shattered without affecting {x 3

  1. VC-dim(H) = n.

First, in n dimension space, H can shatter subsets of x

n

, when each point in one dimension.

Therefore, VC-dim(H) โ‰ฅ n.

Then, when the number of points increases, the (n + 1)-th point will make not all the

subsets of x

n+

be shattered without affecting others. Thus, VC-dim(H) < n + 1.

Now we know that VC-dim(H) = n, the VC-dimension of semi-intervals equals to the

dimension of hyper-plan.

  1. In this problem, you can use the fact that there is uniform convergence in the law of weak

numbers for a class H of sets if its VC-dim(H) is finite. The Glivenko-Cantelli Theorem

says that empirical distribution functions converge in the L โˆž

  • norm to the true

distribution function in probability. Here is what it means.

Let F(x) be the distribution function of a random variable X, i.e., P(X โ‰ค x) = F(x). We wish

to estimate this distribution function. For this purpose, we obtain m i.i.d. samples {x 1

x 2

, ... , x m

} where each x i

โˆผ P. Then we construct the empirical distribution function

!

"

!

!

#$"

. Show that P(sup x

||G

m

(x) โˆ’ F(x)|| > ฮต) โ†’ 0 as m โ†’ โˆž.

Weโ€™ve already known that there is uniform convergence in the law of weak numbers for a

class H of sets if its VC-dim(H) is finite.

The empirical distribution function is ๐บ

!

"

!

!

#$"

Let t be integral number in the real line R, ฯต > 0 and t < 1/ฯต.

๐นo๐‘

%ฬ…

q = ๐นo๐‘

'

q โˆ’ ๐‘ƒ(๐‘‹

'

'

'

should satisfy ๐น(๐‘

%ฬ…

'

(

'

), j = 1, 2, โ€ฆ , t.

for ๐‘

')"

'

๐นo๐‘

%

ฬ…

q โˆ’ ๐นo๐‘

'

q โ‰ค

"

(

When m โ†’ โˆž, we have ๐บ

!

o๐‘

'

q โˆ’ ๐นo๐‘

'

q โ†’ 0 , ๐บ

!

o๐‘

%ฬ…

q โˆ’ ๐นo๐‘

%ฬ…

q โ†’ 0 , then

โˆ† = max

'$",+,โ€ฆ,(

x|๐บ

!

o๐‘

'

q โˆ’ ๐นo๐‘

'

q|, |๐บ

!

o๐‘

%ฬ…

q โˆ’ ๐นo๐‘

%ฬ…

q|y โ†’ 0.

For any x and j, ๐นo๐‘ ')"

q โ‰ค ๐‘ฅ โ‰ค ๐นo๐‘

'

q.

So

Then,