Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Log in Sign up

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Adaboost Algorithm: Building Strong Classifiers from Weak Ones, Exercises of Artificial Intelligence

Central University of Jammu and Kashmir Artificial Intelligence

The adaboost algorithm is a machine learning technique used to create strong classifiers from a series of weak classifiers. In this document, we explore the adaboost method, which was developed by freund and schapire, and learn how to calculate error rates, determine weights, and reweight samples to improve classification accuracy. Essential for students studying machine learning, data mining, or artificial intelligence.

Typology: Exercises

2011/2012

Uploaded on 07/31/2012

shaina_44kin 🇮🇳

3.9

(9)

64 documents

1 / 3

This page cannot be seen from the preview

Don't miss anything!





6.034f Boosting Notes

Patrick Winston and Luis Ortiz

Draft of November 17, 2009

A strong classiﬁer is one that has an error rate close to zero. A weak classiﬁer is one that has

an error rate just below 1

2, producing answers just a little better than a coin ﬂip.

Freund and Schapire discovered that you can construct a strong classiﬁer from weak

classiﬁers such that the strong classiﬁer will correctly classify all samples in a sample set.

Better still, the strong classiﬁer consists of a sequence of weighted classiﬁers, determined step

by step. In the following, the multiplicative weight found in the ﬁrst step is α1 and the weak

classiﬁer is h1(xi), and in step s, the weight is αs and the classiﬁer is hs(xi):

H(xi) = sign(α1h1(xi)+ α2h2(xi)+ ···+ αshs(xi) ···

where

sign = +1 for positive arguments

−1 for negative arguments

hs(xi)= +1 for samples the classiﬁer thinks belong to the class

−1 for samples the classiﬁer thinks do not belong to the class

Freund and Schapire named their method Adaboost, an acronym for adaptive boosting.

In the ﬁrst Adaboost step, you ﬁnd the weak classiﬁer, h1(xi), that produces the lowest

error rate; then, you ﬁnd the corresponding multiplier, α1. In step s, you ﬁrst ﬁnd the weak

classiﬁer, hs(xi), that produces the lowest error rate with the samples reweighted to emphasize

previously misclassiﬁed samples; then you ﬁnd the corresponding multiplier, αs. You continue

taking steps until the classiﬁer H(xi) correctly classiﬁes all samples or you cannot ﬁnd any

weak classiﬁer for the next step.

Several questions emerge:

• How do you compute the error rate†

, E, for the candidates for hs(xi)?

• How do you compute αs once you have hs(xi)?

• How do you reweight the samples to emphasize the misclassiﬁed samples for the next

step?

To compute the error rate, you assign to each sample, i, at each step, s, an emphasis-

determining weight, ws

i . The weights used in step 1 are all the same:

1

w =

i number of samples

Each time you calculate the weights for the next step, you normalize so that the weights

still add to 1:

wi

s =1

i

†We use E for the error rate to avoid confusion with the base of the natural logarithms, e.

docsity.com

Discover Exercises of Artificial Intelligence Central University of Jammu and Kashmir

Partial preview of the text

Download Adaboost Algorithm: Building Strong Classifiers from Weak Ones and more Exercises Artificial Intelligence in PDF only on Docsity!

6.034f Boosting Notes

Patrick Winston and Luis Ortiz

Draft of November 17, 2009

A strong classifier is one that has an error rate close to zero. A weak classifier is one that has

an error rate just below 12 , producing answers just a little better than a coin flip. Freund and Schapire discovered that you can construct a strong classifier from weak classifiers such that the strong classifier will correctly classify all samples in a sample set. Better still, the strong classifier consists of a sequence of weighted classifiers, determined step by step. In the following, the multiplicative weight found in the first step is α^1 and the weak

classifier is h^1 ( x (^) i ), and in step s , the weight is α s^ and the classifier is h s ( x (^) i ):

H( x (^) i ) = sign(α^1 h^1 ( x (^) i ) + α^2 h^2 ( x (^) i ) + · · · + α s h s ( x (^) i ) · · · where

sign = +1^ for^ positive^ arguments − 1 for negative arguments

h s ( xi ) =

+1 for samples the classifier thinks belong to the class − 1 for samples the classifier thinks do not belong to the class

Freund and Schapire named their method Adaboost , an acronym for ada ptive boosting. In the first Adaboost step, you find the weak classifier, h^1 ( x (^) i ), that produces the lowest error rate; then, you find the corresponding multiplier, α^1. In step s , you first find the weak classifier, h s ( x (^) i ), that produces the lowest error rate with the samples reweighted to emphasize previously misclassified samples; then you find the corresponding multiplier, α s. You continue taking steps until the classifier H( x (^) i ) correctly classifies all samples or you cannot find any weak classifier for the next step. Several questions emerge:

How do you compute the error rate

† , E , for the candidates for h s ( x (^) i )?

How do you compute α s^ once you have h s ( x (^) i )?
How do you reweight the samples to emphasize the misclassified samples for the next step?

To compute the error rate, you assign to each sample, i , at each step, s , an emphasis- determining weight, w s i. The weights used in step 1 are all the same:

w (^) i = number of samples Each time you calculate the weights for the next step, you normalize so that the weights still add to 1:

w i^ s^ = 1 i

†We use E for the error rate to avoid confusion with the base of the natural logarithms, e.

2

The error rate of a candidate classifier for a particular step is the sum of the weights for the samples that the candidate classifier misclassifies at that step:

Es candidate^ = w si for i misclassified by the candidate at step s i Es^ is the error rate for the best of the candidate classifiers. But what about computing α s^ and reweighting the samples. With some moderately complex mathematics, Freund and Shipiri determined that computing new weights from old weights using the following formula ensures that the overall error for H( x (^) i ), as you

add classifiers, will stay under an exponential bound, and eventually go to zero. Ns^ is a

normalizing constant

† for step s that ensures that all the new weights, w s i +1, add up to 1. ⎧ (^) w s e −α

s ⎨ (^) Ns^ i^ for^ correctly^ classified^ samples w^ s + i =^ ⎩ w s i (^) e +α s for misclassified samples N s With more math, Freund and Shipiri determined that the exponential bound on the overall error of H( xi ) is minimized if Ns^ is minimized. This led them to a formula for α s :

1 1 − Es α s^ = ln 2 Es At this point, you have all you need to write an Adaboost program:

You use uniform weights to start.
For each step, you find the classifier that yields the lowest error rate for the current

weights, w s i.

You use that best classifier, h s ( x (^) i ), to compute the error rate associated with the step, Es
You determine the alpha for the step, α s^ from the error for the step, Es^.
With the alpha in hand, you compute the weights for the next step, w s i +1, from the weights

for the current step, w s i , taking care to include a normalizing factor, Ns , so that the new weights add up to 1.

You stop successfully when H( x (^) i ) correctly classifies all the samples, xi ; you stop unsuc cessfully if you reach a point where there is no weak classifier, one with an error rate < 1 2. You, however, are not a computer, so calculating those exponentials and logarithms is im practical on an examination. You need to massage the formulas a bit to make them work for you. First, you plug the formula for α s^ into the reweighting formula, producing the following:

⎪ w i

⎧ s (^) Es ⎨ (^) Ns (^) 1 − Es for^ correctly^ classified^ samples w^ s + i =^ ⎪ √ ⎩ w si 1 − Es for misclassified samples N s^ Es Now, because Ns^ must be that number that makes the new weights add up to 1, you can write the following:

†We use Ns (^) rather than Z s , used by Freund and Schapire, to avoid confusion with the number 2 when written by hand.

Adaboost Algorithm: Building Strong Classifiers from Weak Ones, Exercises of Artificial Intelligence

Related documents

Partial preview of the text

Download Adaboost Algorithm: Building Strong Classifiers from Weak Ones and more Exercises Artificial Intelligence in PDF only on Docsity!

6.034f Boosting Notes

Patrick Winston and Luis Ortiz

Draft of November 17, 2009