






















Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
How to apply machine learning algorithms to non-linearly separable data by using feature maps and corresponding kernels. It covers the concept of a feature map, its relationship to dot-products, and the definition of a kernel. Examples of different types of kernels are provided, including polynomial and radial basis function kernels. The document also discusses the benefits of using kernels for non-linearity and the computational efficiency of computing the kernel matrix instead of the feature map.
Typology: Slides
1 / 30
This page cannot be seen from the preview
Don't miss anything!























2
2
2
For a feature map (^) φ (^) we define the corresponding kernel K: K(x, y) = φ(x), φ(y) Examples:
K(x, z) = (x, z) 2
For a feature map (^) φ (^) we define the corresponding kernel K: K(x, y) = φ(x), φ(y) Examples:
K(x, z) = (x, z) 2 Suppose d=2, and: φ(x) = [φ 1 (x),^ φ 2 (x),^ φ 3 (x),.. .] (x, z) 2 = (x 1 z 1 + x 2 z 2 ) 2 φ(x), φ(z) = φ 1 (x)φ 1 (z) + φ 2 (x)φ 2 (z) +...
For a feature map (^) φ (^) we define the corresponding kernel K: K(x, y) = φ(x), φ(y) Examples:
K(x, z) = (x, z) 2 Suppose d=2, and: φ(x) = [φ 1 (x),^ φ 2 (x),^ φ 3 (x),.. .] (x, z) 2 = (x 1 z 1 + x 2 z 2 ) 2
2 1
2 1
2 2
2 2 φ(x), φ(z) = φ 1 (x)φ 1 (z) + φ 2 (x)φ 2 (z) +...
For a feature map (^) φ (^) we define the corresponding kernel K: K(x, y) = φ(x), φ(y) Examples:
K(x, z) = (x, z) 2 Suppose d=2, and: φ(x) = [φ 1 (x),^ φ 2 (x),^ φ 3 (x),.. .] (x, z) 2 = (x 1 z 1 + x 2 z 2 ) 2
2 1
2 1
2 2
2 2
φ(x), φ(z) = φ 1 (x)φ 1 (z) + φ 2 (x)φ 2 (z) +...
2 1
2 2 Feature map for K: ]
For a feature map (^) φ (^) we define the corresponding kernel K: K(x, y) = φ(x), φ(y) Examples:
φ(x) (^) = [ x 1 2 , x 1 x 2 , x 1 x 3 , x 2 x 1 , x 2 2 , x 2 x 3 , x 3 x 1 , x 3 x 2 , x 3 2 ] T For d=3, K(x, z) =
d i= xizi
d i= xizi
d i= d j= xixj zizj = d i,j= (xixj )(zizj ) Time to compute K directly = O(d) Time to compute K though feature map = O(d 2 ) For more general d: K(x, z) = (x, z) 2
For a feature map (^) φ (^) we define the corresponding kernel K: K(x, y) = φ(x), φ(y) Examples:
For d=2: K(x, z) = (x, z) 2 K(x, z) = (x, z + c) 2 (x, z + c) 2 = (x, z) 2
For a feature map (^) φ (^) we define the corresponding kernel K: K(x, y) = φ(x), φ(y) Examples:
For d=2: K(x, z) = (x, z) 2 K(x, z) = (x, z + c) 2 (x, z + c) 2 = (x, z) 2
For a feature map (^) φ (^) we define the corresponding kernel K: K(x, y) = φ(x), φ(y) Examples:
For d=2: K(x, z) = (x, z) 2 K(x, z) = (x, z + c) 2 (x, z + c) 2 = (x, z) 2
For a feature map (^) φ (^) we define the corresponding kernel K: K(x, y) = φ(x), φ(y) Examples:
K(x, z) = (x, z) 2 K(x, z) = (x, z + c) 2 φ(x) = [xixj , 1 ≤ i, j ≤ d, √ 2 cxi, 1 ≤ i ≤ d, c] More general d:
For a feature map (^) φ (^) we define the corresponding kernel K: K(x, y) = φ(x), φ(y) Examples:
Corresponds to an infinite dimensional feature map! K(x, z) = (x, z) 2 K(x, z) = (x, z + c) 2 K(x, z) = (x, z + c) d
2
2
For a feature map (^) φ (^) we define the corresponding kernel K: K(x, y) = φ(x), φ(y) Examples:
Corresponds to an infinite dimensional feature map!
2
2