Linear Subspaces, Geometry - Lecture Slides | CMSC 828, Study notes of Computer Science

Material Type: Notes; Professor: Jacobs; Subject: Computer Science; University: University of Maryland; Term: Unknown 1989;

Typology: Study notes

Pre 2010

Uploaded on 07/30/2009

koofers-user-lsp-1
koofers-user-lsp-1 🇺🇸

10 documents

1 / 7

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
1
Linear Subspaces - Geometry
No Invariants, so Capture
Variation
Each image = a pt. in a high-dimensional
space.
Image: Each pixel a dimension.
Point set: Each coordinate of each pt. A
dimension.
Simplest rep. of variation is linear.
Basis (eigen) images: x1…xk
Each image, x = a1x1+ … + akxk
Useful if k << n.
pf3
pf4
pf5

Partial preview of the text

Download Linear Subspaces, Geometry - Lecture Slides | CMSC 828 and more Study notes Computer Science in PDF only on Docsity!

Linear Subspaces - Geometry

No Invariants, so Capture

Variation

  • Each image = a pt. in a high-dimensional

space.

  • Image: Each pixel a dimension.
  • Point set: Each coordinate of each pt. A dimension.
  • Simplest rep. of variation is linear.
  • Basis (eigen) images: x 1 …xk
  • Each image, x = a 1 x 1 + … + akxk
  • Useful if k << n.

When is this accurate?

  • Approximately right when:
    • Variation approximately linear. Always true for small variation.
    • Some variations big, some small, can discard small.
  • Exactly right sometimes.
    • Point features with scaled-orthographic projection.
    • Convex, Lambertian objects and distant lights.

Principal Components Analysis

(PCA)

  • All-purpose linear approximation.
  • Given images (as vectors)
  • Finds low-dimensional linear subspace

that best approximates them.

  • Eg., minimizes distance from images to

subspace.

PCA derivation

(this is all just taken from Duda, Hart and Stork)

Suppose we have a series of vectors, x1…xn, and we want to approximate them with a low-dimensional subspace. What is the best way to do this? If we want to approximate them with a 0 dimensional subspace, we can do this most accurately by approximating them by their mean, m. This is probably intuitive, but if not, Duda, Hart and Stork have a very nice proof (Eq. 80, p. 115).

Next we’ll consider find the best 1-dimensional subspace, written as: x_i is approximated by m+a_ie, where e is a unit vector indicating the direction of the space. Then our goal is to choose a_i and e to minimize:

J(a1, …, an, e) = sum ||(m+ake)-xk}}^

= sum ||ake – (xk – m)||^ = sum ak^2||e||^2 – 2 sum ak e(xk – m) + sum ||xk – m||^ ||e|| = 1. Taking the derivatives w.r.t. ak and setting them to 0 we get: 2ak – 2 e(xk-m) = 0,

ak = e(xk-m).

We can skip this derivation, and just say that of course we get the best choice of ak by projecting xk-m onto e.

Now, if we set ak = e(xk-m), we get J as a function of e J(e) = sum ak^2 – 2 sum ak^2 + sum ||xk-m||^ = - sum [e(xk-m)]^2 + sum ||xk-m||^ = - sum e(xk-m)(xk-m)e + sum ||xk-m||^ So we need to maximize eSe subject to ||e|| = 1. We do this with Lagrange multipliers. We set: U = eSe – lambda (ee – 1), differentiate w.r.t. e and set this to 0. We get: partial u/ partial e = 2Se – 2 lambda e, so Se = lambda e. So e is an eigenvector of S, and we can see that eSe is maximized when e is the eigenvector associated with the largest eigenvalue.

Fisher Linear Discriminant

PCA
LDA

LDA for comparison

  • Capture variations that distinguish

different objects, throw away variations that don’t.

Linear Combinations

1 2

1 2

1 2

2 , 1 2 , 2 2 , 3

1 , 1 1 , 2 1 , 3

2 2 2 , 3

2 2 , 2

2 2 , 1

2 2 1 , 3

2 1 , 2

2 1 , 1

1 1 2 , 3

1 2 , 2

1 2 , 1

1 1 1 , 3

1 1 , 2

1 1 , 1

1 2

1 2

2 2 2

2 1

2 2 2

2 1

1 1 2

1 1

1 1 2

1 1

n

n

n

m y

m m m

m x

m m m

y

x

y

x

m n

m m

m n

m m

n

n

n

n

z z z

y y y

x x x

s s s t

s s s t

s s s t

s s s t

s s s t

s s s t

v v v

u u u

v v v

u u u

v v v

u u u

I S P

Immediately apparent that u and v coordinates lie in a 4D linear subspace