Clustering Algorithms Overview, Lecture Notes - Computer Science, Study notes of Network Theory

<p class="MsoNormal" style="margin: 0in 0in 10pt"><font color="#000000"><font face="Calibri">Prof. David C Parkes, Computer Science, Clustering, K-means, Hierarchical Agglomerative Clustering, HAC, Probabilistic Methods, maximum likelihood, Harvard, Lecture Notes<p></p></font></font></p>

Typology: Study notes

2010/2011

Uploaded on 10/25/2011

thecoral
thecoral 🇺🇸

4.5

(30)

395 documents

1 / 73

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
CS181 Lecture 9: Clustering
(and intro to Prob methods)
David C. Parkes
pf3
pf4
pf5
pf8
pf9
pfa
pfd
pfe
pff
pf12
pf13
pf14
pf15
pf16
pf17
pf18
pf19
pf1a
pf1b
pf1c
pf1d
pf1e
pf1f
pf20
pf21
pf22
pf23
pf24
pf25
pf26
pf27
pf28
pf29
pf2a
pf2b
pf2c
pf2d
pf2e
pf2f
pf30
pf31
pf32
pf33
pf34
pf35
pf36
pf37
pf38
pf39
pf3a
pf3b
pf3c
pf3d
pf3e
pf3f
pf40
pf41
pf42
pf43
pf44
pf45
pf46
pf47
pf48
pf49

Partial preview of the text

Download Clustering Algorithms Overview, Lecture Notes - Computer Science and more Study notes Network Theory in PDF only on Docsity!

CS181 Lecture 9: Clustering

(and intro to Prob methods)

David C. Parkes

Outline

  • Clustering
    • K -means
    • Hierarchical Agglomerative Clustering (HAC)
  • Introduction to Probabilistic Methods
    • classification
    • clustering
    • maximum likelihood

2

Today: Clustering

  • Clustering : discover groups of similar examples - for dimensionality reduction : project the data down to two or three dimensions - to understand the data - as a way to preprocess a lot unlabeled data, to use for supervised learning

4

Clustering

5

How do we Evaluate Clustering

Algorithms?

  • Eyeball the clusters to see if they make intuitive sense?
  • Problem: subjective
  • Identifying a useful objective is the key challenge in clustering! 10

Possible Objective Criterion

  • Intuition: a cluster is a group of examples with small inter-example distances compared to distances to other examples

11

K-means Algorithm

  • Choose initial values for ¹ 1 , …, ¹K
  • [E] Minimize Err with respect to rik, keeping prototypes ¹ fixed Set rik = 1 for k that minimizes ||xi - ¹k||^2
  • [M] Minimize Err with respect to {¹ 1 , …, ¹K},

keeping assignments rik fixed Set  Err /  ¹k = 0 ¹k = i rik x i / i rik

  • Repeat until convergence (^14)

K-means Algorithm

  • Choose initial values for ¹ 1 , …, ¹K
  • [E] Minimize Err with respect to rik, keeping prototypes ¹ fixed Set rik = 1 for k that minimizes ||xi - ¹k||^2
  • [M] Minimize Err with respect to {¹ 1 , …, ¹K},

keeping assignments rik fixed

¹k = i rik x i / i rik

  • Repeat until convergence (^15)

K-means Algorithm

  • Choose initial values for ¹ 1 , …, ¹K
  • [E] Minimize Err with respect to rik, keeping prototypes ¹ fixed Set rik = 1 for k that minimizes ||xi - ¹k||^2
  • [M] Minimize Err with respect to {¹ 1 , …, ¹K},

keeping assignments rik fixed Set  Err /  ¹k = 0 ¹k = i rik x i / i rik [just centoid!]

  • Repeat until convergence [+ restart!] (^17)
  • Yellowstone National Park, Wyoming
  • 272 data points
    • duration of eruption (y-axis)
    • duration to next eruption (x)
    • standardized

(Wikipedia)^19

Example

21

Sum squared error in K-means; E (blue), M (red)

(Bishop)

22

(Bishop; K-means image segmentation)