



Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
The k-means clustering algorithm and its limitations, specifically its tendency to converge to local minima. It then introduces spectral relaxation, a method for formulating the sum-of-squares minimization problem in k-means as a trace maximization problem with special constraints. This relaxation leads to optimal global solutions and improves the clustering process. The document also includes mathematical derivations and references to related work.
Typology: Study notes
1 / 5
This page cannot be seen from the preview
Don't miss anything!




Instructor: Jieping Ye
πj = {v | xv belongs to cluster j}.
cj =
nj
∑ v∈πj
xv,
where nj is the number of elements in πj.
∑
v∈πj
||xv − cj ||^2.
∑^ k
j=
∑ v∈πj
||xv − cj ||^2.
cj =
nj
∑ v∈πj
xv,
where nj is the number of elements in πj.
min Π Q(Π) ≥ trace(XT^ X) − max Y T^ Y =Ik
trace
( Y T^ XT^ XY
min ∑{m,n}
i=k+
σ i^2 (X),
where σi(X) is the i-th largest singular value of X.
Xˆ = [c 1 , c 2 , · · · , ck] I
where I ∈ IRk×n^ indicates the cluster membership. More specifically, Iij = 1, if xj belongs to the i-th cluster πi and Iij = 0 otherwise. Denote C = [c 1 , c 2 , · · · , ck].
Z∗^ =
( CT^ C
)− 1 CT^ X.
( RT^ R
)− 1 RT^ QT^ X. It follows that X˜k = CZ∗^ = QR
( RT^ R
)− 1 RT^ QT^ X = QQT^ X. Here we assume that R is nonsingular, i.e., the k cluster centroids in C are linearly independent.