Questions and Answers
What is a limitation of improved algorithms in clustering?
No theoretical guarantees
What type of distances can k-means be used with?
Bregman divergences
What is the connection between cosine similarity and squared Euclidean distance?
They are closely connected: for vectors normalized to unit length, ||x − y||² = 2 − 2·cos(x, y), so minimizing the squared Euclidean distance is equivalent to maximizing the cosine similarity.
What does Spherical k-means do with input data?
It normalizes the input data (and the cluster centers) to unit length.
How does Spherical k-means maximize the average cosine similarity?
By working with unit-length data and centers: each point is assigned to the center with the highest cosine similarity (on the unit sphere, this is also the least-squares closest center), and each center is then recomputed as the normalized mean of its cluster.
Why can the solution found by the standard k-means algorithm be arbitrarily poor?
Because the algorithm only converges to a local optimum; with an unlucky initialization, the local optimum it reaches can have an arbitrarily higher sum of squares than the best possible clustering.
What is the main issue with using subtraction in computations for clustering?
Subtracting large, nearly equal values is numerically unstable, so formulations that avoid subtracting the mean from each point (such as the pairwise sum of squared deviations) can be preferable in computations.
What is the importance of pairwise squared deviations in clustering?
The pairwise sum of squared deviations equals, up to a factor of 2n, the sum of squared deviations from the mean, so the k-means objective can be expressed without referring to the mean at all, which is useful in proofs.
Why is the standard algorithm for k-means considered inefficient?
Because every iteration recomputes the distance from every point to every center, even though most assignments do not change; improved variants avoid much of this reassignment work.
What is the significance of MacQueen's algorithm in the context of clustering?
MacQueen (1967) introduced the term "k-means" and an algorithm that updates the cluster mean immediately after each point is assigned, rather than once per full pass over the data.
How does the arithmetic mean relate to the binomial expansion in clustering?
Expanding the squared deviations with the binomial formula shows that the sum of squared deviations from the arithmetic mean equals the pairwise sum of squared deviations divided by 2n, so the objective can be evaluated without subtracting the mean from each point.
What is the main goal of the k-means clustering algorithm?
To partition the data into k subsets, each represented by its arithmetic mean, so that the sum of squared deviations of the points from their cluster means is as small as possible.
Why does the k-means algorithm use squared errors instead of other distance metrics?
Squared errors put more weight on larger deviations, and the arithmetic mean used as the cluster representative is exactly the minimizer of the sum of squared errors (and the maximum likelihood estimator of centrality under normally distributed errors).
What type of problem is k-means clustering considered?
A non-convex (and NP-hard) combinatorial optimization problem.
Explain why k-means clustering is suitable for signals with normally distributed measurement errors.
Because the arithmetic mean is the maximum likelihood estimator of the underlying value when measurement errors are independent and normally distributed, so minimizing squared errors corresponds to maximum likelihood estimation.
What theorem is attributed to König, Huygens, and Steiner in the context of clustering?
The variance decomposition (translation) theorem: the sum of squared deviations from an arbitrary point equals the sum of squared deviations from the mean plus n times the squared distance between that point and the mean.
Why is it important to assign every point to its least-squares closest cluster in k-means clustering?
Because each such reassignment can only reduce (or keep) the sum of squares, which guarantees that the objective decreases monotonically and that the algorithm converges.
Study Notes
k-Means Clustering
- Rewriting the sum of squared deviations via the binomial expansion (so that the mean is no longer subtracted from each point) is numerically unstable in computations, but useful in proofs.
- The pairwise sum of squared deviations is proportional (by a factor of 2n) to the sum of squared deviations from the mean, which the arithmetic mean minimizes.
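For reference, the standard identities behind these two bullets, with μ = (1/n)·Σᵢ xᵢ the arithmetic mean of x₁, …, xₙ, are:

$$
\sum_{i=1}^{n} \lVert x_i - \mu \rVert^2
\;=\; \sum_{i=1}^{n} \lVert x_i \rVert^2 - n\,\lVert \mu \rVert^2
\;=\; \frac{1}{2n}\sum_{i=1}^{n}\sum_{j=1}^{n} \lVert x_i - x_j \rVert^2 .
$$

The middle form is the numerically unstable one (it subtracts two large, nearly equal quantities); the pairwise form on the right avoids the mean entirely, which is what makes it convenient in proofs.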
The Standard Algorithm (Lloyd's Algorithm)
- The standard algorithm for k-means is not the most efficient algorithm despite being widely taught.
- There are more than 12 variants of the algorithm; ELKI, for example, is a framework that implements several of them.
- Improved algorithms focus on reducing the number of computations for reassignment, but often lack theoretical guarantees.
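For concreteness, here is a minimal sketch of the standard (Lloyd-style) alternation of assignment and mean-update steps, assuming NumPy and a simple random initialization; it is the textbook version, not one of the accelerated variants mentioned above.

```python
import numpy as np

def lloyd_kmeans(X, k, n_iter=100, seed=0):
    """Plain Lloyd-style k-means: alternate assignment and mean update."""
    rng = np.random.default_rng(seed)
    # Initialize centers by picking k distinct points at random.
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        # Assignment step: each point goes to its squared-Euclidean-closest center.
        dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        # Update step: each center becomes the arithmetic mean of its cluster
        # (empty clusters simply keep their previous center).
        new_centers = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break  # converged: centers (and hence assignments) no longer change
        centers = new_centers
    return labels, centers
```

Every iteration recomputes all n·k point-to-center distances, which is exactly the work the improved reassignment schemes try to avoid repeating.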
k-Means for Text Clustering
- k-means cannot be used with arbitrary distances, only with Bregman divergences.
- Spherical k-means uses normalized input data and centers, maximizing the average cosine similarity (equivalently, minimizing the average cosine dissimilarity).
- Spherical k-means uses sparse nearest-centroid computations and can be accelerated using stored bounds.
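A minimal sketch of the spherical variant under the same assumptions (NumPy, dense arrays for readability; a real text-clustering implementation would operate on sparse vectors and use the stored-bound accelerations mentioned above):

```python
import numpy as np

def spherical_kmeans(X, k, n_iter=50, seed=0):
    """Spherical k-means sketch: unit-length data and centers,
    assignment by highest cosine similarity (dot product)."""
    rng = np.random.default_rng(seed)
    # Normalize every input vector to unit length.
    X = X / np.linalg.norm(X, axis=1, keepdims=True)
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        # On unit vectors, the largest dot product is the largest cosine
        # similarity, and also the smallest squared Euclidean distance.
        labels = (X @ centers.T).argmax(axis=1)
        for j in range(k):
            members = X[labels == j]
            if len(members):
                m = members.sum(axis=0)
                centers[j] = m / np.linalg.norm(m)  # re-normalize the mean
    return labels, centers
```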
Limitations of k-Means
- The solution found by the standard k-means algorithm can be arbitrarily poor.
- In the worst case, a k-means solution can be arbitrarily worse than the best solution.
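An illustrative construction, not taken from the notes above, that makes this concrete: four points at the corners of a long, thin rectangle with k = 2. The left/right split is optimal, while the bottom/top split is a fixed point of the standard algorithm whose cost grows without bound as the rectangle is stretched.

```python
import numpy as np

def cost(clusters):
    """Within-cluster sum of squared deviations from the cluster means."""
    return sum(((c - c.mean(axis=0)) ** 2).sum() for c in clusters)

D = 100.0  # stretch factor; the cost ratio below grows like D**2
pts = np.array([[0, 0], [0, 1], [D, 0], [D, 1]], dtype=float)

good = [pts[:2], pts[2:]]          # split left/right: cost = 4 * 0.25 = 1
bad  = [pts[[0, 2]], pts[[1, 3]]]  # split bottom/top: cost = 4 * (D/2)**2
# The "bad" partition is stable under the standard algorithm: every point is
# already closest to its own cluster mean, yet its cost is D**2 times worse.
print(cost(good), cost(bad), cost(bad) / cost(good))
```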
Properties of k-Means Clustering
- k-means divides data into k subsets, each represented by its arithmetic mean.
- Squared errors put more weight on larger deviations.
- Arithmetic mean is the maximum likelihood estimator of centrality.
- Data is split into Voronoi cells.
- k-means is a non-convex problem.
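Written out, the optimization problem these bullets describe is the following (standard notation; C₁, …, C_k are the clusters and μ_j their arithmetic means):

$$
\min_{C_1,\dots,C_k}\;\sum_{j=1}^{k}\sum_{x_i \in C_j} \lVert x_i - \mu_j \rVert^2,
\qquad \mu_j = \frac{1}{\lvert C_j \rvert}\sum_{x_i \in C_j} x_i .
$$

For fixed centers the optimal assignment is the Voronoi partition; for a fixed partition the optimal centers are the arithmetic means; jointly, the problem is non-convex.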
The Sum of Squares Objective
- The sum-of-squares objective is minimized by the arithmetic mean.
- Assigning every point to its least-squares closest cluster reduces the sum of squares.
- Minimizing the sum of squares is the same as minimizing the squared Euclidean distances of the points to their cluster means.
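The first bullet follows from the translation theorem attributed to König, Huygens, and Steiner (stated here for reference): for any candidate center c,

$$
\sum_{i=1}^{n} \lVert x_i - c \rVert^2
\;=\; \sum_{i=1}^{n} \lVert x_i - \mu \rVert^2 + n\,\lVert c - \mu \rVert^2
\;\ge\; \sum_{i=1}^{n} \lVert x_i - \mu \rVert^2 ,
$$

so the sum of squares is minimized exactly when c is the arithmetic mean μ.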
Historical Note
- The method of least squares estimation goes back to Legendre and Gauss.
Description
Explore the concept of the pairwise sum of squared deviations and the standard algorithm for k-means. Learn how the arithmetic mean minimizes squared deviations from the mean, and understand the implications of numerically unstable subtraction in computations.