Expectation Maximization (EM)
A probabilistic variant of K-means:
E step: “soft assignment” of points to clusters
estimate probability that a point is in a cluster
M step:  update cluster parameters
mean and variance info (covariance matrix)
maximizes the likelihood of the points given the clusters
Forsyth Chapter 16 (optional)