Probabilistic clustering
Basic questions
what’s the probability that a point x is in cluster m?
what’s the shape of each cluster?
K-means doesn’t answer these questions: it only assigns each point to its nearest cluster center
Basic idea
instead of treating the data as just a bunch of points, assume
that they are all generated by sampling from a continuous probability distribution
This distribution is called a generative model
defined by a vector of parameters θ
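
To make the idea concrete, here is a minimal sketch of one possible generative model. It assumes a 1-D mixture of two Gaussians, which is an illustrative choice and not something fixed by the notes above. The names theta, sample, and cluster_probabilities are hypothetical. The sketch shows how θ generates data and how it yields the probability that a point x is in cluster m.

```python
import numpy as np

# Hypothetical parameter vector θ for a 1-D mixture of two Gaussians.
# The Gaussian form and these values are assumptions made for illustration.
theta = {
    "weights": np.array([0.5, 0.5]),  # prior probability of each cluster
    "means": np.array([0.0, 5.0]),    # center of each cluster
    "stds": np.array([1.0, 1.0]),     # spread (shape) of each cluster
}

def sample(theta, n, rng):
    """Generate n points: pick a cluster, then draw from its Gaussian."""
    m = rng.choice(len(theta["weights"]), size=n, p=theta["weights"])
    x = rng.normal(theta["means"][m], theta["stds"][m])
    return x, m

def gaussian_pdf(x, mean, std):
    """Density of a 1-D Gaussian evaluated at x."""
    return np.exp(-0.5 * ((x - mean) / std) ** 2) / (std * np.sqrt(2.0 * np.pi))

def cluster_probabilities(x, theta):
    """P(cluster m | x) for every m, computed with Bayes' rule."""
    joint = theta["weights"] * gaussian_pdf(x, theta["means"], theta["stds"])
    return joint / joint.sum()

rng = np.random.default_rng(0)
x, m = sample(theta, 1000, rng)                # data generated by the model
probs = cluster_probabilities(x[0], theta)     # soft assignment of one point
print(probs)
```

Unlike a K-means assignment, the returned vector is a soft membership: its entries sum to 1, and a point halfway between the two means gets roughly equal probability for both clusters.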