Basic questions
•what’s
the probability that a point x is in cluster m?
•what’s
the shape of each cluster?
K-means doesn’t answer these questions
Basic idea
•instead
of treating the data as a bunch of points, assume that
they are all generated by sampling a continuous function
•This
function is called a generative model
–defined
by a vector of parameters θ
•