 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
Basic questions
|
|
|
|
• |
what’s the
probability that a point x is in cluster m?
|
|
|
|
• |
what’s the shape
of each cluster?
|
|
|
K-means doesn’t
answer these questions
|
|
|
Basic idea
|
|
|
|
• |
instead of
treating the data as a bunch of points, assume
|
|
|
that they are
all generated by sampling a continuous function
|
|
|
• |
This function is
called a generative model
|
|
|
|
– |
defined by a
vector of parameters θ
|
|