Information Gain (cont)
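[Figure: attribute A splits set S into one branch per value v1, v2, ..., vk; each branch yields a subset S′, and the procedure repeats recursively on each subset.]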
Information gain has the disadvantage that it prefers
attributes with a large number of values, because such
attributes split the data into many small, pure subsets.
S′ = {s ∈ S | value(A) = v1}
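
The preference for many-valued attributes can be seen in a small sketch. The Python below is an illustration only, not code from the slides: the function names `entropy` and `information_gain`, the toy rows, and the attribute names `id`, `windy`, and `play` are all hypothetical. It partitions S into one subset per attribute value, as in the definition of S′, and compares the gain of a unique-per-row attribute with that of a two-valued one.

```python
from collections import Counter, defaultdict
from math import log2


def entropy(labels):
    """Shannon entropy, in bits, of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(rows, attribute, target):
    """Entropy of S minus the size-weighted entropy of its subsets.

    Each subset is S_v = {s in S | s[attribute] == v}.
    """
    subsets = defaultdict(list)
    for row in rows:
        subsets[row[attribute]].append(row[target])
    all_labels = [row[target] for row in rows]
    remainder = sum(len(sub) / len(rows) * entropy(sub) for sub in subsets.values())
    return entropy(all_labels) - remainder


# Hypothetical toy data: "id" is unique per row, "windy" has two values.
rows = [
    {"id": 1, "windy": "yes", "play": "no"},
    {"id": 2, "windy": "yes", "play": "no"},
    {"id": 3, "windy": "yes", "play": "yes"},
    {"id": 4, "windy": "no", "play": "yes"},
    {"id": 5, "windy": "no", "play": "yes"},
    {"id": 6, "windy": "no", "play": "no"},
]

gain_id = information_gain(rows, "id", "play")
gain_windy = information_gain(rows, "windy", "play")
print(f"gain(id)    = {gain_id:.3f}")
print(f"gain(windy) = {gain_windy:.3f}")
```

Splitting on `id` puts every example in its own subset, so each subset is pure and the gain reaches the full entropy of S (1 bit here), even though `id` says nothing about unseen examples. The two-valued `windy` attribute leaves mixed subsets and scores far lower.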