22
Information Gain (cont)
Attribute A
v1
vk
v2
Set S
Set S
¢
repeat
recursively
Information gain has the disadvantage that it prefers
attributes with large number of values that split the
data into small, pure subsets.
S
¢
={s
Î
S | value(A)=v1}