We can calculate the average information value of these, taking into account the
number of instances that go down each branch—five down the first and third
and four down the second:This average represents the amount of information that we expect would be nec-
essary to specify the class of a new instance, given the tree structure in Figure
4.2(a).info([][][]2 3 4 0 3 2)=(5 14)¥+0 971 (4 14)¥+ 0 (5 14)¥=0 971 0 693,, ,, ,bits....info bits
info bits
info bits2 3 0 971
40 00
3 2 0 971,.
,.
,.([])=
([])=
([])=98 CHAPTER 4| ALGORITHMS: THE BASIC METHODS
yes
yes
no
no
nosunnyyes
yes
yes
yesovercastyes
yes
yes
no
norainyoutlook(a)yes
yes
yesno
nohotyes
yes
no
noyes
yes
yes
nomild cooltemperatureyes(b)yes
yes
yes
no
no
no
noyes
yes
yes
yes
yes
yes
nohigh normalhumidity(c)yes
yes
yes
yes
yes
yes
no
noyes
yes
yes
no
no
nofalse truewindy(d)
Figure 4.2Tree stumps for the weather data.