A side effect of region segmentation is that all regions are accepted
as synthetic features. Consider Figure A.3.
Consider now if one centroid
is placed at (5, 0.35),
at
(10, 0.1) and
at (40,-0.1).
It is clear that the centroids
and
make good synthetic
features. But
, although important for a good segmentation
(indeed, it is necessary to define the concept boundaries), does not
itself make a useful synthetic feature. This is because the region
around
contains a mix of instances from different clases, hence
its usefulness as a discriminative attribute is not likely to be high.
However, without
, all of the points formerly considered as
associated with
would now be associated with
, including
the mixed class area of centred around (40, -0.1). This would make
a less useful discriminative features. Hence,
is useful
overall for providing a highly discriminative features - in
particular making
more discriminative - but
itself is not
a discriminative feature.
If the learner we are using were smart enough, this synthetic feature
should be eliminated. Quinlan points out [Qui93] that the
more attributes that are given to the learner, the greater the
probability that amongst them there will be one attribute that
performs highly on the train data but poorly on the test data. Hence,
if we can remove
before it gets to the learner, it would likely
improve our classification performance. Using fewer features may also
aid in the production of more comprehensible rules, since it would
likely lead to shallower trees.