The first artificial domain we will be considering is a simple one, but one that has been used and explored by other researchers.
The artificial cylinder-bell-funnel task was originally
proposed by Saito [Sai94], and further worked on by
Manganaris [Man97]. The task is to classify a stream as
one of three classes, cylinder (
), bell (
) or funnel (
).
Samples are generated as follows:
![]() |
where
and
are drawn from a standard normal
distribution
,
is an integer drawn uniformly from
and
is an integer drawn uniformly from
.
Figure 6.1 shows
instances of each class. The cylinder class is characterised
by a plateau from time
to
, the bell class by a
gradual increase from
to
followed by a sudden decline and the
funnel class by a sudden increase at
and a gradual
decrease until
. Although univariate (i.e., only has one channel)
and of a fixed length (128 frames), the CBF task attempts to
characterise some of the typical properties of temporal domains.
Firstly, there is random amplitude variation as a result of the
in the equation. Secondly, there is random noise (represented by the
). Thirdly, there is significant temporal variation in
both the start of events (since
can vary from 16 to 32) and the
duration of events (since
can vary from 32 to 96).
266 instances of each class were generated. For ease of comparison with previous results, 10-fold cross validation was used.
We have previously looked at the cylinder-bell-funnel domain [Kad99]. In that work, we compared an earlier version of TClass (see Appendix A) against naive segmentation and showed that we performed significantly better. Table 6.1 shows the table of results from that work.
|
As an initial test, we ran all the learners, with and without smoothing. In all tables, the bounds represent the standard error of the mean (the standard deviation divided by the number of of folds).
The above table shows the best of the performers for the naive segmentation and hidden Markov models. The complete results for naive segmentation and HMMs for unsmoothed classification are shown in Tables 6.3 and 6.4.
| |||||||||||||||||||||||||||||||||||
The results are very surprising. For example, one surprise is the performance of the baseline algorithms: nowhere else in the literature has their performance been compared. It's surprising therefore to see them perform amongst the best and they even do better than previously published results (including our own, which were previously the best).
It turns out that the dataset may be too simple to test the capabilities of our system. This makes it difficult to evaluate accuracy and other phenomena. Hence we need to construct a dataset that is (a) more difficult and (b) more typical of the domains that we are interested in.
Let us suggest why the two controls performed so well - they are
capturing the natural aggregate nature of the data. There is only one
real event in each of the three classes; and all of these events are
of extended duration - at a bare minimum the one event characterises
itself over 25 per cent of the data (32 frames) and possibly as much
as 75 per cent of the data (96 frames). On average, the one event
therefore occupies 50 per cent of the channel. Although the
underlying signal is masked by the noise; the noise has a Gaussian
distribution: exactly the type of noise that aggregate approaches
(such as naive segmentation and HMMs) are designed to exclude.
Further investigation reveal an interesting result in the HMMs: the
self-transition probability for each state was almost always
approximately 0.8. This means the hidden Markov model spends
approximately the same time in each state. Some simple mathematics
show that if
we stay in each state for about 7 time
steps, very close to the size of a segment used by the naive segmenter
(6.25 timesteps). It seems that the HMM defaulting to a naive
segmentation approach: equal size regions; with a mean and an average
for each region functioning as a probabilistic model.
The above results might lead one to despair of the performance of TClass, since it doesn't appear to compete that well in terms of accuracy. However, if one's objective becomes the relentless pursuit of accuracy, then TClass can be used with voting to improve accuracy. The above experiment was therefore repeated with 10 repetitions of TClass with an AdaBoost backend, and the outcomes voted. 100 per cent accuracy was obtained. Indeed, 100 per cent 10-fold cross-validated accuracy was obtained with as few as 3 voters - the minimum number of possible voters.