next up previous contents
Next: Comprehensibility - A subjective Up: Assessing success Previous: Assessing Accuracy of Weak   Contents


Assessing Accuracy of Strong TC

Strong TC is a little harder to assess, because the types of error that can occur are more complex.

One possible initial definition would be the following. The classification for an unseen stream is correct if $ \ensuremath{\mathit{classseq}}(U) =
\ensuremath{\mathit{classseq}}_P(U)$; i.e. it is correct if and only if the predicted and actual class sequences are identical.

However, in many domains this is over-simplistic. If there are two classes $ a$ and $ b$, then the class sequence $ aaabab$ is in some sense ``closer'' to $ aabab$ than $ bbbbba$ is, even though the above classification system would consider them both equally wrong.

To solve this problem, the notion of edit distance is introduced. In such systems, strings are not either ``correct'' or ``incorrect'', but some strings are closer to other strings. One such measure is the Levenshtein distance [Lev66]. The Levenshtein distance between strings $ x$ and $ y$ is the minimum number of differences between the two strings. These differences can take three forms:

Of course, there is more than one way to get from one string to another. For example, a substitution can be thought of as a deletion and an insertion. However, the Levenshtein distance is defined as the minimum number of changes to go from one string to the other.

The Levenshtein distance measure can be used on class sequences. Each class maps to a symbol in the alphabet, and each class sequence maps to a string. Other distance measures may be appropriate for different domains.

Again, we should give greater weight to those elements of the stream set which are likely to occur. The strong temporal classification task can now be defined as minimising:

$\displaystyle \sum_{S \in \ensuremath{\mathit{SS}}} \ensuremath{\mathit{LevDist...
...seq}}(S),\ensuremath{\mathit{classseq}_P}(S))\ P_{\ensuremath{\mathit{SS}}}(S)
$


next up previous contents
Next: Comprehensibility - A subjective Up: Assessing success Previous: Assessing Accuracy of Weak   Contents
Mohammed Waleed Kadous 2002-12-10