Consider the sign language domain. What if, when we record the original data, each stream corresponds not to a single sign, but to a sentence (i.e. a sequence) of signs? Then there would not be a simple association of one stream to a class, but from a stream to a sequence of classes. This would be an example of a strong classification problem.
A formal definition of strong classification is a modification of the
definition of weak classification: Rather than have a function
, we have a function
which
has the following type:
In other words,
returns a sequence of classes for
a given stream, rather than a single class.
Our problem can be restated as: given a subset of the function
(say
), produce a function
which is a similar to
as possible.
If there are
possible classes, and we assume that for each stream,
the longest possible class sequence is
, then in general there are
possible class sequences; because there could
be anywhere between 1 and n classes in the class sequence.
Theoretically, therefore, the two variants are equivalent; as each
sentence could be treated as a single label. This is not practical, of
course, and generally, strong temporal classification is harder than
weak temporal classification.