machine learning databases
Thanos Kehagias
kehagias at eng.auth.gr
Wed Sep 21 10:29:05 EDT 1994
on the subject of machine learning databases, here is a request. if one
has a pointer to the kind of data i need (see next paragraph),
or if the people setting up databases now would like to consider
including this kind of data, i will be most grateful.
i am looking at the problem of Time Series Classification. in other
words, there is a number of possible sources, each producing a time
series. previous instances of these time series have been observed,
either labelled (supervised learning) or unlabelled (unsupervised
learning). now a new time series is observed and one wants to decide
which source generated it.
there is a lot of algorithms in the literature that do this kind of thing
(i have some of my own, even) and everyone seems to be using their own
example problem and/or dataset. a classical example of this problem
is, of course, phoneme recognition. what i have not been able to find
is some standard datasets to be used for benchmarks (e.g. sonar,
radar signals, EEG, ECG data and so on). i mean the raw time series,
not some kinf of preprocessed data where the whole time series is
reduced to a static features vector. does anyone know of any such data
in the public domain (except speech data)? i think this would be
a useful benchmark, and the kind of thing that i have not seen in, for
instance, the uci collection.
Thanks a lot,
Thanasis
More information about the Connectionists
mailing list