paper on ICA and LOCOCODE
Josef Hochreiter
hochreit at informatik.tu-muenchen.de
Mon Sep 21 11:06:47 EDT 1998
Feature extraction through LOCOCODE
Sepp Hochreiter, TUM Juergen Schmidhuber, IDSIA
Neural Computation, in press (28 pages, 0.7MB, 5MB gunzipped)
LOw-COmplexity COding and DEcoding (LOCOCODE) is a novel approach to
sensory coding and unsupervised learning. Unlike previous methods it
explicitly takes into account the information-theoretic complexity of
the code generator: it computes lococodes that convey information
about the input data and can be computed and decoded by low-complexity
mappings. We implement LOCOCODE by training autoassociators with Flat
Minimum Search (Neural Computation 9(1):1-42, 1997), a general method
for discovering low-complexity neural nets. It turns out that this
approach can unmix an unknown number of independent data sources by
extracting a minimal number of low-complexity features necessary for
representing the data. Experiments show: unlike codes obtained with
standard autoencoders, lococodes are based on feature detectors, never
unstructured, usually sparse, sometimes factorial or local (depending
on statistical properties of the data). Although LOCOCODE is not
explicitly designed to enforce sparse or factorial codes, it extracts
optimal codes for difficult versions of the bars benchmark problem,
whereas ICA and PCA do not. It produces familiar, biologically
plausible feature detectors when applied to real world images, and
codes with fewer bits per pixel than ICA and PCA. Unlike ICA it does
not need to know the number of independent sources. As a preprocessor
for a vowel recognition benchmark problem it sets the stage for
excellent classification performance. Our results reveil an
interesting, previously ignored connection between two important
fields: regularizer research, and ICA-related research. They may
represent a first step towards unification of regularization and
unsupervised learning.
ftp://ftp.idsia.ch/pub/juergen/lococode.ps.gz
ftp://flop.informatik.tu-muenchen.de/pub/articles-etc/
hochreiter.lococode.ps.gz
http://www7.informatik.tu-muenchen.de/~hochreit/pub.html
http://www.idsia.ch/~juergen/onlinepub.html
Conference spin-offs:
1. Low-complexity coding and decoding. In K. M. Wong, I. King, D.
Yeung, eds., Proc. TANC'97, 297-306, Springer, 1997.
2. Unsupervised coding with LOCOCODE. In W. Gerstner, A. Germond, M.
Hasler, J.-D. Nicoud, eds., ICANN'97, 655-660, Springer, 1997.
3. LOCOCODE versus PCA and ICA. In L. Niklasson, M. Boden, T. Ziemke,
eds., ICANN'98, 669-674, Springer, 1998.
4. Source separation as a by-product of regularization.
To be presented at NIPS'98, 1998.
Sepp & Juergen
More information about the Connectionists
mailing list