Two papers on information transfer / problem decomposition
Lorien Y. Pratt
pratt@paul.rutgers.edu
Fri Apr 5 17:08:52 EST 1991
The following two papers are now available via FTP from the neuroprose
archives. The first is for AAAI-91, so it is written for an AI/machine
learning audience; the second is for IJCNN-91, so it is more neural
network-oriented. There is some overlap between them: the AAAI paper
reports briefly on the study described in more detail in the IJCNN
paper. Instructions for retrieval are at the end of this message.
--Lori
#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@#@
Direct Transfer of Learned Information Among Neural Networks
To appear: Proceedings of AAAI-91
Lorien Y. Pratt and Jack Mostow and Candace A. Kamm
Abstract
A touted advantage of symbolic representations is the ease of
transferring learned information from one intelligent agent to
another. This paper investigates an analogous problem: how to use
information from one neural network to help a second network learn a
related task. Rather than translate such information into symbolic
form (in which it may not be readily expressible), we investigate the
direct transfer of information encoded as weights.
Here, we focus on how transfer can be used to address the important
problem of improving neural network learning speed. First we present
an exploratory study of the somewhat surprising effects of pre-setting
network weights on subsequent learning. Guided by hypotheses from this
study, we sped up back-propagation learning for two speech recognition
tasks. By transferring weights from smaller networks trained on
subtasks, we achieved speedups of up to an order of magnitude compared
with training starting with random weights, even taking into account
the time to train the smaller networks. We include results on how
transfer scales to a large phoneme recognition problem.
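For readers who want a concrete picture of the transfer idea, here is a
minimal numpy sketch: a small network trained on a subtask seeds a
sub-block of a larger network's weight matrices before back-propagation
begins. The layer sizes, variable names, and block-copy scheme below are
illustrative assumptions, not the paper's actual code.

    # Minimal sketch of direct weight transfer (illustrative, not the
    # authors' code). A small single-hidden-layer net's trained weights
    # seed the corresponding block of a larger net's weight matrices.
    import numpy as np

    rng = np.random.default_rng(0)

    # Hypothetical sizes: small net (in=16, hidden=8, out=4),
    # large net (in=16, hidden=20, out=10).
    W1_small = rng.normal(0, 0.1, (8, 16))   # stands in for trained weights
    W2_small = rng.normal(0, 0.1, (4, 8))

    # Start the large net from small random weights...
    W1_large = rng.uniform(-0.1, 0.1, (20, 16))
    W2_large = rng.uniform(-0.1, 0.1, (10, 20))

    # ...then overwrite the leading block with the transferred weights,
    # so the large net starts from the subtask solution.
    W1_large[:8, :] = W1_small
    W2_large[:4, :8] = W2_small

Back-propagation on the full task then proceeds as usual, but from this
informed starting point rather than from purely random weights.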
@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@%@
Improving a Phoneme Classification Neural Network
through Problem Decomposition
To appear: Proceedings of IJCNN-91
L. Y. Pratt and C. A. Kamm
Abstract
In the study of neural networks, it is important to determine whether
techniques that have been validated on smaller experimental tasks can
be scaled to larger real-world problems. In this paper we discuss how
a methodology called "problem decomposition" can be applied to
AP-net, a neural network for mapping acoustic spectra to phoneme
classes. The network's task is to recognize phonemes from a large
corpus of multiple-speaker, continuously spoken sentences. We review
previous AP-net systems and present results from a decomposition study
in which smaller networks trained to recognize subsets of phonemes are
combined into a larger network for the full signal-to-phoneme mapping
task. We show that, by using this problem decomposition methodology,
comparable performance can be obtained with significantly fewer
arithmetic operations.
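To make the decomposition step concrete, here is a minimal numpy sketch
of one way subnetworks trained on disjoint phoneme subsets can be merged
into a single larger network: hidden units are stacked side by side and
the output weights are placed block-diagonally. The architecture, sizes,
and names are assumptions for illustration, not AP-net's actual code.

    # Minimal sketch of combining subnetworks (illustrative assumptions,
    # not AP-net's implementation).
    import numpy as np

    rng = np.random.default_rng(1)

    def sub_net(n_in, n_hid, n_out):
        """Stand-in for a trained subnetwork's weight matrices."""
        return (rng.normal(0, 0.1, (n_hid, n_in)),
                rng.normal(0, 0.1, (n_out, n_hid)))

    n_in = 64                          # hypothetical spectral input size
    W1_a, W2_a = sub_net(n_in, 10, 5)  # subset A: 5 phoneme classes
    W1_b, W2_b = sub_net(n_in, 10, 6)  # subset B: 6 phoneme classes

    # Combined hidden layer: all subnetwork hidden units side by side.
    W1 = np.vstack([W1_a, W1_b])       # shape (20, 64)

    # Combined output layer: block-diagonal, so each class initially
    # depends only on its own subnetwork's hidden units.
    W2 = np.zeros((5 + 6, 10 + 10))
    W2[:5, :10] = W2_a
    W2[5:, 10:] = W2_b

Starting the combined net this way preserves each subnetwork's learned
subset mapping; further training can then fill in the zero cross-blocks,
which is how the full mapping can be reached with less total computation
than training the large network from scratch.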
^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%^%
To retrieve:
unix> ftp cheops.cis.ohio-state.edu (or 128.146.8.62)
Name: anonymous
Password: neuron
ftp> cd pub/neuroprose
ftp> binary
ftp> get pratt.aaai91.ps.Z
ftp> get pratt.ijcnn91.ps.Z
ftp> quit
unix> uncompress pratt.aaai91.ps.Z pratt.ijcnn91.ps.Z
unix> lpr pratt.aaai91.ps pratt.ijcnn91.ps