TR announcement: Long Short Term Memory

Josef Hochreiter hochreit at informatik.tu-muenchen.de
Wed Aug 23 05:15:10 EDT 1995




FTP-host:  flop.informatik.tu-muenchen.de (131.159.8.35)
FTP-filename: /pub/fki/fki-207-95.ps.gz    
or
FTP-host: fava.idsia.ch (192.132.252.1)
FTP-filename: /pub/juergen/fki-207-95.ps.gz     
or something like
netscape http://www.idsia.ch/~juergen


                     LONG SHORT TERM MEMORY
        
            Technical Report FKI-207-95 (8 pages, 50 K) 


        Sepp Hochreiter                  Juergen Schmidhuber

      Fakultaet fuer Informatik                IDSIA
   Technische Universitaet Muenchen        Corso Elvezia 36
      80290 Muenchen, Germany          6900 Lugano, Switzerland


  ``Recurrent backprop'' for learning to store information over 
  extended time periods takes too long. The main reason is 
  insufficient, decaying error back flow.  We describe a novel, 
  efficient ``Long Short Term Memory'' (LSTM) that overcomes 
  this and related problems. Unlike previous approaches, LSTM 
  can learn to bridge arbitrary time lags by enforcing constant 
  error flow. Using gradient descent, LSTM explicitly learns when 
  to store information and when to access it. In experimental 
  comparisons with ``Real-Time Recurrent Learning'', ``Recurrent 
  Cascade-Correlation'', ``Elman nets'', and ``Neural Sequence 
  Chunking'', LSTM leads to many more successful runs, and learns 
  much faster. Unlike its competitors, LSTM can solve tasks 
  involving minimal time lags of more than 1000 time steps, even 
  in noisy environments.


If you don't have gzip/gunzip,  we can mail you an uncompressed 
postscript  version  (as a  last resort). Comments welcome.


Sepp Hochreiter
Juergen Schmidhuber







More information about the Connectionists mailing list