Neural network model of the genetic code

Soren Brunak brunak at cbs.dth.dk
Fri Sep 9 13:43:19 EDT 1994


Neural network model of the genetic code is strongly correlated to  the
GES scale of amino acid transfer free energies 

N. Tolstrup, J. Toftgaard, J. Engelbrecht and S. Brunak
Centre for Biological Sequence Analysis
Department of Physical Chemistry
The Technical University of Denmark
DK-2800 Lyngby, Denmark

Journal of Molecular Biology, to appear.

Abstract 

A neural network trained to classify the 61 nucleotide triplets of  the
genetic code into twenty amino acid categories develops in its internal
representation a pattern matching the  relative  cost  of  transferring
amino acids with satisfied backbone hydrogen bonds  from  water  to  an
environment of dielectric constant of roughly  2.0.  Such  environments
are typically found in lipid membranes or in the interior of  proteins.
In learning the mapping between the  codons  and  the  categories,  the
network groups the amino acids according to the scale of transfer  free
energies developed by  Engelman,  Goldman  and  Steitz.  Several  other
scales based on internal preference statistics  also  agree  reasonably
well with the network grouping. The  network  is  able  to  relate  the
structure  of  the  genetic  code  to  quantifications  of  amino  acid
hydrophobicity-philicity more systematicly than the  numerous  attempts
made earlier. Due to its inherent non-linearity, the code is also shown
to impose decisive constraints on algorithmic analysis of  the  protein
coding potential of DNA.  


To obtain a copy, do:

unix>         ftp 129.142.74.40  (or ftp virus.fki.dth.dk)
Name:         anonymous
Password:     (your email address, please) 
ftp>          binary
ftp>          cd pub
ftp>          get gcode.ps.gz
ftp>          bye
unix>         gunzip gcode.ps.gz
unix>         lpr gcode.ps 

URL  ftp://virus.fki.dth.dk/pub/gcode.ps.gz




More information about the Connectionists mailing list