Papers on Rule-Extraction from trained ANN
Prof invite
joe at sunia.u-strasbg.fr
Thu May 30 07:44:45 EDT 1996
The following papers are available via anonymous ftp:
An Evaluation And Comparison Of Techniques For Extracting And
Refining Rules From Artificial Neural Networks
Robert Andrews* **
Russell Cable*
Joachim Diederich*
Shlomo Geva*
Mostefa Golea*
Ross Hayward*
Chris Ho-Stewart*
Alan B. Tickle* **
Neurocomputing Research Centre*
School of Information Systems**
Queensland University of Technology
Brisbane Q 4001 Australia
QUTNRC-96-01-04.ps.Z
Abstract
It is becoming increasingly apparent that without some form of
explanation capability, the full potential of trained Artificial Neural
Networks (ANNs) may not be realised. The primary purpose of this report is
to survey techniques which have been developed to redress this situation.
Specifically the survey focuses on mechanisms, procedures, and algorithms
designed to insert knowledge into ANNs (knowledge initialisation), extract
rules from trained ANNs (rule extraction), and utilise ANNs to refine
existing rule bases (rule refinement). The survey also introduces a new
taxanomy for classifying the various techniques, discusses their modus
operandi, and delineates criteria for evaluating their efficacy. An
additional facet of the report is a comparative evaluation of the
performance of a set of techniques developed at the Neurocomputing Research
Centre at QUT to extract knowledge from trained ANNs as a set of symbolic
rules.
Note: This is an extended version of: Andrews, R.; Diederich, J.; Tickle, A.B.
A Survey and Critique of Techniques for Extracting Rules from Trained
Artificial Neural Networks. KNOWLEDGE-BASED SYSTEMS 8 (1995) 6, 373-389.
This version includes first empirical results and is distributed with
permission of the editor and publisher.
*******************************************************************************
DEDEC: Decision Detection by Rule Extraction from Neural
Networks
Alan B. Tickle* **
Marian Orlowski* **
Joachim Diederich*
Neurocomputing Research Centre*
School of Information Systems**
Queensland University of Technology
Brisbane Q 4001 Australia
QUTNRC-95-01-03.ps.Z
Abstract
A clearly recognised impediment to the realisation of the full
potential of Artificial Neural Networks is an inherent inability to explain
in a comprehensible form (e.g. as a set of symbolic rules), the process by
which an ANN arrived at a particular conclusion/decision/result. While a
variety of techniques have already appeared to address this limitation, a
substantial number of the more successful approaches are dependent on
specialised ANN architectures. The DEDEC technique is a generic approach to
rule extraction from trained ANNs which is designed to be applicable across
a broad range of ANN architectures. The DEDEC technique is a generic
approach to rule extraction from trained ANNs which is designed to be
applicable across a broad range of ANN architectures. The basic motif
adopted is to utilise the generalisation capability of a trained ANN to
generate a set of examples from the problem domain which may include
examples beyond the initial training set. These examples are then presented
to a symbolic induction algorithm and the requisite rule set extracted.
However an important innovation over other rule-extraction techniques of
this ('pedagogical') type is that the DEDEC technique utilises information
extracted from an analysis of the weight vectors of the trained ANN to rank
the input variables (rule antecedents) in terms of their relative
importance. This additional information is used to focus the search of the
solution space on those examples from the problem domain which are deemed
to be of most significance. The paper gives a detailed description of one
possible implementation of the DEDEC technique and discusses results
obtained on both a set of structured sample problems and 'real world'
problems.
*******************************************************************************
DEDEC: A Methodology For Extracting Rules
From Trained Artificial Neural Networks
Alan B. Tickle* **
Marian Orlowski
Joachim Diederich*
Neurocomputing Research Centre*
School of Information Systems**
Queensland University of Technology
Brisbane Q 4001 Australia
QUTNRC-96-01-05.ps.Z
Abstract
A recognised impediment to the more widespread utilisation of
Artificial Neural Networks (ANNs) is the absence of a capability to
explain, in a human comprehensible form, either the process by which a
specific decision/result has been reached or, in general, the totality of
knowledge embedded within the ANN. Currently, one of the most promising
approaches to redressing this situation is to extract the knowledge
embedded in the trained ANN as a set of symbolic rules. In this paper we
describe the DEDEC methodology for rule-extraction which is applicable to a
broad class of multilayer, feedforward ANNs trained by the
'back-propogation' method. Central to the DEDEC approach is the
identification of the functional dependencies between the ANN inputs (i.e.
the attribute values of the data) and the ANN outputs (e.g. the
classification decision). However the key motif of the DEDEC methodology is
the utilisation of information extracted from analysing the weight vectors
in the trained ANN to focus the process of determining these functional
dependencies. In addition, if required, DEDEC exploits the capability of a
trained ANN to generalise beyond the data used in the ANN training phase.
The paper illustrates one of a number of possible implementations of the
DEDEC methodology, discusses results obtained on both a set of structured
sample problems and a "real world" problem, and provides a comparison with
other rule extraction techniques.
*****************************************************************************
Artificial Intelligence Meets Artificial Insemination
The Importance and Application of Symbolic Rule Extraction From Trained
Artificial Neural Networks
Robert Andrews* **
Joachim Diederich*
Emanoil Pop*
Alan B Tickle* **
Neurocomputing Research Centre*
School of Information Systems**
Queensland University of Technology
Brisbane Q 4001 Australia
QUTNRC-96-01-01.ps.Z
Abstract
In a recent article Andrews et al.[1995] describe a schema for
classifying neural network rule extraction techniques as either
decompositional, eclectic, or pedagogical. Decompositional techniques
require knowledge of the neural network architecture and weights. Each
hidden and output unit is interpreted as a Boolean rule with the
antecedents being a set of incoming links whose summed weights guarantee to
exceed the unit's bias regardless of the activations of the other incoming
links. Pedagogical techniques on the other hand treat the underlyling
neural network as a 'black box' using it to both classify examples and to
generate examples which a symbolic algorithm then converts to rules.
Eclectic techniques combine elements of the two basic categories. In this
paper we describe some reasons why rule extraction is an important area of
research. We then briefly describe three rule extraction algorithms, RULEX,
DEDEC & RULENEG, these being representative of each of the abovementioned
groups. We test these algorithms using two classification problems; the
first being a laboratory benchmarking problem while the second is drawn
from real life. For each problem, each of the rule extraction techniques
previously described is applied to a trained neural network and the
resulting rules presented.
********************************************************************************
Rule Extraction From CASCADE-2 Networks
Ross Hayward
Emanoil Pop
Joachim Diederich
Neurocomputing Research Centre
Queensland University of Technology
Brisbane Q 4001 Australia
QUTNRC-96-01-02.ps.Z
Abstract
Rule extraction from feed forward neural networks is a topic that
is gaining increasing interest. Any symbolic representation of how a
network arrives at a particular decision is important not only for user
acceptance, but also for rule refinement and network learning. This paper
describes a new method of extracting rules that predict the firing of
single units within a feed forward neural network. The extraction technique
is applied to networks constructed by the Cascade 2 algorithm each of which
solve a different benchmark problem. The hidden and output units within
each of the networks are shown to represent distinct rules which govern the
classification of patterns. Since a discrete rule set can be obtained for
each of the units within the network, a logical mapping between input and
output values can be achieved.
********************************************************************************
Feasibility of Incremental Learning in
Biologically Plausible Networks
James M. Hogan
Joachim Diederich
Neurocomputing Research Centre
Queensland University of Technology
Brisbane Q 4001 Australia
QUTNRC-96-01-03.ps.Z
Abstract
The feasibility of incremental learning within a feed-forward
network is examined under the constraint of biologically plausible
connectivity. A randomly connected network (of physiologically plausible
global connection probability) is considered under the assumption of a
local connection probability which decays with distance between nodes. The
representation of the function XOR is chosen as a test problem, and the
likelihood of its recruitment is discussed with reference to the
probability of occurrence of a subnetwork suitable for implementation of
this function, assuming a uniform initial weight distribution.
******************************************************************************
These papers are available from
ftp.fit.qut.edu.au
cd to /pub/NRC/tr/ps
More information about the Connectionists
mailing list