tree classification code available for comparative studies

Wray Buntine wray at ptolemy.arc.nasa.gov
Fri Oct 4 19:11:01 EDT 1991




I've made the following report available on the Neuroprose Archive
(cheops.cis.ohio-state.edu) as
		buntine.treecode.ps.Z
not because I think connectionists are "deeply" interested in tree learning
research but because I think it would be a handy resource for
comparative studies:
	1)    systems such as CART/C4 are recognised baselines against
	      which to benchmark supervised learning systems
	2)    home-grown reimplementations can be buggy and a time sink
	3)    if your problem has some inherent structure and a few
	      key indicator variables then trees may be a good thing to
	      try as well
	4)    trees typically don't work well with purely numeric
	      data or with problems where many variables each make a
	      minor contribution to the prediction being made
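
To illustrate points 3 and 4, here is a minimal Python sketch. It is not
taken from IND, CART or C4; the function names and the toy data are
assumptions made up for the illustration. It scores single-variable splits
with an information-gain criterion of the kind used in Quinlan's ID3
family, first on data where one key indicator variable decides the class,
then on data where the class is a majority vote of three variables.

```python
import math
from collections import Counter
from itertools import product


def entropy(labels):
    """Shannon entropy, in bits, of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def info_gain(rows, labels, attr):
    """Entropy reduction from splitting on one discrete attribute."""
    n = len(labels)
    groups = {}
    for row, y in zip(rows, labels):
        groups.setdefault(row[attr], []).append(y)
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


# Toy data (hypothetical): every combination of three binary variables.
rows = list(product([0, 1], repeat=3))

# Point 3: a single key indicator (variable 0) determines the class.
key_labels = [r[0] for r in rows]
key_gains = [info_gain(rows, key_labels, a) for a in range(3)]

# Point 4: each variable makes a minor contribution (majority vote).
vote_labels = [1 if sum(r) >= 2 else 0 for r in rows]
vote_gains = [info_gain(rows, vote_labels, a) for a in range(3)]

print("key indicator gains:", [round(g, 3) for g in key_gains])
print("majority vote gains:", [round(g, 3) for g in vote_gains])
```

In the first case one split removes all the uncertainty; in the second no
single variable stands out, so a tree has to test several variables in
sequence to recover the rule.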

The IND Tree Package we developed here incorporates parts of early C4,
most of the classification tree component of CART (no regression), and
some more recent Bayesian/MDL approaches that sometimes work better.

You can obtain the LaTeX source for the following introductory report
by emailing:

	ind at kronos.arc.nasa.gov

and ask for "About the IND Tree Package".

---------------------------------------
	
	About the IND Tree Package
	
	Wray Buntine, RIACS 
        NASA Ames Research Center 
        Mail Stop 269-2 
        Moffett Field, CA 94035 

 	September 29, 1991

   This note introduces the IND Tree Package to prospective procurers and
   those users/installers looking at IND for the first time.
   IND does supervised learning using classification trees.
   IND integrates features from Breiman et al.'s CART and Quinlan's C4 
   with newer Bayesian and minimum encoding methods for growing classification
   trees, and provides an experimental control suite on top.  The package comes
   with a manual, "man" entries, and a guide to tree methods and research.
   Information about obtaining IND, performance statistics,
   documentation, authorship, copyright, installation, etc., is given.
   IND is currently under development, although it has been used considerably
   since late 1989.  IND is implemented in C under UNIX.

----------------------------------------

Wray Buntine
RIACS (Research Inst. for Advanced Comp. Sc.)
NASA Ames Research Center                 phone:  (415) 604 3389
Mail Stop 244-17                          fax:    (415) 604 6997
Moffett Field, CA, 94035 		  email:  wray at ptolemy.arc.nasa.gov

