Connectionists: LASSO-Patternsearch paper

Grace Wahba wahba at stat.wisc.edu
Fri Mar 7 22:27:52 EST 2008


Available at http://www.stat.wisc.edu/~wahba -> TRLIST

W. Shi, G. Wahba, S. Wright, K. Lee, R. Klein and B. Klein.   
LASSO-Patternsearch Algorithm with Applications to 
Ophthalmology and Genomic Data "  February 2008.

To appear, Statistics and Its Interface(SII)

The LASSO-Patternsearch algorithm is proposed to efficiently
identify patterns of multiple dichotomous risk factors for outcomes 
of interest in  demographic and genomic studies.  The patterns 
considered are those that arise naturally from the log linear 
expansion of the multivariate Bernoulli density. The method is  
designed for the case where there is a possibly very large number
of candidate patterns but it is believed that only a relatively  
small number are important. A LASSO is used to greatly reduce the 
number of candidate patterns, using a novel computational algorithm
that can handle an extremely large number of unknowns simultaneously.
The patterns surviving the LASSO are further pruned in the framework 
of (parametric) generalized linear models. A novel tuning procedure  
based on the GACV for Bernoulli outcomes, modified to act as a model 
selector, is used at both steps. We applied the method to myopia data 
from the population-based Beaver Dam Eye Study, exposing physiologically 
interesting interacting risk factors. We then applied the 
the method to data from a generative model of Rheumatoid Arthritis
based on  Problem 3 from the Genetic Analysis Workshop 15, successfully
demonstrating its potential to efficiently recover higher order 
patterns from attribute vectors of length typical of genomic studies.


More information about the Connectionists mailing list