Variable Selection, Multicategory SVM's

Grace Wahba wahba at stat.wisc.edu
Thu Oct 4 17:19:42 EDT 2001


The following papers are available via 
http://www.stat.wisc.edu/~wahba  -> TRLIST

         TR 1042
Variable Selection via Basis Pursuit for Non-Gaussian Data

Hao Zhang, Grace Wahba,Yi Lin,
Meta Voelker, Michael Ferris
Ronald Klein and Barbara Klein

         Abstract
A simultaneous flexible variable selection procedure is proposed by 
applying a basis pursuit method to the likelihood function. The basis 
functions are chosen to be compatible with variable selection in the 
context of smoothing spline ANOVA models. Since it is a generalized 
LASSO-type method, it enjoys the favorable property of shrinking 
coefficients and gives interpretable results. We derive a Generalized 
Approximate Cross Validation function (GACV), which is an approximate 
leave-out-one cross validation function used to choose smoothing 
parameters. In order to apply the GACV function for a large data set 
situation, we propose a corresponding randomized GACV. A technique 
called `slice modeling' is used to develop an efficient code. Our 
simulation study shows the effectiveness of the proposed approach 
in the Bernoulli case.

         TR 1043
Multicategory Support Vector Machines

Yoonkyung Lee, Yi Lin and Grace Wahba

         Abstract
The Support Vector Machine (SVM) has shown great performance
in practice as a classification methodology.
Oftentimes multicategory problems have been 
treated as a series of binary problems in the SVM paradigm. 
Even though the SVM implements the optimal classification rule
asymptotically in the binary case, solutions to a series of binary problems 
may not be optimal for the original multicategory problem.
We propose multicategory SVMs, which extend the binary SVM to the
multicategory case, and encompass the binary SVM as a special case.
The multicategory SVM implements the optimal classification rule 
as the sample size gets large, overcoming the suboptimality of
the conventional one-versus-rest approach.  
The proposed method deals with the equal misclassification cost and
the unequal cost case in unified way. (Long version of TR 1040)





More information about the Connectionists mailing list