Abstract
For multivariate data collected over groups, discriminant analysis is a two-stage procedure: separation and allocation. For the traditional least squares procedure, separation of training data into groups is accomplished by the maximization of the Lawley-Hotelling test for differences between group means. This produces a set of discriminant coordinates which are used to visualize the data. Using the nearest center rule, the discriminant representation can be used for allocation of data of unknown group membership. In this paper, we propose an approach to discriminant analysis based on efficient robust discriminant coordinates. These coordinates are obtained by the maximization of a Lawley-Hotelling test based on robust estimates. The design matrix used in the fitting is the usual one-way incidence matrix of zeros and ones; hence, our procedure uses highly efficient robust estimators to do the fitting. This produces efficient robust discriminant coordinates which allow the user to visually assess the differences among groups. Further, the allocation is based on the robust discriminant representation of the data using the nearest robust center rule. We discuss our procedure in terms of an affine-equivariant estimating procedure. The robustness of our procedure is verified in several examples. In a Monte Carlo study on probabilities of misclassifications of the procedures over a variety of error distributions, the robust discriminant analysis performs practically as well as the traditional procedure for good data and is much more efficient than the traditional procedure in the presence of outliers and heavy tailed error distributions. Further, our procedure is much more efficient than a high breakdown procedure.
Original language | English |
---|---|
Pages (from-to) | 199-213 |
Number of pages | 15 |
Journal | Journal of Nonparametric Statistics |
Volume | 19 |
Issue number | 4-5 |
DOIs | |
State | Published - May 2007 |
Keywords
- Affine-equivariant estimators
- Least squares
- Linear discriminant rule
- Nearest center rule
- Nonparametrics
- Rank-based analysis
- Visualization
- Wilcoxon analysis
ASJC Scopus subject areas
- Statistics and Probability
- Statistics, Probability and Uncertainty