Abstract
We consider high-dimensional multiclass classification by sparse multinomial logistic regression. Unlike binary classification, in the multiclass setup one can think about an entire spectrum of possible notions of sparsity associated with different structural assumptions on the regression coefficients matrix. We propose a computationally feasible feature selection procedure based on penalized maximum likelihood with convex penalties capturing a specific type of sparsity at hand. In particular, we consider global row-wise sparsity, double row-wise sparsity, and low-rank sparsity, and show that with the properly chosen tuning parameters the derived plug-in classifiers attain the minimax generalization error bounds (in terms of misclassification excess risk) within the corresponding classes of multiclass sparse linear classifiers. The developed approach is general and can be adapted to other types of sparsity as well.
| Original language | English |
|---|---|
| Article number | 151 |
| Pages (from-to) | 7217 - 7251 |
| Number of pages | 35 |
| Journal | Journal of Machine Learning Research |
| Volume | 24 |
| DOIs | |
| State | Published - 2023 |
Keywords
- Feature selection
- high-dimensionality
- minimaxity
- misclassification excess risk
- sparsity
All Science Journal Classification (ASJC) codes
- Control and Systems Engineering
- Software
- Statistics and Probability
- Artificial Intelligence
Fingerprint
Dive into the research topics of 'Generalization Error Bounds for Multiclass Sparse Linear Classifiers'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver