Generalization Error Bounds for Multiclass Sparse Linear Classifiers

Tomer Levy, Felix Abramovich

Research output: Contribution to journalArticlepeer-review

Abstract

We consider high-dimensional multiclass classification by sparse multinomial logistic regression. Unlike binary classification, in the multiclass setup one can think about an entire spectrum of possible notions of sparsity associated with different structural assumptions on the regression coefficients matrix. We propose a computationally feasible feature selection procedure based on penalized maximum likelihood with convex penalties capturing a specific type of sparsity at hand. In particular, we consider global row-wise sparsity, double row-wise sparsity, and low-rank sparsity, and show that with the properly chosen tuning parameters the derived plug-in classifiers attain the minimax generalization error bounds (in terms of misclassification excess risk) within the corresponding classes of multiclass sparse linear classifiers. The developed approach is general and can be adapted to other types of sparsity as well.

Original languageEnglish
Article number151
Pages (from-to)7217 - 7251
Number of pages35
JournalJournal of Machine Learning Research
Volume24
DOIs
StatePublished - 2023

Keywords

  • Feature selection
  • high-dimensionality
  • minimaxity
  • misclassification excess risk
  • sparsity

All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering
  • Software
  • Statistics and Probability
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Generalization Error Bounds for Multiclass Sparse Linear Classifiers'. Together they form a unique fingerprint.

Cite this