Risk Estimation with Active Labeling

Alessandro Magnani, Esteban Arcaute, Shie Mannor

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We consider a setting where, for a given model, with a given labeling budget, we need to accurately evaluate its risk repeatedly, when the set of items considered for the risk evaluation, as well as the loss function, change over time. This is a natural setting in a non-stationary environment, such as that of large retail chains, whose catalogue changes year-round. Since evaluating risk often requires human judgement, the cost can increase dramatically over time. We propose a new estimator that minimize the labeling cost by reusing all available labels when possible and by actively selecting items to be labeled in an optimal way. We show that an optimal sampling profile can be derived, efficiently and at scale, as the solution of an optimization problem. We show how this approach with only a small added computational and storage cost, can efficiently reduce the labeling work required to measure the risk of a model in a non-stationary environment in a production system. We extend these results to the Fα measure and weighted risks. The presented approach is related to the Horvitz-Thompson estimator, importance sampling and active learning and provides a scalable, robust solution for risk evaluation in non-stationary environment that cannot be achieved with either Horvitz-Thompson estimator nor importance sampling.

Original languageEnglish
Title of host publicationMachine Learning and Soft Computing - 9th International Conference, ICMLSC 2025, Revised Selected Papers
EditorsLetian Huang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages103-120
Number of pages18
ISBN (Print)9789819663996
DOIs
StatePublished - 2025
Event9th International Conference on Machine Learning and Soft Computing, ICMLSC 2025 - Tokyo, Japan
Duration: 24 Jan 202526 Jan 2025

Publication series

NameCommunications in Computer and Information Science
Volume2487 CCIS

Conference

Conference9th International Conference on Machine Learning and Soft Computing, ICMLSC 2025
Country/TerritoryJapan
CityTokyo
Period24/01/2526/01/25

All Science Journal Classification (ASJC) codes

  • General Computer Science
  • General Mathematics

Fingerprint

Dive into the research topics of 'Risk Estimation with Active Labeling'. Together they form a unique fingerprint.

Cite this