Abstract
Machine Learning models are prevalent in critical human-related decision making, such as resume filtering and loan applications. Refused individuals naturally ask what could change the decision, should they reapply. This question is hard for the model owner to answer: first, the model is typically complex and not easily interpretable; second, models may be updated periodically; and last, attributes of the individual seeking approval are apt to change over time. While each of these challenges has been extensively studied in isolation, their conjunction has not. To this end, we propose a novel framework that allows users to devise a plan of action for individuals in the presence of Machine Learning classification, where both the ML model and the user properties are expected to change over time. Our technical solution is currently confined to a particular yet important class of models, namely tree-based ensembles (Random Forests, Gradient Boosted trees). In this setting, it uniquely combines state-of-the-art solutions for single-model interpretation, domain adaptation techniques for predicting future models, and constraint databases to represent and query the space of possible actions. We devise efficient algorithms that leverage these foundations in a novel solution, and experimentally show that they are effective in proposing useful and actionable steps leading to the desired classification.
Original language | English |
---|---|
Pages (from-to) | 798-811 |
Number of pages | 14 |
Journal | Proceedings of the VLDB Endowment |
Volume | 13 |
Issue number | 6 |
DOIs | |
State | Published - 1 Jan 2020 |
Event | 46th International Conference on Very Large Data Bases, VLDB 2020 - Virtual, Japan. Duration: 31 Aug 2020 → 4 Sep 2020 |
All Science Journal Classification (ASJC) codes
- Computer Science (miscellaneous)
- General Computer Science