Abstracting fairness: Oracles, metrics, and interpretability

Cynthia Dwork, Christina Ilvento, Guy N. Rothblum, Pragya Sur

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

It is well understood that classification algorithms, for example, for deciding on loan applications, cannot be evaluated for fairness without taking context into account. We examine what can be learned from a fairness oracle equipped with an underlying understanding of “true” fairness. The oracle takes as input a (context, classifier) pair satisfying an arbitrary fairness definition, and accepts or rejects the pair according to whether the classifier satisfies the underlying fairness truth. Our principal conceptual result is an extraction procedure that learns the underlying truth; moreover, the procedure can learn an approximation to this truth given access to a weak form of the oracle. Since every “truly fair” classifier induces a coarse metric, in which those receiving the same decision are at distance zero from one another and those receiving different decisions are at distance one, this extraction process provides the basis for ensuring a rough form of metric fairness, also known as individual fairness. Our principal technical result is a higher fidelity extractor under a mild technical constraint on the weak oracle’s conception of fairness. Our framework permits the scenario in which many classifiers, with differing outcomes, may all be considered fair. Our results have implications for interpretablity – a highly desired but poorly defined property of classification systems that endeavors to permit a human arbiter to reject classifiers deemed to be “unfair” or illegitimately derived.

Original languageEnglish
Title of host publication1st Symposium on Foundations of Responsible Computing, FORC 2020
EditorsAaron Roth
PublisherSchloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
Number of pages16
ISBN (Electronic)9783959771429
DOIs
StatePublished - 1 May 2020
Event1st Symposium on Foundations of Responsible Computing, FORC 2020 - Virtual, Cambridge, United States
Duration: 1 Jun 2020 → …

Publication series

NameLeibniz International Proceedings in Informatics, LIPIcs
Volume156
ISSN (Print)1868-8969

Conference

Conference1st Symposium on Foundations of Responsible Computing, FORC 2020
Country/TerritoryUnited States
CityVirtual, Cambridge
Period1/06/20 → …

All Science Journal Classification (ASJC) codes

  • Software

Fingerprint

Dive into the research topics of 'Abstracting fairness: Oracles, metrics, and interpretability'. Together they form a unique fingerprint.

Cite this