Skip to main navigation Skip to search Skip to main content

Learning from Others: Similarity-based Regularization for Mitigating Dataset Bias

Reda Igbaria, Yonatan Belinkov

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Common methods for mitigating spurious correlations in natural language understanding (NLU) usually operate in the output space, encouraging a main model to behave differently from a bias model by down-weighing examples where the bias model is confident. While improving out-of-distribution (OOD) performance, it was recently observed that the internal representations of the presumably debiased models are actually more, rather than less biased. We propose SimReg, a new method for debiasing internal model components via similarity-based regularization, in representation space: We encourage the model to learn representations that are either similar to an unbiased model or different from a biased model. We experiment with three NLU tasks and different kinds of biases. We find that SimReg improves OOD performance, with little in-distribution degradation. Moreover, the representations learned by SimReg are less biased than in other methods.

Original languageEnglish
Title of host publicationACL 2024 - 9th Workshop on Representation Learning for NLP, RepL4NLP 2024 - Proceedings of the Workshop
EditorsChen Zhao, Marius Mosbach, Pepa Atanasova, Seraphina Goldfarb-Tarrent, Peter Hase, Arian Hosseini, Maha Elbayad, Sandro Pezzelle, Maximilian Mozes
Pages37-50
Number of pages14
ISBN (Electronic)9798891761575
StatePublished - 2024
Event9th Workshop on Representation Learning for NLP, RepL4NLP 2024 at ACL 2024 - Bangkok, Thailand
Duration: 15 Aug 2024 → …

Publication series

NameACL 2024 - 9th Workshop on Representation Learning for NLP, RepL4NLP 2024 - Proceedings of the Workshop

Conference

Conference9th Workshop on Representation Learning for NLP, RepL4NLP 2024 at ACL 2024
Country/TerritoryThailand
CityBangkok
Period15/08/24 → …

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Software
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Learning from Others: Similarity-based Regularization for Mitigating Dataset Bias'. Together they form a unique fingerprint.

Cite this