A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis

Alessandro Stolfo, Yonatan Belinkov, Mrinmaya Sachan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Fingerprint

Dive into the research topics of 'A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis'. Together they form a unique fingerprint.

Computer Science

Psychology

Keyphrases