Why Attention is Not Explanation: Surgical Intervention and Causal Reasoning about Neural Models

Proceedings of the 12th Conference on Language Resources and Evaluation (2020)
  Copy   BIBTEX

Abstract

As the demand for explainable deep learning grows in the evaluation of language technologies, the value of a principled grounding for those explanations grows as well. Here we study the state-of-the-art in explanation for neural models for natural-language processing (NLP) tasks from the viewpoint of philosophy of science. We focus on recent evaluation work that finds brittleness in explanations obtained through attention mechanisms.We harness philosophical accounts of explanation to suggest broader conclusions from these studies. From this analysis, we assert the impossibility of causal explanations from attention layers over text data. We then introduce NLP researchers to contemporary philosophy of science theories that allow robust yet non-causal reasoning in explanation, giving computer scientists a vocabulary for future research

Other Versions

No versions found

Links

PhilArchive

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Abstract versus Causal Explanations?Reutlinger Alexander & Andersen Holly - 2016 - International Studies in the Philosophy of Science 30 (2):129-146.
Network Explanations and Explanatory Directionality.Lina Jansson - 2020 - Philosophical Transactions of the Royal Society B 375 (1796).
Quantum causal explanation: or, why birds fly south.Sally Shrapnel - 2014 - European Journal for Philosophy of Science 4 (3):409-423.
Causal Explanations and XAI.Sander Beckers - 2022 - Proceedings of the 1St Conference on Causal Learning and Reasoning, Pmlr.

Analytics

Added to PP
2020-06-23

Downloads
841 (#27,410)

6 months
107 (#56,271)

Historical graph of downloads
How can I increase my downloads?

Author's Profile

Julia Bursten
University of Kentucky

Citations of this work

Add more citations