Skip to main navigation Skip to search Skip to main content

Relevance-based Infilling for Natural Language Counterfactuals

  • Lorenzo Betti
  • , Carlo Abrate
  • , Francesco Bonchi
  • , Andreas Kaltenbrunner
  • Institute for Scientific Interchange Foundation
  • CENTAI
  • University of Rome La Sapienza
  • EURECAT
  • Open University of Catalonia

Research output: Contribution to Book/Report typesConference contributionpeer-review

Abstract (may include machine translation)

Counterfactual explanations are a natural way for humans to gain understanding and trust in the outcomes of complex machine learning algorithms. In the context of natural language processing, generating counterfactuals is particularly challenging as it requires the generated text to be fluent, grammatically correct, and meaningful. In this study, we improve the current state of the art for the generation of such counterfactual explanations for text classifiers. Our approach, named RELITC (Relevance-based Infilling for Textual Counterfactuals), builds on the idea of masking a fraction of text tokens based on their importance in a given prediction task and employs a novel strategy, based on the entropy of their associated probability distributions, to determine the infilling order of these tokens. Our method uses less time than competing methods to generate counterfactuals that require less changes, are closer to the original text and preserve its content better, while being competitive in terms of fluency. We demonstrate the effectiveness of the method on four different datasets and show the quality of its outcomes in a comparison with human generated counterfactuals.

Original languageEnglish
Title of host publicationCIKM 2023 - Proceedings of the 32nd ACM International Conference on Information and Knowledge Management
PublisherAssociation for Computing Machinery
Pages88-98
Number of pages11
ISBN (Electronic)9798400701245
DOIs
StatePublished - 21 Oct 2023
Event32nd ACM International Conference on Information and Knowledge Management, CIKM 2023 - Birmingham, United Kingdom
Duration: 21 Oct 202325 Oct 2023

Publication series

NameInternational Conference on Information and Knowledge Management, Proceedings

Conference

Conference32nd ACM International Conference on Information and Knowledge Management, CIKM 2023
Country/TerritoryUnited Kingdom
CityBirmingham
Period21/10/2325/10/23

Keywords

  • counterfactuals
  • explainability
  • masked language model
  • NLP

Fingerprint

Dive into the research topics of 'Relevance-based Infilling for Natural Language Counterfactuals'. Together they form a unique fingerprint.

Cite this