Word-level saliency explanations (“heat maps over words”) are often used to communicate feature-attribution in text-based models. Recent studies found that superficial factors such as word length can distort human interpretation of the communicated saliency scores. We conduct a user study to investigate how the marking of a word's neighboring words affect the explainee's perception of the word's importance in the context of a saliency explanation. We find that neighboring words have significant effects on the word's importance rating. Concretely, we identify that the influence changes based on neighboring direction (left vs. right) and a-priori linguistic and computational measures of phrases and collocations (vs. unrelated neighboring words). Our results question whether text-based saliency explanations should be continued to be communicated at word level, and inform future research on alternative saliency explanation methods.
|Title of host publication||Findings of the Association for Computational Linguistics, ACL 2023|
|Publisher||Association for Computational Linguistics (ACL)|
|Number of pages||18|
|State||Published - 2023|
|Event||61st Annual Meeting of the Association for Computational Linguistics, ACL 2023 - Toronto, Canada|
Duration: 9 Jul 2023 → 14 Jul 2023
|Name||Proceedings of the Annual Meeting of the Association for Computational Linguistics|
|Conference||61st Annual Meeting of the Association for Computational Linguistics, ACL 2023|
|Period||9/07/23 → 14/07/23|
Bibliographical notePublisher Copyright:
© 2023 Association for Computational Linguistics.