Skip to main navigation Skip to search Skip to main content

Beyond Face Value: Visual and Auditory Signals in Human and Machine Trust Judgments

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

As conversational agents become increasingly multimodal, they invite human-like evaluations—especially in trust-sensitive contexts. Building on the human tendency to form rapid judgments from subtle visual and auditory cues, we explore how trust is constructed from faces and voices. In a behavioral experiment, 150 participants rated bimodal stimuli across four trust congruence conditions. We then trained a multimodal model using HuBERT and ResNet-50 with late fusion to predict trust scores. To examine alignment between human and model judgments, we applied Permutation Feature Importance (PFI) to compare the most influential features. Our results highlight the dominance of auditory cues in both human and model trust evaluations, while revealing subtle but meaningful differences in feature weighting across modalities and conditions.

Original languageEnglish
Title of host publicationCUI 2025 - Proceedings of the 2025 ACM Conference on Conversational User Interfaces
PublisherAssociation for Computing Machinery, Inc
ISBN (Electronic)9798400715273
DOIs
StatePublished - 7 Jul 2025
Externally publishedYes
Event7th Conference on Conversational User Interfaces, CUI 2025 - Waterloo, Canada
Duration: 8 Jul 202510 Jul 2025

Publication series

NameCUI 2025 - Proceedings of the 2025 ACM Conference on Conversational User Interfaces

Conference

Conference7th Conference on Conversational User Interfaces, CUI 2025
Country/TerritoryCanada
CityWaterloo
Period8/07/2510/07/25

Bibliographical note

Publisher Copyright:
© 2025 Copyright held by the owner/author(s).

Keywords

  • Behavioral Experiment
  • Feature Importance
  • Human-AI Alignment
  • Multimodal Trust Perception

Fingerprint

Dive into the research topics of 'Beyond Face Value: Visual and Auditory Signals in Human and Machine Trust Judgments'. Together they form a unique fingerprint.

Cite this