Abstract
With advances in machine learning and speech technologies, conversational agents are becoming increasingly capable of engaging in human-like conversations. However, trust is crucial for effective communication and collaboration, and understanding the signals of trustworthy speech is essential for successful interactions. While researchers across disciplines have sought to discover the signals of trustworthy speech, mostly in human speech, in this paper, we explore the human perception of trustworthy synthesized speech. We present the results of a large-scale crowdsourced perception study, designed to investigate the acoustic-prosodic properties of trustworthy synthesized speech. Highly controlled parameters are manipulated to test the effects of acoustic-prosodic features including pitch, intensity, and speaking rate. We also extend the work to examine individual differences in the perception and production of trustworthy in speech. To evaluate trust perception in contexts that require vulnerability and trust, a real-world application of emotional support dialogues is used. The findings of this work contribute valuable insights to improve the perceived trustworthiness of conversational agents.
| Original language | English |
|---|---|
| Pages (from-to) | 1240-1244 |
| Number of pages | 5 |
| Journal | Proceedings of the International Conference on Speech Prosody |
| DOIs | |
| State | Published - 2024 |
| Externally published | Yes |
| Event | 12th International Conference on Speech Prosody, Speech Prosody 2024 - Leiden, Netherlands Duration: 2 Jul 2025 → 5 Jul 2025 |
Bibliographical note
Publisher Copyright:© 2024 International Speech Communications Association. All rights reserved.
Keywords
- computational paralinguistics
- human-computer interaction
- speaking style
- speech perception
- trustworthiness