Abstract
Deception detection in conversational dialogue has attracted much attention in recent years. Yet existing methods for this rely heavily on human-labeled annotations that are costly and potentially inaccurate. In this work, we present an automated system that utilizes multimodal features for conversational deception detection, without the use of human annotations. We study the predictive power of different modalities and combine them for better performance. We use openSMILE to extract acoustic features after applying noise reduction techniques to the original audio. Facial landmark features are extracted from the visual modality. We experiment with training facial expression detectors and applying Fisher Vectors to encode sequences of facial landmarks with varying length. Linguistic features are extracted from automatic transcriptions of the data. We examine the performance of these methods on the Box of Lies dataset of deception game videos, achieving 73% accuracy using features from all modalities. This result is significantly better than previous results on this corpus which relied on manual annotations, and also better than human performance.
| Original language | English |
|---|---|
| Title of host publication | Interspeech 2020 |
| Publisher | International Speech Communication Association |
| Pages | 359-363 |
| Number of pages | 5 |
| ISBN (Print) | 9781713820697 |
| DOIs | |
| State | Published - 2020 |
| Externally published | Yes |
| Event | 21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020 - Shanghai, China Duration: 25 Oct 2020 → 29 Oct 2020 |
Publication series
| Name | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH |
|---|---|
| Volume | 2020-October |
| ISSN (Print) | 2308-457X |
| ISSN (Electronic) | 1990-9772 |
Conference
| Conference | 21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020 |
|---|---|
| Country/Territory | China |
| City | Shanghai |
| Period | 25/10/20 → 29/10/20 |
Bibliographical note
Publisher Copyright:© 2020 ISCA
Keywords
- Deception
- Facial landmarks
- Multimodal data
- Prosody