We introduce a large set of Hebrew lexicons pertaining to psychological aspects. These lexicons are useful for various psychology applications such as detecting emotional state, well being, relationship quality in conversation, identifying topics (e.g., family, work) and many more. We discuss the challenges in creating and validating lexicons in a new language, and highlight our methodological considerations in the data-driven lexicon construction process. Most of the lexicons are publicly available, which will facilitate further research on Hebrew clinical psychology text analysis. The lexicons were developed through data driven means, and verified by domain experts, clinical psychologists and psychology students, in a process of reconciliation with three judges. Development and verification relied on a dataset of a total of 872 psychotherapy session transcripts. We describe the construction process of each collection, the final resource and initial results of research studies employing this resource.
|Title of host publication||Computational Linguistics and Clinical Psychology|
|Subtitle of host publication||Improving Access, CLPsych 2021 - Proceedings of the 7th Workshop, in conjunction with NAACL 2021|
|Editors||Nazli Goharian, Philip Resnik, Andrew Yates, Molly Ireland, Kate Niederhoffer, Rebecca Resnik|
|Publisher||Association for Computational Linguistics (ACL)|
|Number of pages||15|
|State||Published - 2021|
|Event||7th Workshop on Computational Linguistics and Clinical Psychology: Improving Access, CLPsych 2021 - Virtual, Online|
Duration: 11 Jun 2021 → …
|Name||Computational Linguistics and Clinical Psychology: Improving Access, CLPsych 2021 - Proceedings of the 7th Workshop, in conjunction with NAACL 2021|
|Conference||7th Workshop on Computational Linguistics and Clinical Psychology: Improving Access, CLPsych 2021|
|Period||11/06/21 → …|
Bibliographical noteFunding Information:
We thank the anonymous reviewers for their careful reading of our manuscript and their insightful comments and suggestions. This project has received funding from the Israel Science Foundation (grants 1348/15 and 1278/16); and from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program, grant agreement No. 802774 (iEXTRACT).
©2021 Association for Computational Linguistics.