Abstract
Most traditional distributional similarity models fail to capture syntagmatic patterns that group together multiple word features within the same joint context. In this work we introduce a novel generic distributional similarity scheme under which the power of probabilistic models can be leveraged to effectively model joint contexts. Based on this scheme, we implement a concrete model which utilizes probabilistic n-gram language models. Our evaluations suggest that this model is particularly well-suited for measuring similarity for verbs, which are known to exhibit richer syntagmatic patterns, while maintaining comparable or better performance with respect to competitive baselines for nouns. Following this, we propose our scheme as a framework for future semantic similarity models leveraging the substantial body of work that exists in probabilistic language modeling.
Original language | English |
---|---|
Title of host publication | CoNLL 2014 - 18th Conference on Computational Natural Language Learning, Proceedings |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 181-190 |
Number of pages | 10 |
ISBN (Electronic) | 9781941643020 |
DOIs | |
State | Published - 2014 |
Event | 18th Conference on Computational Natural Language Learning, CoNLL 2014 - Baltimore, United States Duration: 26 Jun 2014 → 27 Jun 2014 |
Publication series
Name | CoNLL 2014 - 18th Conference on Computational Natural Language Learning, Proceedings |
---|
Conference
Conference | 18th Conference on Computational Natural Language Learning, CoNLL 2014 |
---|---|
Country/Territory | United States |
City | Baltimore |
Period | 26/06/14 → 27/06/14 |
Bibliographical note
Publisher Copyright:© 2014 Association for Computational Linguistics.
Funding
This work was partially supported by the Israeli Ministry of Science and Technology grant 3-8705, the Israel Science Foundation grant 880/12, the European Community’s Seventh Framework Pro-gramme (FP7/2007-2013) under grant agreement no. 287923 (EXCITEMENT) and the Scientific and Technical Research Council of Turkey (TÜBİTAK, Grant Number 112E277). This work was partially supported by the Israeli Ministry of Science and Technology grant 3-8705, the Israel Science Foundation grant 880/12, the European Community’s Seventh Framework Programme (FP7/2007-2013) under grant agreement no. 287923 (EXCITEMENT) and the Scientific and Technical Research Council of Turkey (TÜB˙TAK, Grant Number 112E277).
Funders | Funder number |
---|---|
European Community’s Seventh Framework Pro-gramme | |
Seventh Framework Programme | 287923 |
Israel Science Foundation | 880/12 |
Türkiye Bilimsel ve Teknolojik Araştirma Kurumu | 112E277 |
Ministry of science and technology, Israel | 3-8705 |