Statistical thesaurus construction for a morphologically rich language

Chaya Liebeskind, Ido Dagan, Jonathan Schler

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review

11 Scopus citations

Abstract

Corpus-based thesaurus construction for Morphologically Rich Languages (MRL) is a complex task, due to the morphological variability of MRL. In this paper we explore alternative term representations, complemented by clustering of morphological variants. We introduce a generic algorithmic scheme for thesaurus construction in MRL, and demonstrate the empirical benefit of our methodology for a Hebrew thesaurus.

Original language: English
Title of host publication: Proceedings of the Main Conference and the Shared Task
Publisher: Association for Computational Linguistics (ACL)
Pages: 59-64
Number of pages: 6
ISBN (Electronic): 9781937284213
State: Published - 2012
Event: 1st Joint Conference on Lexical and Computational Semantics, *SEM 2012 - Montreal, Canada
Duration: 7 Jun 2012 - 8 Jun 2012

Publication series

Name: *SEM 2012 - 1st Joint Conference on Lexical and Computational Semantics
Volume: 1

Conference

Conference: 1st Joint Conference on Lexical and Computational Semantics, *SEM 2012
Country/Territory: Canada
City: Montreal
Period: 7/06/12 - 8/06/12

Bibliographical note

Publisher Copyright:
© 2012 Association for Computational Linguistics.
