Integrating Query Performance Prediction in Term Scoring for Diachronic Thesaurus

Chaya Liebeskind, Ido Dagan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

A diachronic thesaurus is a lexical resource that aims to map between modern terms and their semantically related terms in earlier periods. In this paper, we investigate the task of collecting a list of relevant modern target terms for a domain-specific diachronic thesaurus. We propose a supervised learning scheme, which integrates features from two closely related fields: Terminology Extraction and Query Performance Prediction (QPP). Our method further expands modern candidate terms with ancient related terms, before assessing their corpus relevancy with QPP measures. We evaluate the empirical benefit of our method for a thesaurus for a diachronic Jewish corpus. c 2015 Association for Computational Linguistics and The Asian Federation of Natural Language Processing.

Original languageEnglish
Title of host publicationLaTeCH 2015 - Proceedings of the 9th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
EditorsKalliopi A. Zervanou, Marieke van Erp, Beatrice Alex
PublisherAssociation for Computational Linguistics (ACL)
Pages89-94
Number of pages6
ISBN (Electronic)9781941643631
StatePublished - 2015
Event9th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, LaTeCH 2015 - Beijing, China
Duration: 30 Jul 2015 → …

Publication series

NameProceedings of the Annual Meeting of the Association for Computational Linguistics
Volume2015-text
ISSN (Print)0736-587X

Conference

Conference9th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, LaTeCH 2015
Country/TerritoryChina
CityBeijing
Period30/07/15 → …

Bibliographical note

Publisher Copyright:
© 2015 Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Fingerprint

Dive into the research topics of 'Integrating Query Performance Prediction in Term Scoring for Diachronic Thesaurus'. Together they form a unique fingerprint.

Cite this