Term Set Expansion based on Multi-Context Term Embeddings: an End-to-end Workflow

Jonathan Mamou, Oren Pereg, Moshe Wasserblat, Ido Dagan, Yoav Goldberg, Alon Eirew, Yael Green, Shira Guskin, Peter Izsak, Daniel Korat

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

5 Scopus citations

Abstract

We present SetExpander, a corpus-based system for expanding a seed set of terms into a more complete set of terms that belong to the same semantic class. SetExpander implements an iterative end-to-end workflow for term set expansion. It enables users to easily select a seed set of terms, expand it, view the expanded set, validate it, re-expand the validated set and store it, thus simplifying the extraction of domain-specific fine-grained semantic classes. SetExpander has been used for solving real-life use cases including integration in an automated recruitment system and an issues and defects resolution system.
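The select, expand, validate, re-expand loop described in the abstract can be illustrated with a minimal sketch. This is not SetExpander's implementation: the toy embedding table, the centroid-plus-cosine ranking, and the names `expand`, `iterate` and `validate` are all assumptions made for illustration, and a single embedding per term stands in for the multi-context embeddings named in the title.

```python
import math

# Toy 3-dimensional embeddings; the values are invented for illustration only.
EMBEDDINGS = {
    "python": [0.9, 0.1, 0.0],
    "java": [0.8, 0.2, 0.1],
    "rust": [0.85, 0.15, 0.05],
    "paris": [0.0, 0.9, 0.2],
    "london": [0.1, 0.95, 0.1],
    "banana": [0.1, 0.1, 0.9],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def centroid(terms, embeddings):
    """Component-wise mean of the vectors of the given terms."""
    vectors = [embeddings[t] for t in terms]
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def expand(seed, embeddings, top_k=2):
    """Rank non-seed terms by cosine similarity to the seed centroid (assumed scoring)."""
    center = centroid(seed, embeddings)
    scored = [(t, cosine(center, v)) for t, v in embeddings.items() if t not in seed]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [t for t, _ in scored[:top_k]]


def iterate(seed, embeddings, validate, rounds=2, top_k=2):
    """Expand, keep only validated terms, and re-expand from the validated set."""
    current = set(seed)
    for _ in range(rounds):
        accepted = {t for t in expand(current, embeddings, top_k) if validate(t)}
        if not accepted:
            break  # nothing new was validated, so stop early
        current |= accepted
    return current


if __name__ == "__main__":
    # The validate callback is a stand-in for the user's manual validation step.
    languages = {"java", "rust"}
    print(iterate({"python"}, EMBEDDINGS, validate=lambda t: t in languages))
```

In this sketch the `validate` callback plays the role of the user reviewing the expanded set; only accepted terms feed the next expansion round.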

Original language: English
Title of host publication: COLING 2018 - 27th International Conference on Computational Linguistics, Proceedings of System Demonstrations
Publisher: Association for Computational Linguistics (ACL)
Pages: 58-62
Number of pages: 5
ISBN (Electronic): 9781948087537
State: Published - 2018
Event: 27th International Conference on Computational Linguistics, COLING 2018 - Santa Fe, United States
Duration: 20 Aug 2018 to 26 Aug 2018

Publication series

Name: COLING 2018 - 27th International Conference on Computational Linguistics, Proceedings of System Demonstrations

Conference

Conference: 27th International Conference on Computational Linguistics, COLING 2018
Country/Territory: United States
City: Santa Fe
Period: 20/08/18 to 26/08/18

Bibliographical note

Publisher Copyright:
© COLING 2018. All rights reserved.

Funding

This work was supported in part by an Intel ICRI-CI grant. The authors are grateful to Sapir Tsabari from Intel AI Lab for her help in the dataset preparation.

Funders: Intel ICRI-CI
