Termight: Coordinating Humans and Machines in Bilingual Terminology Acquisition

Ido Dagan, Ken Church

Research output: Contribution to journalArticlepeer-review

27 Scopus citations

Abstract

We propose a semi-automatic tool, termight, that supports the construction of bilingual glossaries. Termight consists of two components which address the two subtasks in glossary construction: (a) preparing a monolingual list of all technical terms in a source-language document, and (b) finding the translations for these terms in parallel source-target documents. As a first step (in each component) the tool extracts automatically candidate terms and candidate translations, based on term-extraction and word-alignment algorithms. It then performs several additional preprocessing steps which greatly facilitate human post-editing of the candidate lists. These steps include grouping and sorting of candidates and associating example concordance lines with each candidate. Finally, the data prepared in preprocessing is presented to the user via an interactive interface which supports quick post-editing operations. Termight was deployed by translators at AT&T Business Translation Services (formerly AT&T Language Line Services) leading to very high rates of semi-automatic glossary construction.

Original languageEnglish
Pages (from-to)89-107
Number of pages19
JournalMachine Translation
Volume12
Issue number1-2
DOIs
StatePublished - 1997

Keywords

  • Bilingual alignment
  • Bilingual terminology extraction
  • Linguistic engineering
  • Linguistic interfaces
  • Terminology extraction
  • Translation tools

Fingerprint

Dive into the research topics of 'Termight: Coordinating Humans and Machines in Bilingual Terminology Acquisition'. Together they form a unique fingerprint.

Cite this