TY - JOUR
T1 - Termight
T2 - Coordinating Humans and Machines in Bilingual Terminology Acquisition
AU - Dagan, Ido
AU - Church, Ken
PY - 1997
Y1 - 1997
N2 - We propose a semi-automatic tool, termight, that supports the construction of bilingual glossaries. Termight consists of two components which address the two subtasks in glossary construction: (a) preparing a monolingual list of all technical terms in a source-language document, and (b) finding the translations for these terms in parallel source-target documents. As a first step (in each component) the tool extracts automatically candidate terms and candidate translations, based on term-extraction and word-alignment algorithms. It then performs several additional preprocessing steps which greatly facilitate human post-editing of the candidate lists. These steps include grouping and sorting of candidates and associating example concordance lines with each candidate. Finally, the data prepared in preprocessing is presented to the user via an interactive interface which supports quick post-editing operations. Termight was deployed by translators at AT&T Business Translation Services (formerly AT&T Language Line Services) leading to very high rates of semi-automatic glossary construction.
AB - We propose a semi-automatic tool, termight, that supports the construction of bilingual glossaries. Termight consists of two components which address the two subtasks in glossary construction: (a) preparing a monolingual list of all technical terms in a source-language document, and (b) finding the translations for these terms in parallel source-target documents. As a first step (in each component) the tool extracts automatically candidate terms and candidate translations, based on term-extraction and word-alignment algorithms. It then performs several additional preprocessing steps which greatly facilitate human post-editing of the candidate lists. These steps include grouping and sorting of candidates and associating example concordance lines with each candidate. Finally, the data prepared in preprocessing is presented to the user via an interactive interface which supports quick post-editing operations. Termight was deployed by translators at AT&T Business Translation Services (formerly AT&T Language Line Services) leading to very high rates of semi-automatic glossary construction.
KW - Bilingual alignment
KW - Bilingual terminology extraction
KW - Linguistic engineering
KW - Linguistic interfaces
KW - Terminology extraction
KW - Translation tools
UR - http://www.scopus.com/inward/record.url?scp=0030660739&partnerID=8YFLogxK
U2 - 10.1023/a:1007926723945
DO - 10.1023/a:1007926723945
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:0030660739
SN - 0922-6567
VL - 12
SP - 89
EP - 107
JO - Machine Translation
JF - Machine Translation
IS - 1-2
ER -