TY - JOUR
T1 - Building textual entailment specialized data sets: A methodology for isolating linguistic phenomena relevant to inference
AU - Bentivogli, Luisa
AU - Cabrio, Elena
AU - Dagan, Ido
AU - Giampiccolo, Danilo
AU - Lo Leggio, Medea
AU - Magnini, Bernardo
PY - 2010/1/1
Y1 - 2010/1/1
N2 - This paper proposes a methodology for the creation of specialized data sets for Textual Entailment, made of monothematic Text-Hypothesis pairs (i.e. pairs in which only one linguistic phenomenon relevant to the entailment relation is highlighted and isolated). The annotation procedure assumes that humans have knowledge about the linguistic phenomena relevant to inference, and a classification of such phenomena both into fine-grained and macro categories is suggested. We experimented with the proposed methodology over a sample of pairs taken from the RTE-5 data set, and investigated critical issues arising when entailment, contradiction or unknown pairs are considered. The result is a new resource, which can be profitably used both to advance the comprehension of the linguistic phenomena relevant to entailment judgments and to make a first step towards the creation of large-scale specialized data sets.
UR - http://www.scopus.com/inward/record.url?scp=84925410386&partnerID=8YFLogxK
M3 - Article
JO - Proceedings of the 7th International Conference on Language Resources and Evaluation, LREC 2010
JF - Proceedings of the 7th International Conference on Language Resources and Evaluation, LREC 2010
ER -