Abstract
Pretrained Language Models (LMs) have been shown to possess significant linguistic, common sense and factual knowledge. One form of knowledge that has not been studied yet in this context is information about the scalar magnitudes of objects. We show that pretrained language models capture a significant amount of this information but are short of the capability required for general common-sense reasoning. We identify contextual information in pre-training and numeracy as two key factors affecting their performance, and show that a simple method of canonicalizing numbers can have a significant effect on the results.
| Original language | English |
|---|---|
| Title of host publication | Findings of the Association for Computational Linguistics Findings of ACL |
| Subtitle of host publication | EMNLP 2020 |
| Publisher | Association for Computational Linguistics (ACL) |
| Pages | 4889-4896 |
| Number of pages | 8 |
| ISBN (Electronic) | 9781952148903 |
| DOIs | |
| State | Published - 2020 |
| Event | Findings of the Association for Computational Linguistics, ACL 2020: EMNLP 2020 - Virtual, Online Duration: 16 Nov 2020 → 20 Nov 2020 |
Publication series
| Name | Findings of the Association for Computational Linguistics Findings of ACL: EMNLP 2020 |
|---|
Conference
| Conference | Findings of the Association for Computational Linguistics, ACL 2020: EMNLP 2020 |
|---|---|
| City | Virtual, Online |
| Period | 16/11/20 → 20/11/20 |
Bibliographical note
Publisher Copyright:© 2020 Association for Computational Linguistics
Fingerprint
Dive into the research topics of 'Do language embeddings capture scales?'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver