Abstract
Recent work has explored the syntactic abilities of RNNs using the subject-verb agreement task, which diagnoses sensitivity to sentence structure. RNNs performed this task well in common cases, but faltered in complex sentences (Linzen et al., 2016). We test whether these errors are due to inherent limitations of the architecture or to the relatively indirect supervision provided by most agreement dependencies in a corpus. We trained a single RNN to perform both the agreement task and an additional task, either CCG supertagging or language modeling. Multitask training led to significantly lower error rates, in particular on complex sentences, suggesting that RNNs have the ability to evolve more sophisticated syntactic representations than shown before. We also show that easily available agreement training data can improve performance on other syntactic tasks, in particular when only a limited amount of training data is available for those tasks. The multi-task paradigm can also be leveraged to inject grammatical knowledge into language models.
| Original language | English |
|---|---|
| Title of host publication | CoNLL 2017 - 21st Conference on Computational Natural Language Learning, Proceedings |
| Publisher | Association for Computational Linguistics (ACL) |
| Pages | 3-14 |
| Number of pages | 12 |
| ISBN (Electronic) | 9781945626548 |
| DOIs | |
| State | Published - 2017 |
| Event | 21st Conference on Computational Natural Language Learning, CoNLL 2017 - Vancouver, Canada Duration: 3 Aug 2017 → 4 Aug 2017 |
Publication series
| Name | CoNLL 2017 - 21st Conference on Computational Natural Language Learning, Proceedings |
|---|
Conference
| Conference | 21st Conference on Computational Natural Language Learning, CoNLL 2017 |
|---|---|
| Country/Territory | Canada |
| City | Vancouver |
| Period | 3/08/17 → 4/08/17 |
Bibliographical note
Publisher Copyright:© 2017 Association for Computational Linguistics.
Funding
We thank Emmanuel Dupoux for discussion. This research was supported by the European Research Council (grant ERC-2011-AdG 295810 BOOT-PHON), the Agence Nationale pour la Recherche (grants ANR-10-IDEX-0001-02 PSL and ANR-10-LABX-0087 IEC) and the Israeli Science Foundation (grant number 1555/15).
| Funders | Funder number |
|---|---|
| Israeli Science Foundation | 1555/15 |
| European Commission | ERC-2011-AdG 295810 BOOT-PHON |
| Agence Nationale de la Recherche | ANR-10-IDEX-0001-02 PSL, ANR-10-LABX-0087 IEC |
Fingerprint
Dive into the research topics of 'Exploring the syntactic abilities of RNNs with multi-task learning'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver