Abstract
We present the Uppsala submission to the CoNLL 2017 shared task on parsing from raw text to universal dependencies. Our system is a simple pipeline consisting of two components. The first performs joint word and sentence segmentation on raw text; the second predicts dependency trees from raw words. The parser bypasses the need for part-of-speech tagging, but uses word embeddings based on universal tag distributions. We achieved a macro-averaged LAS F1 of 65.11 in the official test run and obtained the 2nd best result for sentence segmentation with a score of 89.03. After fixing two bugs, we obtained an unofficial LAS F1 of 70.49.
Original language | English |
---|---|
Title of host publication | CoNLL 2017 - SIGNLL Conference on Computational Natural Language Learning, Proceedings of the CoNLL 2017 Shared Task |
Subtitle of host publication | Multilingual Parsing from Raw Text to Universal Dependencies |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 207-217 |
Number of pages | 11 |
ISBN (Electronic) | 9781945626708 |
DOIs | |
State | Published - 2017 |
Event | 2017 SIGNLL Conference on Computational Natural Language Learning- CoNLL Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, CoNLL 2017 - Vancouver, Canada Duration: 3 Aug 2017 → 4 Aug 2017 |
Publication series
Name | CoNLL 2017 - SIGNLL Conference on Computational Natural Language Learning, Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies |
---|
Conference
Conference | 2017 SIGNLL Conference on Computational Natural Language Learning- CoNLL Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, CoNLL 2017 |
---|---|
Country/Territory | Canada |
City | Vancouver |
Period | 3/08/17 → 4/08/17 |
Bibliographical note
Publisher Copyright:© 2017 Association for Computational Linguistics.
Funding
We are grateful to the shared task organizers and to Dan Zeman in particular, and we acknowledge the computational resources provided by CSC in Helsinki and Sigma2 in Oslo through NeIC-NLPL (www.nlpl.eu). Our parser will be made available in the NLPL dependency parsing laboratory.
Funders | Funder number |
---|---|
CSC in Helsinki and Sigma2 in Oslo |