Hebrew dependency parsing: Initial results

Y. Goldberg, Michael Elhadad

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

Abstract

We describe a newly available Hebrew Dependency Treebank, which is extracted from the Hebrew (constituency) Treebank. We establish baseline unlabeled dependency parsing performance on Hebrew, based on two state-of-the-art parsers, MST-parser and MaltParser. The evaluation is performed both in an artificial setting, in which the data is assumed to be properly morphologically segmented and POS-tagged, and in a real-world setting, in which parsing is performed on automatically segmented and POS-tagged text. We present an evaluation measure that takes into account the possibility of incompatible token segmentation between the gold standard and the parsed data. Results indicate that (a) MST-parser performs better on Hebrew data than MaltParser, and (b) neither parser makes good use of morphological information when parsing Hebrew.
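The abstract does not spell out the segmentation-aware evaluation measure, so the following is only an illustrative sketch of one way such a measure could work, not the definition used in the paper. It assumes that each segment is identified by its source token index plus its surface form, that an arc counts as correct only when both the dependent and the head match a gold segment exactly, and that scores are reported as precision, recall, and F1 over arcs. All names (`arcs_from_parse`, `segmentation_aware_scores`) and the transliterated example words are hypothetical.

```python
from typing import List, Optional, Set, Tuple

# Assumption: a segment is identified by (index of the whitespace token it
# came from, surface form of the segment). This is a simplification.
Segment = Tuple[int, str]
# An unlabeled arc: (dependent segment, head segment); head is None for the root.
Arc = Tuple[Segment, Optional[Segment]]


def arcs_from_parse(segments: List[Segment], heads: List[int]) -> Set[Arc]:
    """Turn a parse into a set of arcs anchored on segment identities.

    heads[i] is the position (in `segments`) of the head of segment i,
    or -1 if segment i is the root.
    """
    return {
        (segments[i], segments[h] if h >= 0 else None)
        for i, h in enumerate(heads)
    }


def segmentation_aware_scores(
    gold: Set[Arc], pred: Set[Arc]
) -> Tuple[float, float, float]:
    """Precision, recall and F1 over arcs.

    Because the two segmentations may differ in length, plain per-token
    attachment accuracy is undefined; set overlap still is.
    """
    correct = len(gold & pred)
    precision = correct / len(pred) if pred else 0.0
    recall = correct / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical two-token sentence. Gold splits token 1 into a prefix
# and a noun; the predicted segmentation leaves it as a single unit.
gold_arcs = arcs_from_parse(
    [(0, "hlk"), (1, "l"), (1, "bit")],
    [-1, 0, 1],
)
pred_arcs = arcs_from_parse(
    [(0, "hlk"), (1, "lbit")],
    [-1, 0],
)
precision, recall, f1 = segmentation_aware_scores(gold_arcs, pred_arcs)
```

In this example only the root arc survives the segmentation mismatch, so the missed split lowers recall more than precision. Under gold segmentation the two arc sets have equal size, and precision, recall, and F1 all reduce to ordinary unlabeled attachment accuracy.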
Original language: American English
Title of host publication: 11th International Conference on Parsing Technologies
Publisher: Association for Computational Linguistics
State: Published - 2009

Bibliographical note

Place of conference: France
