Þórunn Arnardóttir and Anton Karl Ingason
Proceedings of CLARIN 2020. Costanza Navarretta & Maria Eskevich (Eds.), pp. 48-51.
Publication year: 2020

We present a machine parsing pipeline for Icelandic that uses the Berkeley Neural Parser and covers every step needed to parse plain Icelandic text, producing output annotated according to the IcePaHC annotation scheme. The parser is fast and achieves an F1 score of 84.74. We describe the training and evaluation of the new parsing model and the structure of the parsing pipeline. All scripts needed to parse plain text with the pipeline are openly available via the CLARIN repository and GitHub.


