Abstract
We report a new baseline for a Danish word intrusion task by combining pre-trained offthe-shelf word, subword and knowledge graph embedding models. We test fastText, Byte-Pair Encoding, BERT and the knowledge graph embedding in Wembedder, finding fastText as the individual model with the superior performance, while a simple combination of the fastText with other models can slightly improve the accuracy of finding the odd-one-out words in the word
intrusion task.
intrusion task.
Original language | English |
---|---|
Title of host publication | Proceedings of the 15th Conference on Natural Language Processing |
Publisher | Association for Computational Linguistics |
Publication date | 2019 |
Pages | 237-240 |
Publication status | Published - 2019 |
Event | 15th Conference on Natural Language Processing - Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany Duration: 9 Oct 2019 → 11 Oct 2019 https://2019.konvens.org/ |
Conference
Conference | 15th Conference on Natural Language Processing |
---|---|
Location | Friedrich-Alexander-Universität Erlangen-Nürnberg |
Country/Territory | Germany |
City | Erlangen |
Period | 09/10/2019 → 11/10/2019 |
Internet address |