Validating Danish Wikidata lexemes

Finn Årup Nielsen, Katherine Thornton, Jose Emilio Labra Gayo

Research output: Contribution to journalConference articleResearchpeer-review

1 Downloads (Pure)

Abstract

Two of the newest features of Wikidata are support for lexicographic data (lexemes), and support for Shape Expressions (ShEx). We demonstrate the first application of ShEx for validation of entity data for Wikidata lexemes. Validation of entity data in Wikidata against ShEx schemas allows editors to discover missing or incorrect information. It may also form a basis for discussion of the data models implicitly used in Wikidata. We present a use case and benchmark for ShEx and discuss its current limitations.

Original languageEnglish
JournalCEUR Workshop Proceedings
Volume2451
ISSN1613-0073
Publication statusPublished - 1 Jan 2019
Event15th International Conference on Semantic Systems - Karlsruhe, Germany
Duration: 9 Sep 201912 Sep 2019

Conference

Conference15th International Conference on Semantic Systems
CountryGermany
CityKarlsruhe
Period09/09/201912/09/2019
SponsorChinese Academy of Sciences, eccenca GmbH, et al., Leibniz Institute for Information Infrastructure, PoolParty, Semiodesk

Cite this

Nielsen, F. Å., Thornton, K., & Gayo, J. E. L. (2019). Validating Danish Wikidata lexemes. CEUR Workshop Proceedings, 2451.
Nielsen, Finn Årup ; Thornton, Katherine ; Gayo, Jose Emilio Labra. / Validating Danish Wikidata lexemes. In: CEUR Workshop Proceedings. 2019 ; Vol. 2451.
@inproceedings{010e2ea3fb0f4e398c2d06ed11f8df8b,
title = "Validating Danish Wikidata lexemes",
abstract = "Two of the newest features of Wikidata are support for lexicographic data (lexemes), and support for Shape Expressions (ShEx). We demonstrate the first application of ShEx for validation of entity data for Wikidata lexemes. Validation of entity data in Wikidata against ShEx schemas allows editors to discover missing or incorrect information. It may also form a basis for discussion of the data models implicitly used in Wikidata. We present a use case and benchmark for ShEx and discuss its current limitations.",
author = "Nielsen, {Finn {\AA}rup} and Katherine Thornton and Gayo, {Jose Emilio Labra}",
year = "2019",
month = "1",
day = "1",
language = "English",
volume = "2451",
journal = "CEUR Workshop Proceedings",
issn = "1613-0073",
publisher = "CEUR-WS",

}

Nielsen, FÅ, Thornton, K & Gayo, JEL 2019, 'Validating Danish Wikidata lexemes', CEUR Workshop Proceedings, vol. 2451.

Validating Danish Wikidata lexemes. / Nielsen, Finn Årup; Thornton, Katherine; Gayo, Jose Emilio Labra.

In: CEUR Workshop Proceedings, Vol. 2451, 01.01.2019.

Research output: Contribution to journalConference articleResearchpeer-review

TY - GEN

T1 - Validating Danish Wikidata lexemes

AU - Nielsen, Finn Årup

AU - Thornton, Katherine

AU - Gayo, Jose Emilio Labra

PY - 2019/1/1

Y1 - 2019/1/1

N2 - Two of the newest features of Wikidata are support for lexicographic data (lexemes), and support for Shape Expressions (ShEx). We demonstrate the first application of ShEx for validation of entity data for Wikidata lexemes. Validation of entity data in Wikidata against ShEx schemas allows editors to discover missing or incorrect information. It may also form a basis for discussion of the data models implicitly used in Wikidata. We present a use case and benchmark for ShEx and discuss its current limitations.

AB - Two of the newest features of Wikidata are support for lexicographic data (lexemes), and support for Shape Expressions (ShEx). We demonstrate the first application of ShEx for validation of entity data for Wikidata lexemes. Validation of entity data in Wikidata against ShEx schemas allows editors to discover missing or incorrect information. It may also form a basis for discussion of the data models implicitly used in Wikidata. We present a use case and benchmark for ShEx and discuss its current limitations.

M3 - Conference article

VL - 2451

JO - CEUR Workshop Proceedings

JF - CEUR Workshop Proceedings

SN - 1613-0073

ER -