Entity Linking Benchmark Dutch Proceedings 1999-2012

This open-data Entity Linking (EL) benchmark was introduced in: Olieman, A., Kamps, J., Marx, M., and Nusselder, A. (2015). A Hybrid Approach to Domain-Specific Entity Linking. Proceedings of the Posters and Demos Track of the 11th International Conference on Semantic Systems. The benchmark consists of a sample of Dutch parliamentary proceedings from the period 1999-2014, gold standard annotations for that sample, the annotations produced by EL systems, and accompanying code.

Software

The accompanying code is hosted on Bitbucket and is largely self-documenting. Additional documentation is kept in the project wiki.

Annotation Guidelines

We provide the Dutch original and an English translation of the gold standard annotation guidelines that were used to produce this benchmark.

Datasets

The files in this dataset are provided in the MongoDB Extended JSON format, which allows them to be conveniently imported with mongoimport, and used with the provided code.
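
For readers who want to inspect the files without a MongoDB instance, the following is a minimal Python sketch of reading Extended JSON with the standard library only. It is not part of the accompanying code. It assumes one JSON document per line (the default output layout of mongoexport) and handles only the `$oid`, `$numberLong`, and `$date` wrappers, with dates given either as milliseconds since the epoch or as an ISO 8601 string; the function names `decode_extended` and `iter_documents` are hypothetical, and the actual files may use other wrappers or date formats.

```python
import json
from datetime import datetime, timezone


def decode_extended(obj):
    """Unwrap the few Extended JSON wrappers this sketch knows about."""
    if len(obj) == 1:
        if "$oid" in obj:
            # Keep the ObjectId as its hexadecimal string.
            return obj["$oid"]
        if "$numberLong" in obj:
            return int(obj["$numberLong"])
        if "$date" in obj:
            value = obj["$date"]
            if isinstance(value, (int, float)):
                # Milliseconds since the Unix epoch.
                return datetime.fromtimestamp(value / 1000, tz=timezone.utc)
            if isinstance(value, str):
                # ISO 8601 string; a trailing "Z" means UTC.
                return datetime.fromisoformat(value.replace("Z", "+00:00"))
    # Anything else is returned unchanged.
    return obj


def iter_documents(lines):
    """Yield one decoded document per non-blank line."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line, object_hook=decode_extended)
```

Because `iter_documents` accepts any iterable of strings, an open file handle can be passed to it directly. Importing with mongoimport remains the route intended for use with the provided code.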

The gold standard and system annotations will also be made available in the NLP Interchange Format (NIF), for use with existing EL benchmarking frameworks. This is a work in progress.

Additional Info

Author: Alex Olieman
Last Updated: September 6, 2015, 23:18 (UTC)
Created: May 30, 2015, 21:37 (UTC)
Volunteer: Evelijn Martinius