-
SSF
Syntactic and semantic framework of the Croatian language -
Bibliography of Linguistic Literature (BLL) Thesaurus
The Thesaurus of the Bibliography of Linguistic Literature (BLL Thesaurus) represents a comprehensive bilingual vocabulary for indexing and documentation of linguistically... -
GeoWordNet
GeoWordNet is a semantic resource built from the full integration of WordNet, GeoNames and the Italian part of MultiWordNet. GeoWordNet Public Dataset contains 3,698,238... -
KORE 50 NIF NER Corpus
KORE 50 (AIDA) is a subset of the larger AIDA corpus, which is based on the dataset of the CoNLL 2003 NER task. The dataset aims to capture hard-to-disambiguate mentions of... -
EMN
The Terminology of the European Migration Network in RDF -
Wikilinks RDF/NIF
The Wikilinks corpus is a very large-scale coreference resolution corpus. It contains over 40 million mentions of over 3 million entities. Mentions are manually labeled links... -
News-100 NIF NER Corpus
This corpus comprises 100 German news articles from the online news platform news.de. All of the articles were published in 2010 and contain the word Golf. This word... -
RSS-500 NIF NER Corpus
This corpus has been created using a dataset comprising a list of 1,457 RSS feeds as compiled in (Goldhahn et al. 2012). The list includes all major worldwide newspapers and a... -
DBpedia Spotlight NIF NER Corpus
Based on P. N. Mendes, M. Jakob, A. García-Silva, and C. Bizer. DBpedia Spotlight: shedding light on the web of documents. In Proc. of the 7th Int. Conf. on Semantic Systems,... -
Reuters-128 NIF NER Corpus
This English corpus is based on the well-known Reuters-21578 corpus, which contains economic news articles. In particular, we chose 128 articles containing at least one NE.... -
SweFN-RDF
Swedish FrameNet (SweFN), a lexical-semantic resource in RDF. -
SALDOM-RDF
SALDO morphology, a morphological Swedish lexicon in RDF. -
LEXIN-RDF
Lexin, a bilingual dictionary in RDF. -
TalkBank
The goal of TalkBank is to foster fundamental research in the study of human and animal communication. It will construct sample databases within each of... -
French TimeBank
The French TimeBank consists of a set of 109 journalistic articles from 7 different sub-genres annotated according to the ISO-TimeML standard, adapted for the French language.... -
Automated Similarity Judgment Program lexical data
ASJP collects 40 words from 5500 languages in a simplified phonetic representation. More background can be found at http://email.eva.mpg.de/~wichmann/ASJPHomePage.htm -
World Loanword Database
The World Loanword Database, edited by Martin Haspelmath and Uri Tadmor, is a scientific publication by the Max Planck Digital Library, Munich (2009). It provides vocabularies...