Search for a Dataset - the Datahub

Add Dataset Import Data Package

xLiD-Lexica

Our xLiD-Lexica dataset in RDF (http://km.aifb.kit.edu/resources/xLiD-lexica.nt) contains about 300 million triples of cross-lingual groundings. It is extracted from Wikipedia...
Moby Lexicon

Description From home page: Moby Hyphenator 185,000 entries fully hyphenated mhyph.tar.Z [980kB] Moby Language Word lists in five of the world's great languages mlang.tar.Z...
Google Books Ngram

Here are the datasets backing the Google Books Ngram Viewer. These datasets were generated in July 2009; we will update these datasets as our book scanning continues, and the...
- CSV

You can also access this registry using the API (see API Docs).

3 datasets found