-
xLiD-Lexica
Our xLiD-Lexica dataset in RDF (http://km.aifb.kit.edu/resources/xLiD-lexica.nt) contains about 300 million triples of cross-lingual groundings. It is extracted from Wikipedia... -
Moby Lexicon
Description From home page: Moby Hyphenator 185,000 entries fully hyphenated mhyph.tar.Z [980kB] Moby Language Word lists in five of the world's great languages mlang.tar.Z... -
Google Books Ngram
Here are the datasets backing the Google Books Ngram Viewer. These datasets were generated in July 2009; we will update these datasets as our book scanning continues, and the...
