41 datasets found

Tags: language

Filter Results
  • WikiWord

    About Overview: WikiWord is a system for building a multilingual Thesaurus by extracting lexical and semantic information from Wikipedia. It was originally developed for a...
  • The Speech Accent Archive

    From website: The speech accent archive uniformly presents a large set of speech samples from a variety of language backgrounds. Native and non-native speakers of English read...
  • Spanish Verb Database

    Fred Jehle, formerly a professor at Indiana University-Purdue University Fort Wayne, published approximately 600 verbs, fully conjugated in all moods and tenses, on his website...
  • POS Tagger for Romanian Language

    We have developed a hidden Markov model-based part-of-speech tagger for the Romanian language. Our interactive web tool is located at...
  • Polygotte

    This dataset has no description

  • Perseus Digital Library

    Description Started in 1987 focusing on classics but has now expanded to other areas. Though containing a large amount of material site itself is focused on being a digital...
  • MOCHA-TIMIT

    About Authors: Alan Wrench, Queen Margaret University College. Funded by: Engineering and Physical Sciences Research Council. When created: November 1999. Purpose:...
  • Language Commons

    This dataset has no description

  • The Kids Open Dictionary Builder

    About From the creators: The purpose of this project is to create a free, open simple dictionary for students to use. The words in the dictionary will reviewed for quality and...
  • ISO language, territory, currency codes and their translations

    This is a set of ISO codes including those for country and currency collected together into a useful package by the Debian project. From the package page: This package provides...
  • ISO 639-3 - Codes for the Representation of Names of Languages

    About ISO 639-3 is a list of three letter codes for languages: ISO 639-3 attempts to provide as complete an enumeration of languages as possible, including living, extinct,...
  • FSI Language Courses

    About From website: Welcome to fsi-language-courses.com, the home for language courses developed by the Foreign Service Institute. These courses were developed by the United...
  • Europarl Parallel Corpus

    Description Overview from home page: The Europarl parallel corpus is extracted from the proceedings of the European Parliament. It includes versions in 11 European languages:...
  • Dict.cc - English German Dictionary

    About From about page: dict.cc is not only an online dictionary. It's an attempt to create a platform where users from all over the world can share their knowledge in the field...
  • Apertium

    Description "Apertium is a toolbox to build open-source shallow-transfer machine translation systems, especially suitable for related language pairs: it includes the engine,...
You can also access this registry using the API (see API Docs).