Apertium

Description

"Apertium is a toolbox to build open-source shallow-transfer machine translation systems, especially suitable for related language pairs: it includes the engine, maintenance tools, and open linguistic data for several language pairs."

Language-pair data includes:

  • Spanish ⇆ Catalan (apertium-es-ca)
  • Spanish ← Romanian (apertium-es-ro)
  • French ⇆ Catalan (apertium-fr-ca)
  • Occitan ⇆ Catalan (apertium-oc-ca)
  • English ⇆ Galician (apertium-en-gl)
  • Swedish → Danish (apertium-sv-da)
  • Occitan ⇆ Spanish (apertium-oc-es)
  • Spanish ⇆ Portuguese (apertium-es-pt)
  • English ⇆ Catalan (apertium-en-ca)
  • English ⇆ Spanish (apertium-en-es)
  • English ⇆ Esperanto (apertium-en-eo)
  • Spanish ⇆ Galician (apertium-es-gl)
  • French ⇆ Spanish (apertium-fr-es)
  • Esperanto ← Spanish (apertium-eo-es)
  • Welsh → English (apertium-cy-en)
  • Breton → French (apertium-br-fr)
  • Esperanto ← Catalan (apertium-eo-ca)
  • Portuguese ⇆ Catalan (apertium-pt-ca)
  • Portuguese ⇆ Galician (apertium-pt-gl)
  • Basque → Spanish (apertium-eu-es)
  • Norwegian Nynorsk ⇆ Norwegian Bokmål (apertium-nn-nb)

The above are the "released" language pairs, data includes:

  • dictionaries for morphological analysis and generation
  • disambiguation (statistical models, rules, in some cases Constraint Grammars)
  • bilingual (transfer) dictionaries
  • structural transfer rules

There is also a lot of data of the above kinds for unreleased language pairs, eg. Icelandic → English, North Sámi → Lule Sámi; and tools to maintain such data.

License

COPYING file in language pair data archive contains a copy of the GPL.

Data and Resources

Additional Info

Field Value
Source http://www.apertium.org/
Last Updated October 10, 2013, 19:44 (UTC)
Created April 4, 2008, 23:54 (UTC)