1 dataset found

Licenses: Other (Open) Tags: text

Filter Results
  • english-gigaword

    This is a recipe to train word n-gram language models using the newswire text provided in the English Gigaword corpus (1200M words of NYT, APW, AFE, XIE). It also prepares...
You can also access this registry using the API (see API Docs).