3 datasets found

Licenses: Other (Not Open)
Tags: text

  • Spinn3r Indexing the Blogosphere

    Spinn3r is a web service for indexing the blogosphere. We provide raw access to every blog post as it is published, in real time. We provide the data, and you can focus on...
  • OpenThesis

    From the website: OpenThesis is a free repository of theses, dissertations, and other academic documents, coupled with powerful search, organization, and collaboration tools....
  • Microsoft Web N-Gram Service

    Microsoft has developed services based on n-grams from Bing's entire en_US corpus. The publicly available raw data include two files with the top 100k words from this...
You can also access this registry using the API (see API Docs).