2 datasets found

Licenses: Other (Not Open) Tags: pdds

Filter Results
  • Spinn3r Indexing the Blogosphere

    Spinn3r is a web service for indexing the blogosphere. We provide raw access to every blog post being published - in real time. We provide the data, and you can focus on...
  • Web 1T 5-gram Version 1

    This data set, contributed by Google Inc., contains English word n-grams and their observed frequency counts. The length of the n-grams ranges from unigrams (single words) to...
You can also access this registry using the API (see API Docs).