3 datasets found

Licenses: Creative Commons CCZero
Formats: text/tab-separated-values
Organisations: Wikimedia

  • Wikimedia user agents

    A dataset of parsed reader and editor browser agents from the Wikimedia web properties. The intent behind releasing the parsed agents is to make it easier for Wikimedia...
  • Scholarly article citations in Wikipedia

    About: This dataset includes a list of citations to scholarly articles from the most recent version of Wikipedia. License: All files included in this dataset are released under...
  • Wikichallenge - Training

    This is a non-random dataset containing the edit histories of about 47,000 editors. It can be used for machine learning; the outcome variable is the number of...
You can also access this registry using the API (see API Docs).