Datasets
-
Scholarly article citations in Wikipedia
About: This dataset includes a list of citations to scholarly articles from the most recent version of Wikipedia. License: All files included in this dataset are released under... -
Wikipedia Article Feedback corpus
This dataset contains the entire corpus of feedback submitted on the English, French and German Wikipedia during the Article Feedback v.5 pilot (AFT). The Wikimedia Foundation... -
Wikipedia new user registrations
Historical data on new user account registrations to the English Wikipedia and other large Wikipedias. -
Wikipedia user preferences
Data on user preferences set by active Wikipedia editors. Active editors are defined as registered users with at least 5 edits per month in a given project. The dumps were... -
Wikipedia Editor Engagement Experiments: Timestamp position modification
This experiment looks at the effects of linking to the revision history of Wikipedia articles with a prominent "last modified" timestamp. Currently, the only way for readers to... -
Wikipedia article ratings
A complete anonymized dump of 11M article ratings collected over 1 year (July 2011 - July 2012) from the English Wikipedia. -
Wikimedia VisualEditor
This dataset includes open data from the Wikimedia Foundation's VisualEditor project. Unless otherwise specified, all data is released in the public domain. -
Wikimedia Research Newsletter corpus
A curated corpus of references on Wikipedia and Wikimedia research, reviewed in the monthly Wikimedia Research Newsletter. -
Wikimedia Fundraiser Public Data
Public data about the Wikimedia Fundraiser. Data is refreshed every 15 minutes and includes the complete historical series since 2006. -
EPIC/Oxford Wikipedia quality assessment
This dataset comprises the full, anonymized set of responses from the blind assessment of a sample of Wikipedia articles across languages and disciplines by academic experts....