-
U.S. Securities and Exchange Commission Corporate Ownership RDF Data (rdfabout)
Data exposed: corporate ownership Size of dump and data set: 1.8 million triples Notes: also found in the of SPARQL Endpoints -
Freebase
Description "Freebase is an open database of the world?s information. It is built by the community and for the community?free for anyone to query, contribute to, built... -
The CIA World Factbook
Description US government profiles of countries and territories around the world. Information on geography, people, government, transportation, economy, communications, etc.... -
World Values Survey
Description Large global surveys of 'values' taking place every five years since 1990 described on its website as "The world's most comprehensive investigation of Political and... -
Statistical Abstract of the United States
Description From website: The Statistical Abstract of the United States, published since 1878, is the authoritative and comprehensive summary of statistics on the social,... -
US Copyright Renewal Database
Released in 2008 and funded by Hewlett Foundation. From front page: ... This database makes searchable the copyright renewal records received by the US Copyright Office between... -
UK - Office of National Statistics
Description "Free access to data produced by the Office for National Statistics, government departments and devolved administrations." Datasets Census: links to lists of all... -
TV IV: a compendium of television knowledge anyone can edit.
This dataset has no description
-
SwetoDblp
Data exposed: ontology focused on bibliography data of publications from DBLP with additions that include affiliations, universities, and publishers Size of dump and data set:... -
Reference Database of Immune Cells
Description From home page: "RefDIC is an open-access database of quantitative mRNA/Protein profiles specifically for immune cells." From... -
RCSB Protein Data Bank
Description As of August 2008 over 52 thousand structures available for download. From home page: The RCSB PDB provides a variety of tools and resources for studying the... -
NBER US Patent Citation Database
Detailed information on almost 3 million U.S. patents granted between January 1963 and December 1999, all citations made to these patents between 1975 and 1999 (over 16... -
OpenGuides (tm): The Guides Made by You
Description From front page: OpenGuides™ is a network of free, community-maintained wiki guidebooks to places around the world. Anyone is free to contribute, whether it's by... -
Numbrary
Description Not a producer of data but focused on extracting and aggregating data from other sources. Openness: OPEN License: no explicit license used but all underlying data... -
The National Public Transport Data Repository (traveline)
Description Data created by traveline and used by (among others) transportdirect. From http://www.pti.org.uk/repository.htm: The third snapshot of the traveline data was taken... -
Werner Icking Music Archive
Description Lots of sheet music. While quite a bit has source files much only seems to be in pdf. Openness: OPEN License: not specified but strongly appears to be open plus... -
MovieLens Data Sets
This data set contains 10000054 ratings and 95580 tags applied to 10681 movies by 71567 users of the online movie recommender service MovieLens. Users were selected at random... -
The Mondial Database
From home page: The MONDIAL database has been compiled from geographical Web data sources listed below: CIA World Factbook, a predecessor of Global Statistics which has been... -
Internet Movie Database
Large film/movie database claiming: 425,000+ titles 1,700,000 + filmographies of cast and crew members Films from 1891 to Present Foreign and independent movies, television... -
HapMap
Description The International HapMap Project is a partnership of scientists and funding agencies from Canada, China, Japan, Nigeria, the United Kingdom and the United States to... -
GenBank - NIH genetic sequence database
Description From the main page: GenBank® is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences (Nucleic Acids Research, 2008... -
Galaxy Zoo 2
The Galaxy Zoo files contain almost a quarter of a million galaxies which have been imaged with a camera attached to a robotic telescope (the Sloan Digital Sky Survey, no less).... -
Flossmetrics - Free Libre and Open Source Software Metrics
Description From front page: The main objective of FLOSSMETRICS is to construct, publish and analyse a large scale database with information and metrics about libre software... -
Statistics Database of Food and Agricultural Organisation of the United Nations
FAOSTAT provides time-series and cross sectional data relating to food and agriculture for some 200 countries. Openness: ? No explicit license No bulk download Seems you need... -
Federal Aviation Administration - Data and Statistics
Description From main page: Accident & Incident Reports Preliminary Data Final Data More » Accident & Incident Data Aviation Data & Statistics Airline On-Time...