Read the Web

This data includes facts extracted from 500 million web pages.

From the project's website:

To build a never-ending machine learning system that acquires the ability to extract structured information from unstructured web pages. If successful, this will result in a knowledge base (i.e., a relational database) of structured information that mirrors the content of the Web. We call this system NELL (Never-Ending Language Learner).

Data and Resources

Additional Info

Field Value
Source http://rtw.ml.cmu.edu/rtw/resources
Author Carnegie Mellon University
Last Updated October 10, 2013, 23:23 (UTC)
Created June 26, 2011, 10:39 (UTC)