2 datasets found

Tags: Web Crawl

Filter Results
  • Web Tables

    This page provides a large corpus of HTML tables for public download. The corpus has been extracted from the 2012 version of the Common Crawl and contains 147 million relational...
  • RDFa, Microdata, and Microformat Data Set

    More and more websites have started to embed structured data describing products, people, organizations, places, events into their HTML pages using markup standards such as...
You can also access this registry using the API (see API Docs).