UCL Applied Machine Learning 1

This data comes from Kaggle.com, a machine learning competition site.

This is a prediction problem in which there are 50000 datapoints (vectors), each datapoint being itself a sparse 50000 dimensional vector.

About the data

xtrain.txt contains the training data. Each row specifies which 10 elements of the training vector contain 1, with all other entries being 0.

ytrain.txt contains the training value.

xtest.txt contains the test data.

Data and Resources

Additional Info

Field Value
Source http://inclass.kaggle.com/c/UCLAML1
Last Updated October 10, 2013, 23:59 (UTC)
Created October 9, 2011, 08:01 (UTC)