Netflix withdrew the Netflix Challenge dataset, I think after some researchers found that a few users in the dataset could be de-anonymized by correlating their ratings with reviews posted on IMDb.

However, you can still find the dataset in various places by Googling for it.

Are there any legal risks for researchers who continue to use it in their machine learning projects?

asked May 04 '14 at 12:03

Matthew Koichi Grimes


User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.