3
1

In one of his recent talks Andrew Ng mentioned several state-of-the-art results:

slide

However, there were no references, and his online list of publications does not seem to include anything about learning on video. Any idea where I can read more about these?

On a slightly separate subject, are people interested in collaborating on maintaining a comprehensive list of ML benchmarks and the most interesting results on them (like this slide, but more comprehensive and with more detail)? Should this be a table on Wikipedia?

Edit: fixed slide, emphasis added

asked Sep 09 '11 at 17:22

Oleg%20Trott's gravatar image

Oleg Trott
24681016

edited Sep 12 '11 at 16:39


4 Answers:

I'm not sure if this is what you are asking about but I'd start with Ivan Laptev's work, which apparently was the previous state of the art on Hollywood2. Or are you asking specifically about the stanford feature learning results?

To the second question: This would indeed be very helpful. I think there have been some efforts but I don't know of anything that really "made it". Though I feel this shouldn't be too hard if enough people collaborate.

answered Sep 09 '11 at 17:46

Andreas%20Mueller's gravatar image

Andreas Mueller
2686185893

Yes, I'm interested in finding references for the four "video" results. Perhaps they weren't published by Andrew Ng, but by someone else at Stanford?

(Sep 12 '11 at 16:32) Oleg Trott

I think "Stanford Feature Learning" is a branded name for a variety of techniques used in recent publications on unsupervised feature extraction by members of Andrew Ng's lab. I think a majority of them are based on deep learning models (e.g. Deep Belief Networks or Stacked Denoising Autoencoders with a sparsity constraint) with or without convolutional variants & pooling when it makes sense (e.g. for 1D audio and 2D images). Some of them are not even "deep": the CIFAR entry is probably referring to recent work by Adam Coates on using k-means centers or random samples carefully normalized / whitened and then used as a dictionary for a simple thresholed dot-product as sparse encoder.

answered Sep 11 '11 at 17:49

ogrisel's gravatar image

ogrisel
498995591

edited Sep 12 '11 at 07:37

answered Sep 12 '11 at 19:10

Mark%20Alen's gravatar image

Mark Alen
1323234146

Your answer
toggle preview

powered by OSQA

User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.