0
1

Is the there any published result showing how to do phone recognition (or possibly any other speech recognition task) using Stacked Autoencoder instead of DBN (as done by Mohamed et al., 2009)?

As discussed in a previews thread, pre-training a Deep Neural Network with Autoencoders instead of RBMs should be mainly a matter of personal preference and thus is expected to yield equivalent performance.

I'd like to use the Pylearn2 Stacked Autoencoers GPU implementations (or maybe the deeplearning.net implemention) for phone recognition, but since I haven't found anyone who claim to have done it before, I'm afraid there must be something which makes this task harder than I'm naivly expecting.

asked Oct 04 '13 at 02:08

Saul%20Berardo's gravatar image

Saul Berardo
66127


One Answer:

It don't see why it would be hard at all. For large speech databases, pre-training only helps a tiny bit anyway.

answered Oct 07 '13 at 23:11

gdahl's gravatar image

gdahl ♦
341453559

Your answer
toggle preview

powered by OSQA

User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.