I am working on Honglak Lee paper http://web.eecs.umich.edu/~honglak/nips09-AudioConvolutionalDBN.pdf (section 2.2 of the paper). My question is, after spectrogram, I get a 2-dimensional matrix, and after PCA whitening, I retain (say) 100 components. So, I will have a matrix of size N-by-100, where N is the number of time bins. Should I apply the pca to each and every audio file separately, and then, concatenate all the matrices (i.e. 10 matrices for 10 files after pca) to apply them to a DBN or an RBM? For each audio file, the size of the matrix after pca is not the same i.e. N is not the same. Does this effect the RBM performance?

asked Dec 02 '13 at 09:57

chinaali's gravatar image

chinaali
1333

edited Dec 03 '13 at 08:09

Be the first one to answer this question!
toggle preview

powered by OSQA

User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.