|
I have read lda papers for a while, i'm so much confused right now. LDA is used for topic model, but these topic reprented by words can't be used directly, they are ususlly meaning-less words. So, LDA is nothing different from other document clustering methods, but with more complex model and inference, maybe, more interpretable, then , why is lda so hot?? what's its benefit? |
|
LDA works pretty good when you compare it to other document clustering methods. It is a pretty decent generative process overall, and in his original paper, Blei clearly explained differences and advantages over different models. It is far from perfect, and that is the reasion there are so many extension concerning its inference and the overall structure of the model. It is a great way to model topics not only in documents, but it can also be applied to a whole different set of different problems. |
|
LDA is not exactly a document clustering method, it's easier to think of it as a discrete matrix decomposition method, something like PCA or LSA but regularized and dealing directly with probabilities that works well for discrete data. Some big uses of LDA, I think, are:
3ks a lot! besides your 3 points listed above, in blei's original paper, it seems that lda benefit a lot from its "generative", so what's the benefit of the generative or it's just used for inference?
(Mar 03 '11 at 09:20)
Fischer Yu
Since it's generative (i.e. a joint density estimation problem), it can be trained easily on unlabeled data. Discriminative/conditional models typically need quite a lot of labeled data.
(Mar 03 '11 at 09:43)
Oscar Täckström
|