8
9

Lately there has been much research activity on topic models (e.g. latent dirichlet allocation, LDA). Is there any survey paper that describes many of the new extensions since LDA and discusses their connections? For example, Blei and Lafferty's 2009 review paper is a good start that discusses LDA and connections to Correlated Topic Models and Dynamic Topic Models. I am looking for an even more comprehensive survey covering more Models out there.

If possible I am looking for a machine learning oriented survey paper (as opposed to application-oriented), in order to keep pace with this rapidly developing field. Any pointers would be appreciated. Thanks!

asked Nov 16 '10 at 00:15

Kevin%20Duh's gravatar image

Kevin Duh
271459


7 Answers:

The following paper provides a very detailed review of topic models. It provides a chronological review with classification of models which is of great help.

Knowledge discovery through directed probabilistic topic models. a Survey

http://www.iiu.edu.pk/wp-content/uploads/downloads/faculties/fbas/cs/ali_daud/Ali_RP/2010-FCS%20Journal-Knowledge%20discovery%20through%20directed%20probabilistic%20topic%20Models.%20A%20Survey.pdf

answered Mar 09 '13 at 06:45

David%20Hawk's gravatar image

David Hawk
1

Another paper related to topic models (hierarchies of topics) would probably be interesting: nCRP

answered Nov 16 '10 at 14:59

Joseph%20Austerweil's gravatar image

Joseph Austerweil
331118

12

I think this is really necessary, but unfortunately I don't think there is one. Someone should try to do a NIPS or ICML tutorial on topic models, covering thinks like:

  1. pachinko allocation,
  2. HDP-LDA (which is not formally described by itself in any paper, I think),
  3. LDA+ngrams,
  4. LDA+hmm,
  5. Barzilay & Lee content models (which kind of are like topic models on sentences),
  6. held-out-likelihood evaluation (and the "reading tea leaves" evaluation as well),
  7. inference techniques (variational as in the original lda paper, collapsed gibbs as in griffiths and steyvers' finding scientific topics, uncollapsed gibbs for completeness, em, collapsed variational as in teh and welling, and [online variational]. (http://www.cs.princeton.edu/~blei/papers/HoffmanBleiBach2010b.pdf) as in hoffman, blei and bach). Some connections have been already explored in "On smoothing and inference in topic models", by Asuncion et al.
  8. the interface between topic models and supervised classifiers (blei et al supervised topic models; sauper, haghighi and barzilay "incorporating content structure in text analysis applications")
  9. Miscellaneous applications, like summarization, Fei-Fei Li's image work with lda (a sample here), and some interesting applications from the 2009 NIPS workshop, like financial topic models
  10. the connection (equivalence) between LDA and multinomial PCA

I hope this list helps you find solutions to your problem; most significant LDA work that I can remember off the top of my head is here, and I welcome additions/expansions (which is why this answer is marked as community wiki, which I've never quite figured out what is).

This answer is marked "community wiki".

answered Nov 16 '10 at 13:16

Alexandre%20Passos's gravatar image

Alexandre Passos ♦
2554154278421

edited Nov 16 '10 at 13:24

This is a great list to begin with. Thanks so much! (I'm not sure how to do the community wiki, though).

(Nov 17 '10 at 06:54) Kevin Duh

I think community wiki means you can edit this if you feel something else should be on the list, or the order should be different.

(Nov 17 '10 at 06:55) Alexandre Passos ♦

You might try to look into Blei's tutorial in Cambridge's summer school 2009, it is online in free lectures

answered Nov 16 '10 at 09:34

Leon%20Palafox's gravatar image

Leon Palafox ♦
40857194128

I found the following paper to be very enlightening:

G. Heinrich, Parameter estimation for text analysis, Technical report, Fraunhofer IGD, 15 September 2009. PDF

It shows a lot of details from the ground up...

Ivan

answered Nov 16 '10 at 09:11

Ivan%20Savov's gravatar image

Ivan Savov
301

Steyvers and Griffiths have a nice tutorial paper on topic models: http://psiexp.ss.uci.edu/research/papers/SteyversGriffithsLSABookFormatted.pdf

answered Nov 16 '10 at 02:46

priya%20venkateshan's gravatar image

priya venkateshan
1646812

Thanks--that's indeed a nice tutorial. I'm looking more for a technical survey, though. Lots of math and graphical models desired. :)

(Nov 16 '10 at 04:23) Kevin Duh

I'm not aware of other survey papers but David Mimno has created a nice bibliography on topic models (and I hope it will evolve with time).

answered Nov 16 '10 at 00:39

spinxl39's gravatar image

spinxl39
3698114869

Thanks. I hope it'll evolve with time too.

(Nov 16 '10 at 04:22) Kevin Duh
Your answer
toggle preview

powered by OSQA

User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.