|
We know that, by using the Stick-Breaking construction, a dirichlet process random draws G such that: G~DP(alpha,H) A set of random draws of G is defined by: G=sum(pi_k*delta_theta_k) from k=1 to infinity Where: pi_k=beta_k*prod(1-beta_l) from l=1 to k-1 and beta_k are random draws from a beta distribution. delta_theta_k are points of energy located at the random draws from the base distribution H. I have some questions:
Thanks A lot |
Ohhhhh, now I get it, that is the reason of the location variable (delta), other way it would be a normal summation and thus the result would be a single number (right?) But since we have this location variable given by the random draws, they do not actually sum ever, unless we had 2 identical draws in the base probability?
(Jan 11 '11 at 00:36)
Leon Palafox
Yes, that's exactly right. If you can find a tutorial which shows G pictorially, that would be a really helpful learning aid. I usually have to build a mental picture when thinking about Dirichlet processes.
(Jan 11 '11 at 00:56)
Kevin Canini
I did find a PPT tutorial with some graphical representations: nlp.stanford.edu/~grenager/papers/dp_2005_02_24.ppt But in slide 15, where he exemplifies the stick-breaking, he states the prod equals 0 in the first iteration, thus my confusion. Other than that the slides are quite nice
(Jan 11 '11 at 01:04)
Leon Palafox
Yeah, I see that (on slide 13). That's definitely a typo. The red spikes on the bottom of that same slide are the atoms that characterize the discrete distribution G. If you were to plot the probability mass function of G, it would look exactly like that.
(Jan 11 '11 at 01:22)
Kevin Canini
|