Paper Scanner: Variational Inference for Dirichlet Process Mixtures

Thursday, February 26, 2009

Variational Inference for Dirichlet Process Mixtures

This is the first DP paper I read. I heard about DP a long time ago but I haven't taken time for it until recent. This paper shows how to use (global) variational inference for the DP mixtures of exponential family.

The thing about DP is quite peculiar. The formal definition of DP would not yield us a model which could be computed. However, several related processes are employed, e.g. Chinese restaurant process and stick breaking process. In this paper the latter is adopted. One difficulty in understanding DP is the posterior distribution. For a mixture, we have several experts whose parameters comes from a common space, which is endowed with a prior H. The GP simly works on this space. GP is a stochastic meansure, which means that given a measurable set (event), it has a stochastic measure (probability). For DP, it refers to given a measurable finite partition, the probability of all events in this partition is Dirichlet distributed (with the parameter αH). Therefore, the posterior distribution of the GP is still something like this, added with several delta functions.

To solve this problem, the proposed solution can be interpreted as global variational approximation or mean field Gibbs sampling. To see why, we have to use the stick breaking process. This is a process, first we generate v_j from a Beta(1, α) distribution. The approximate uses a truncated version (let T be the stopping time). With this we can compute the mixing proportion π. We also generate ηj from H. The observation x_i is generated by taking π as the parameter of a multinomial distribution to select an index for η_j.

With this model, they propose a factorial posterior and with the idea of global variational method we can maximize the variational bound coordinate by coordinate (each coordinate is one parameter in the approximate posterior). This resembles the Gibbs sampling procedure. The difference is we use the mean (the parameter is usually the mean, first order moment) instead of do a sampling. They comapre the two in later experiments.

1 comment:

Anonymous said...: top [url=http://www.c-online-casino.co.uk/]uk online casinos[/url] coincide the latest [url=http://www.casinolasvegass.com/]casino games[/url] unshackled no store reward at the foremost [url=http://www.baywatchcasino.com/]casino games
[/url].; January 30, 2013 at 8:11 AM

Paper Scanner

Thursday, February 26, 2009

Variational Inference for Dirichlet Process Mixtures

1 comment:

Recent Comments

Scanning Areas

Paper list

Labels

Scanner