630 likes | 837 Views
Expectation Propagation for Graphical Models. Yuan (Alan) Qi Joint work with Tom Minka. Motivation. Graphical models are widely used in real-world applications, such as wireless communications and bioinformatics.
E N D
Expectation Propagation for Graphical Models Yuan (Alan) Qi Joint work with Tom Minka
Motivation • Graphical models are widely used in real-world applications, such as wireless communications and bioinformatics. • Inference techniques on graphical models often sacrifice efficiency for accuracy or sacrifice accuracy for efficiency. • Need a new method that better balances the trade-off between accuracy and efficiency.
What we want Motivation Accuracy Current Techniques Efficiency
Outline • Background • Expectation Propagation (EP) on dynamic systems • Poisson tracking • Signal detection for wireless communications • Tree-structured EP on loopy graphs • Conclusions and future work
Outline • Background • Expectation Propagation (EP) on dynamic systems • Poisson tracking • Signal detection for wireless communications • Tree-structured EP on loopy graphs • Conclusions
x1 x2 x1 x2 y1 y2 y1 y2 x1 x2 x1 x2 y1 y2 y1 y2 Graphical Models
Inference on Graphical Models • Bayesian inference techniques: • Belief propagation(BP): Kalman filtering /smoothing, forward-backward algorithm • Monte Carlo: Particle filter/smoothers, MCMC • Loopy BP: typically efficient, but not accurate • Monte Carlo: accurate, but often not efficient
Efficiency vs. Accuracy MC EP ? Accuracy BP Efficiency
Expectation Propagation in a Nutshell • Approximate a probability distribution by simpler parametric terms: • Each approximation term lives in an exponential family (e.g. Gaussian)
Update Term Approximation • Iterate the fixed-point equation by moment matching: Where the leave-one-out approximation is
Outline • Background • Expectation Propagation (EP) on dynamic systems • Poisson tracking • Signal detection for wireless communications • Tree-structured EP on loopy graphs • Conclusions
x1 x2 x1 x2 y1 y2 y1 y2 x1 x2 x1 x2 y1 y2 y1 y2 EP on Dynamic Systems
Object Tracking Guess the position of an object given noisy observations Object
x1 x2 xT y1 y2 yT Bayesian Network e.g. (random walk) want distribution of x’s given y’s
Approximation (proportional) Factorized and Gaussian in x
xt yt Message Interpretation = (forward msg)(observation)(backward msg) Forward Message Backward Message Observation Message
EP on Dynamic Systems • Filtering: t = 1, …, T • Incorporate forward message • Initialize observation message • Smoothing: t = T, …, 1 • Incorporate the backward message • Compute the leave-one-out approximation by dividing out the old observation messages • Re-approximate the new observation messages • Re-filtering: t = 1, …, T • Incorporate forward and observation messages
Extension of EP • Instead of matching moments, use any method for approximate filtering. • Examples: Extended Kalman filter, statistical linearization, unscented filter • All methods can be interpreted as finding linear/Gaussian approximations to original terms
Example: Poisson Tracking • is an integer valued Poisson variate with mean
Approximate Observation Message • is not Gaussian • Moments of x not analytic • Two approaches: • Gauss-Hermite quadrature for moments • Statistical linearization instead of moment-matching • Both work well
EP vs. Monte Carlo: Accuracy Mean Variance
EP for Digital Wireless Communication • Signal detection problem • Transmitted signal st = • vary to encode each symbol • Complex representation: Im Re
Binary Symbols, Gaussian Noise • Symbols are 1 and –1 (in complex plane) • Received signal yt = • Optimal detection is easy
Fading Channel • Channel systematically changes amplitude and phase: • changes over time
Benchmark: Differential Detection • Classical technique • Use previous observation to estimate state • Binary symbols only
s2 sT s1 y1 yT y2 x1 xT x2 Bayesian network for Signal Detection
On-line EP Joint Signal Detector and Channel Estimation • Iterate over the last observations • Observations before act as prior for the current estimation
Computational Complexity • Expectation propagation O(nLd2) • Stochastic mixture of Kalman filters O(LMd2) • Rao-blackwised paricle smoothers O(LMNd2) • n: Number of EP iterations (Typically, 4 or 5) • d: Dimension of the parameter vector • L: Smooth window length • M: Number of samples in filtering • N: Number of samples in smoothing
Experimental Results (Chen, Wang, Liu 2000) EP outperforms particle smoothers in efficiency with comparable accuracy.
e2 eT e1 y1 yT y2 x1 xT x2 Bayesian Networks for Adaptive Decoding The information bits et are coded by a convolutional error-correcting encoder.
Outline • Background • Expectation Propagation (EP) on dynamic systems • Poisson tracking • Signal detection for wireless communications • Tree-structured EP on loopy graphs • Conclusions
x1 x2 x1 x2 y1 y2 y1 y2 x1 x2 x1 x2 y1 y2 y1 y2 EP on Boltzman machines
X1 X2 X3 X4 X5 X6 X7 X8 X9 X10 X11 X12 X13 X14 X15 X16 Inference on Grids Problem: estimate marginal distributions of the variables indexed by the nodes in a loopy graph, e.g., p(xi), i = 1, . . . , 16.
Boltzmann Machines Joint distribution is product of pair potentials: Want to approximate by a simpler distribution
BP vs. EP EP BP
Junction Tree Representation p(x) q(x) Junction tree
Approximating an Edge by a Tree Each potential f ain p is projected onto the tree-structure of q Correlations are not lost, but projected onto the tree
Moment Matching • Match single and pairwise marginals of • Reduces to exact inference on single loops • Use cutset conditioning and
Local Propagation • Original EP: globally propagate evidence to the whole tree • Problem: Computationally expensive • Exploit the junction tree representation: only locally propagate evidence within the minimal subtree that is directly connected to the off-tree edge. • Reduce computational complexity • Save memory
x5 x7 x1 x2 x1 x2 x5 x7 x1 x2 x1 x2 x5 x7 x1 x4 x3 x5 x1 x3 x1 x4 x3 x5 x3 x6 x1 x3 x3 x6 x1 x3 x1 x3 x1 x4 x3 x5 x3 x6 x1 x4 x3 x4 x3 x4 Global propagation Local propagation
4-node Graph TreeEP = the proposed method, BP = loopy belief propagation, GBP = generalized belief propagation on triangles, MF = mean-field, TreeVB =variational tree.
Fully-connected graphs • Results are averaged over 10 graphs with randomly generated potentials • TreeEP performs the same or better than all other methods in both accuracy and efficiency!
TreeEP versus BP and GBP • TreeEP is always more accurate than BP and is often faster • TreeEP is much more efficient than GBP and more accurate on some problems • TreeEP converges more often than BP and GBP
Outline • Background • Expectation Propagation (EP) on dynamic systems • Poisson tracking • Signal detection for wireless communications • Tree-structured EP on loopy graphs • Conclusions