O BJ C UT

UNIVERSITY OF OXFORD OBJ CUT M. Pawan Kumar Philip Torr Andrew Zisserman

Aim • Given an image, to segment the object Object Category Model Segmentation Cow Image Segmented Cow • Segmentation should (ideally) be • shaped like the object e.g. cow-like • obtained efficiently in an unsupervised manner • able to handle self-occlusion

Challenges Intra-Class Shape Variability Intra-Class Appearance Variability Self Occlusion

Motivation Magic Wand • Current methods require user intervention • Object and background seed pixels (Boykov and Jolly, ICCV 01) • Bounding Box of object (Rother et al. SIGGRAPH 04) Object Seed Pixels Cow Image

Motivation Magic Wand • Current methods require user intervention • Object and background seed pixels (Boykov and Jolly, ICCV 01) • Bounding Box of object (Rother et al. SIGGRAPH 04) Object Seed Pixels Background Seed Pixels Cow Image

Motivation Magic Wand • Current methods require user intervention • Object and background seed pixels (Boykov and Jolly, ICCV 01) • Bounding Box of object (Rother et al. SIGGRAPH 04) Segmented Image

Motivation Magic Wand • Current methods require user intervention • Object and background seed pixels (Boykov and Jolly, ICCV 01) • Bounding Box of object (Rother et al. SIGGRAPH 04) Object Seed Pixels Background Seed Pixels Cow Image

Motivation Magic Wand • Current methods require user intervention • Object and background seed pixels (Boykov and Jolly, ICCV 01) • Bounding Box of object (Rother et al. SIGGRAPH 04) Segmented Image

Motivation • Problem • Manually intensive • Segmentation is not guaranteed to be ‘object-like’ Non Object-like Segmentation

Our Method • Combine object detection with segmentation • Borenstein and Ullman, ECCV ’02 • Leibe and Schiele, BMVC ’03 • Incorporate global shape priors in MRF • Detection provides • Object Localization • Global shape priors • Automatically segments the object • Note our method is completely generic • Applicable to any object category model

Outline • Problem Formulation • Form of Shape Prior • Optimization • Results

Problem • Labelling m over the set of pixels D • Shape prior provided by parameter  • Energy E (m, ) = ∑x(D|mx)+x(mx| ) + ∑xy(mx,my)+ (D|mx,my) • Unary terms • Likelihood based on colour • Unary potential based on distance from  • Pairwise terms • Prior • Contrast term • Find best labelling m* = arg min ∑ wi E (m, i) • wi is the weight for sample i Unary terms Pairwise terms

MRF • Probability for a labellingconsists of • Likelihood • Unary potential based on colour of pixel • Prior which favours same labels for neighbours (pairwise potentials) Pairwise Potential xy(mx, my) mx m(labels) my Unary Potential x(D|mx) x y D(pixels) Image Plane

Example Cow Image Object Seed Pixels Background Seed Pixels x(D|obj) x … x …  x(D|bkg)  xy(mx,my) y … y … … … … … Prior Likelihood Ratio (Colour)

Example Cow Image Object Seed Pixels Background Seed Pixels Prior Likelihood Ratio (Colour)

Contrast-Dependent MRF • Probability of labelling in addition has • Contrast term which favours boundaries to lie on image edges mx m(labels) my x Contrast Term (D|mx,my) y D(pixels) Image Plane

Example Cow Image Object Seed Pixels Background Seed Pixels x(D|obj) x … x …  x(D|bkg) xy(mx,my)+ xy(D|mx,my) y … y … … … … … Prior + Contrast Likelihood Ratio (Colour)

Example Cow Image Object Seed Pixels Background Seed Pixels Prior + Contrast Likelihood Ratio (Colour)

Our Model • Probability of labelling in addition has • Unary potential which depend on distance from  (shape parameter)  (shape parameter) Unary Potential x(mx|) mx m(labels) my Object Category Specific MRF x y D(pixels) Image Plane

Example Cow Image Object Seed Pixels Background Seed Pixels ShapePrior Distance from  Prior + Contrast

Example Cow Image Object Seed Pixels Background Seed Pixels ShapePrior Likelihood + Distance from  Prior + Contrast

Outline • Problem Formulation • Energy E (m, ) = ∑x(D|mx)+x(mx| ) + ∑xy(mx,my)+ (D|mx,my) • Form of Shape Prior • Optimization • Results

Layered Pictorial Structures (LPS) • Generative model • Composition of parts + spatial layout Layer 2 Spatial Layout (Pairwise Configuration) Layer 1 Parts in Layer 2 can occlude parts in Layer 1

Layered Pictorial Structures (LPS) Cow Instance Layer 2 Transformations 1 P(1) = 0.9 Layer 1

Layered Pictorial Structures (LPS) Cow Instance Layer 2 Transformations 2 P(2) = 0.8 Layer 1

Layered Pictorial Structures (LPS) Unlikely Instance Layer 2 Transformations 3 P(3) = 0.01 Layer 1

LPS for Detection • Learning • Learnt automatically using a set of videos • Part correspondence using Shape Context Shape Context Matching Multiple Shape Exemplars

LPS for Detection • Detection • Putative parts found using tree cascade of classifiers (x,y)

LPS for Detection • MRF over parts • Labels represent putative poses • Prior (pairwise potential) - Robust Truncated Model • Match LPS by obtaining MAP configuration Linear Model Quadratic Model Potts Model

LPS for Detection Efficient Belief Propagation xi • Likelihood i(xi) • tree cascade of classifiers • Prior ij(xi,xj) • fij(xi,xj), if xi  Ci(xj) • ij , otherwise • Pr(x)   i(xi)  ij(xi,xj) i xj xk j k mj->i ij i Messages j jk k ki

LPS for Detection Efficient Belief Propagation xi • Likelihood i(xi) • tree cascade of classifiers • Prior ij(xi,xj) • fij(xi,xj), if xi  Ci(xj) • ij , otherwise • Pr(x)   i(xi)  ij(xi,xj) i xj xk j k Messages calculated as

LPS for Detection Efficient Generalized Belief Propagation xi • Likelihood i(xi) • tree cascade of classifiers • Prior ij(xi,xj) • fij(xi,xj), if xi  Ci(xj) • ij , otherwise • Pr(x)   i(xi)  ij(xi,xj) i xj xk j k ij mk->ij i Messages j ijk jk k ki

LPS for Detection Efficient Generalized Belief Propagation xi • Likelihood i(xi) • tree cascade of classifiers • Prior ij(xi,xj) • fij(xi,xj), if xi  Ci(xj) • ij , otherwise • Pr(x)   i(xi)  ij(xi,xj) i xj xk j k Messages calculated as

LPS for Detection Second Order Cone Programming Relaxations xi • Likelihood i(xi) • tree cascade of classifiers • Prior ij(xi,xj) • fij(xi,xj), if xi  Ci(xj) • ij , otherwise • Pr(x)   i(xi)  ij(xi,xj) i xj xk j k

LPS for Detection Second Order Cone Programming Relaxations 1 • Likelihood i(xi) • tree cascade of classifiers • Prior ij(xi,xj) • fij(xi,xj), if xi  Ci(xj) • ij , otherwise • Pr(x)   i(xi)  ij(xi,xj) 0 0 0 0 i 1 0 0 1 j k m - Concatenation of all binary vectors l - Likelihood vector P - Prior matrix

LPS for Detection Second Order Cone Programming Relaxations 1 0 0 0 0 i 1 0 0 1 j k

Outline • Problem Formulation • Form of Shape Prior • Optimization • Results

Optimization • Given image D, find best labelling as m* = arg max p(m|D) • Treat LPS parameter as a latent (hidden) variable • EM framework • E : sample the distribution over  • M : obtain the labelling m

E-Step • Given initial labelling m’, determine p( | m’,D) • Problem Efficiently sampling from p( | m’,D) • Solution • We develop efficient sum-product Loopy Belief Propagation (LBP) for matching LPS. • Similar to efficient max-product LBP for MAP estimate

Results • Different samples localize different parts well. • We cannot use only the MAP estimate of the LPS.

M-Step • Given samples from p( |m’,D), get new labelling mnew • Sample iprovides • Object localization to learn RGB distributions of object and background • Shape prior for segmentation • Problem • Maximize expected log likelihood using all samples • To efficiently obtain the new labelling

M-Step w1 = P(1|m’,D) Cow Image Shape 1 RGB Histogram for Background RGB Histogram for Object

M-Step w1 = P(1|m’,D) Cow Image Shape 1 1 m(labels) Image Plane D(pixels) • Best labelling found efficiently using a Single Graph Cut

Segmentation using Graph Cuts Obj x … y … … … m z … … Bkg

M-Step w2 = P(2|m’,D) Cow Image Shape 2 RGB Histogram for Background RGB Histogram for Object

M-Step w2 = P(2|m’,D) Cow Image Shape 2 2 m(labels) Image Plane D(pixels) • Best labelling found efficiently using a Single Graph Cut

O BJ C UT

O BJ C UT

Presentation Transcript

Gvmk,bj .

Gvmk,bj .

LOTO: L ock O ut T ag O ut

o c

O BJ C UT & Pose Cut CVPR 05 ECCV 06

“ Beyond Sounding O ut ”

Keep Sun R ays O ut

Stepping O ut I n Style

Gvmk,bj .

Fifth Graders G o O ut T o Lunch

BJ Bennion

O BJ C UT

BJ Concepcion

Betty (BJ) Barclay

Bellamy, BJ

bj shoes

O BJ C UT

O BJ C UT

Presentation Transcript

Gvmk,bj .

Gvmk,bj .

LOTO: L ock O ut T ag O ut

o c

O BJ C UT &amp; Pose Cut CVPR 05 ECCV 06

“ Beyond Sounding O ut ”

Keep Sun R ays O ut

Stepping O ut I n Style

Gvmk,bj .

Fifth Graders G o O ut T o Lunch

BJ Bennion

O BJ C UT

BJ Concepcion

Betty (BJ) Barclay

Bellamy, BJ

bj shoes

O BJ C UT & Pose Cut CVPR 05 ECCV 06