+ All Categories
Home > Documents > Bi-layer segmentation of binocular stereo video

Bi-layer segmentation of binocular stereo video

Date post: 29-Nov-2014
Category:
Upload: videoguy
View: 1,090 times
Download: 2 times
Share this document with a friend
Description:
 
18
Bi-layer Bi-layer segmentation segmentation of binocular of binocular stereo video stereo video V. Kolmogorov, A. Criminisi, A. Blake, G. Cross, C. Rother Microsoft Research, Cambridge, UK Computer Vision Group
Transcript
Page 1: Bi-layer segmentation of binocular stereo video

Bi-layer segmentationBi-layer segmentation of binocular stereo video of binocular stereo video

V. Kolmogorov, A. Criminisi, A. Blake, G. Cross, C. Rother

Microsoft Research, Cambridge, UKComputer Vision Group

Page 2: Bi-layer segmentation of binocular stereo video

ProblemProblem

• Two layers (Bg and Fg)– Scenarios: video-conferencing, live bluescreening without bluescreen

• Two cameras• Background may not be static• Goal: accurately segment foreground object in real-time

– Applications: background substitution, ...

Left input Right input

Page 3: Bi-layer segmentation of binocular stereo video

Sources of informationSources of information

• Stereo– Foreground object has larger disparity

• Colour– Background and foreground have distinct colour

distributions

• Contrast– There is image gradient at Bg/Fg transition

• Spatial coherence– MRF model

Page 4: Bi-layer segmentation of binocular stereo video

Previous workPrevious work

• Colour/Contrast (+2D coherence)– Graph cuts

(Boykov et al. ’01, Rother et al. ’04)

• Stereo (+1D coherence)– Dynamic programming

(Ohta et al. ’85, Cox et al. ’96, Criminisi et al. ’03)

Left input

Page 5: Bi-layer segmentation of binocular stereo video

Fusing colour/contrast and stereoFusing colour/contrast and stereo

• Colour and stereo complement each other• Result from fusion:

Results from our technique

Page 6: Bi-layer segmentation of binocular stereo video

• Ideally:

segmentation (Bg/Fg/Occ)

disparity

Fusing colour/contrast and stereoFusing colour/contrast and stereo

data

Construct p(x , d | z) Prior p(x , d) –2D MRF model

Marginalise d out to get p(x | z)

Compute x as MAP configuration Intractable!

Page 7: Bi-layer segmentation of binocular stereo video

Approximations

• Simplify the model to get real-time performance• Two different approaches:

– Layered Dynamic Programming (LDP)– Layered Graph Cut (LGC)

• Probabilistic formulation– Parameters can mostly be set automatically

• Very similar error statistics– Consistently better than colour/contrast or stereo alone

Page 8: Bi-layer segmentation of binocular stereo video

Layered Dynamic Programming (LDP)Layered Dynamic Programming (LDP)

• Approximation: in the prior neglect coupling between scanlines

• 1D MRF model (Markov chain)

• Compute path from (0,0) to (N,N) and label {Fg,Bg,Occ} for each vertex

Left

Rig

ht

Disparity

Page 9: Bi-layer segmentation of binocular stereo video

Layered Graph Cut (LGC)Layered Graph Cut (LGC)

• Approximation: in the prior neglect conditioning of disparity dp on disparities of neighbours

– conditioned only on segmentation label xp{Bg,Fg,Occ}

• Marginalise disparities out• Energy minimisation problem with 3 labels• Solve it using 2 graph cut computations

– Approx. 20 frames per second (320 x 240, 3GHz)

Page 10: Bi-layer segmentation of binocular stereo video

Setting parametersSetting parameters

• Two approaches:– Generative: from physics (e.g. from average width of

occluded regions)– Discriminative: minimize error rates

• Consistent results! – See technical report 2005 (

http://research.microsoft.com/vision/cambridge/i2i/)

Page 11: Bi-layer segmentation of binocular stereo video

Experiments: ground truth dataExperiments: ground truth data

Ground-Truth segmentation at http://research.microsoft.com/vision/cambridge/i2i/

• 19 calibrated stereo sequences• 6 with ground truth segmentation

– Every 5th or 10th frame– Pixels marked as “Bg”, “Fg” or “Unknown”

Original Ground-truth segmentation

Page 12: Bi-layer segmentation of binocular stereo video

Accuracy Accuracy ofof segmentation segmentation

LGC Segmentation

Page 13: Bi-layer segmentation of binocular stereo video

Accuracy of segmentationAccuracy of segmentation

Page 14: Bi-layer segmentation of binocular stereo video

Accuracy of segmentationAccuracy of segmentation

Page 15: Bi-layer segmentation of binocular stereo video

ConclusionConclusion

• Two algorithms based on different approximations– Fuse colour/constrast, stereo, and spatial coherence– Probabilistic formulation– Capable of real-time performance

• Similar error statistics– Consistently better than state-of-the art techniques

• Different characteristics– LDP: Parallelisable (scanlines processed independently)– LGC: Marginalisation could be done on GPU

MSN: i2i cambridge

Page 16: Bi-layer segmentation of binocular stereo video
Page 17: Bi-layer segmentation of binocular stereo video

Segmentation errors (LGC)

Page 18: Bi-layer segmentation of binocular stereo video

Segmentation errors (LGC)


Recommended