+ All Categories
Home > Documents > 3D ShapeNets: A Deep Representation for Volumetric Shape...

3D ShapeNets: A Deep Representation for Volumetric Shape...

Date post: 03-Jul-2020
Category:
Upload: others
View: 0 times
Download: 0 times
Share this document with a friend
112
3D ShapeNets: A Deep Representation for Volumetric Shape Modeling by Wu, Song, Khosla, Yu, Zhang, Tang, Xiao presented by Abhishek Sinha 1
Transcript
Page 1: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

3D ShapeNets: A Deep Representation for

Volumetric Shape Modelingby Wu, Song, Khosla, Yu, Zhang, Tang, Xiao

presented by Abhishek Sinha

1

Page 2: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

3D Shape Prior

2Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 3: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

3D Shape Prior

2Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 4: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

3D Shape Prior

2Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 5: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

3

Outline

Page 6: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

3

Outline

• Problem

Page 7: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

3

Outline

• Problem• Motivation

Page 8: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

3

Outline

• Problem• Motivation• Desirable Properties for Representation

Page 9: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

3

Outline

• Problem• Motivation• Desirable Properties for Representation• Architecture

Page 10: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

3

Outline

• Problem• Motivation• Desirable Properties for Representation• Architecture• Dataset

Page 11: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

3

Outline

• Problem• Motivation• Desirable Properties for Representation• Architecture• Dataset• Applications

Page 12: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

3

Outline

• Problem• Motivation• Desirable Properties for Representation• Architecture• Dataset• Applications• Extensions

Page 13: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

3

Outline

• Problem• Motivation• Desirable Properties for Representation• Architecture• Dataset• Applications• Extensions• Discussion Points

Page 14: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

4

Problem

Page 15: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Learn ‘Useful’ 3D shape representations from

images

Page 16: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

6

Motivation

Page 17: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

7

3D Shape Representation

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 18: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

7

3D Shape Representation

Shape Synthesis

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 19: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

7

3D Shape Representation

Shape Synthesis

Shape Completion

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 20: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

7

3D Shape Representation

Shape Synthesis

Shape Completion

2.5D Object Recognition

person

tricycle

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 21: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

7

3D Shape Representation

Shape Synthesis

Feature ExtractorShape Completion

2.5D Object Recognition

person

tricycle

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 22: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

8

Desirable Properties

Page 23: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

What is a Desirable 3D Shape Representation?

9

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 24: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

What is a Desirable 3D Shape Representation?

9

Data-driven

Generic

Compositional

Versatile

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 25: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

10

Data-driven

Generic

Compositional

Versatile

What is a Desirable 3D Shape Representation?

Data-driven

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 26: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

11

Simple Shapes Complex Shapes

Data-driven

Generic

Compositional

Versatile

Generic

What is a Desirable 3D Shape Representation?

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 27: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

12

building blocks full object

Data-driven

Generic

Compositional

VersatileCompositional

What is a Desirable 3D Shape Representation?

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 28: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

13

mesh classification shape completion

shape generation2.5D object recognition

person

tricycle

Data-driven

Generic

Compositional

Versatile

What is a Desirable 3D Shape Representation?

Versatile Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 29: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

14

Architecture

Page 30: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

15

3D Deep Learning

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 31: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

15

3D Deep Learning3D Shape as Volumetric Representation

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 32: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

15

3D Deep Learning

mesh

3D Shape as Volumetric Representation

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 33: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

15

3D Deep Learning

mesh

3D Shape as Volumetric Representation

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 34: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

15

3D Deep Learning

mesh binary voxel

3D Shape as Volumetric Representation

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 35: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

16

3D ShapeNets

Convolutional Deep Belief Network

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 36: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

16

3D ShapeNets

Convolutional Deep Belief Network

A Deep Belief Network is a generative graphical model that describes the distribution of input x over class y.

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 37: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

16

3D ShapeNets

• Convolution to enable compositionality• No pooling to reduce reconstruction

error

Convolutional Deep Belief Network

A Deep Belief Network is a generative graphical model that describes the distribution of input x over class y.

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 38: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

16

3D ShapeNets

• Convolution to enable compositionality• No pooling to reduce reconstruction

error

Layer 1-3 convolutional RBM

Layer 4 fully connected RBM

Layer 5 multinomial label + Bernoulli feature form an associate memory

configurations

Convolutional Deep Belief Network

A Deep Belief Network is a generative graphical model that describes the distribution of input x over class y.

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 39: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

17

3D ShapeNets

Convolutional Deep Belief Network

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 40: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

17

3D ShapeNets

Convolutional Deep Belief Network

3D ShapeNets ≠ CNNs

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 41: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

17

3D ShapeNets

Convolutional Deep Belief Network

3D ShapeNets ≠ CNNs

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 42: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

17

3D ShapeNets

Convolutional Deep Belief Network

3D ShapeNets ≠ CNNs

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 43: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

17

3D ShapeNets

Convolutional Deep Belief Network

3D ShapeNets ≠ CNNs

generative process

discriminative process

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 44: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

17

3D ShapeNets

Convolutional Deep Belief Network

3D ShapeNets ≠ CNNs

* 3D ShapeNets can be converted into a CNN, and discriminatively trained with back-propagation.

generative process

discriminative process

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 45: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

18

TrainingMaximum Likelihood Learning

Convolutional Deep Belief Network

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 46: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

18

Training

Layer-wise pre-training:

Lower four layers are trained by CD

Last layer is trained by FPCD[1]

Fine-tuning:

Wake sleep[2] but keep weights tied

[2] Hinton, et al "A fast learning algorithm for deep belief nets." Neural computation[1] Tijmen, et al. "Using fast weights to improve persistent contrastive divergence.”

Maximum Likelihood Learning

Convolutional Deep Belief Network

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 47: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

18

Training

Layer-wise pre-training:

Lower four layers are trained by CD

Last layer is trained by FPCD[1]

Fine-tuning:

Wake sleep[2] but keep weights tied

[2] Hinton, et al "A fast learning algorithm for deep belief nets." Neural computation[1] Tijmen, et al. "Using fast weights to improve persistent contrastive divergence.”

Maximum Likelihood Learning

Convolutional Deep Belief Network

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 48: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

generation process:Gibbs Sampling

Convolutional Deep Belief Network 19

Sampling

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 49: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Sampling

generation process:Gibbs Sampling

20

object label

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 50: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Sampling

generation process:Gibbs Sampling

20

object label

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 51: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Sampling

generation process:Gibbs Sampling

20

object label

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 52: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Sampling

generation process:Gibbs Sampling

20

object label

••••

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 53: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Sampling

generation process:Gibbs Sampling

20

object label

••••

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 54: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

21

Dataset

Page 55: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Big 3D Data

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 56: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Big 3D DataQuery Keyword: common object categories from the SUN database that

contain no less than 20 object instances per category

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 57: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Big 3D Data

23

Query Keyword: common object categories from the SUN database that

contain no less than 20 object instances per category

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 58: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Big 3D Data

151,128 models 660 categories Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 59: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

25

Applications

Page 60: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

2.5D Completion & Recognition

26

Slide Credit: Wu et al

Page 61: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

2.5D Completion & Recognition

26

Slide Credit: Wu et al

Page 62: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

2.5D Completion & Recognition

26

Slide Credit: Wu et al

Page 63: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

2.5D Completion & Recognition

26

Slide Credit: Wu et al

Page 64: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

2.5D Completion & Recognition

26

Slide Credit: Wu et al

Page 65: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

2.5D Completion & Recognition

26

Slide Credit: Wu et al

Page 66: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

2.5D Completion & Recognition

26

Slide Credit: Wu et al

Page 67: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

2.5D Completion & Recognition

26

Slide Credit: Wu et al

Page 68: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

2.5D Completion & Recognition

26

Gibbs sampling with clamping Slide Credit: Wu et al

Page 69: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

2.5D Completion & Recognition

26

Gibbs sampling with clamping Slide Credit: Wu et al

Page 70: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

27[29] R. Socher, B. Huval, B. Bhat, C. D. Manning, and A. Y. Ng. Convolutional-recursive deep learning for 3d object classification. In NIPS 2012.

2.5D Completion & Recognition

Training on CAD models and no discriminative tuning!

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 71: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

View Planning for Recognition

28

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 72: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

View Planning for Recognition

Volumetric representation

28

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 73: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

View Planning for Recognition

Volumetric representation

sofa?

bathtub?What is it?

?dresser?

28

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 74: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

View Planning for Recognition

Volumetric representation

sofa?

bathtub?What is it?

?dresser?

Not sure. Look from

another view?

28

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 75: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

View Planning for Recognition

Volumetric representation

sofa?

bathtub?What is it?

?dresser?

Not sure. Look from

another view? Where to look next?

Next-Best-View

28

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 76: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

View Planning for Recognition

Volumetric representation New depth map

sofa?

bathtub?What is it?

?dresser?

Not sure. Look from

another view? Where to look next?

Next-Best-View

28

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 77: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

View Planning for Recognition

Volumetric representation New depth map

sofa?

bathtub?What is it?

?dresser?

Aha! It is a sofa!

Not sure. Look from

another view? Where to look next?

Next-Best-View

28

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 78: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

View Planning for Recognition

Volumetric representation New depth map

sofa?

bathtub?What is it?

?dresser?

3D ShapeNets

Aha! It is a sofa!

Not sure. Look from

another view? Where to look next?

Next-Best-View

28

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 79: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Deep View Planning

0.8

0.5

0.2

0.3 0.4

0.8 0.3

0.8

0.80.3

0.7

0.3

0.4

0.8

0.80.3

29

Slide Credit: Wu et al

Page 80: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Deep View Planning

0.8

0.5

0.2

0.3 0.4

0.8 0.3

0.8

0.80.3

0.7

0.3

0.4

0.8

0.80.3

29

Slide Credit: Wu et al

Page 81: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Deep View Planning

0.8

0.5

0.2

0.3 0.4

0.8 0.3

0.8

0.80.3

0.7

0.3

0.4

0.8

0.80.3

29

Slide Credit: Wu et al

Page 82: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Deep View Planning

0.8

0.5

0.2

0.3 0.4

0.8 0.3

0.8

0.80.3

0.7

0.3

0.4

0.8

0.80.3

29

Slide Credit: Wu et al

Page 83: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Deep View Planning

0.8

0.5

0.2

0.3 0.4

0.8 0.3

0.8

0.80.3

0.7

0.3

0.4

0.8

0.80.3

29

Slide Credit: Wu et al

Page 84: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Deep View Planning

0.8

0.5

0.2

0.3 0.4

0.8 0.3

0.8

0.80.3

0.7

0.3

0.4

0.8

0.80.3

29

Slide Credit: Wu et al

Page 85: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Deep View Planning

0.8

0.5

0.2

0.3 0.4

0.8 0.3

0.8

0.80.3

0.7

0.3

0.4

0.8

0.80.3

29

Slide Credit: Wu et al

Page 86: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Deep View Planning

0.8

0.5

0.2

0.3 0.4

0.8 0.3

0.8

0.80.3

0.7

0.3

0.4

0.8

0.80.3

29

Slide Credit: Wu et al

Page 87: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Deep View Planning

Mathematically, this is equivalent to evaluate the conditional mutual information:

0.8

0.5

0.2

0.3 0.4

0.8 0.3

0.8

0.80.3

0.7

0.3

0.4

0.8

0.80.3

29

Slide Credit: Wu et al

Page 88: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

30

Recognition Accuracy from Two Views.

Deep View Planning

Based on the algorithms’ choice, we obtain the actual depth map for the next view and recognize the objects using two views by our 3D ShapeNets.

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 89: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

30

Recognition Accuracy from Two Views.

Deep View Planning

Based on the algorithms’ choice, we obtain the actual depth map for the next view and recognize the objects using two views by our 3D ShapeNets.

Our algorithm stands out as the uncertainty of the first view increasesSlide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 90: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Back Propagation Fine-tuning

31

48 filters of stride 2

160 filters of stride 2

512 filters of stride 1

30

13

5

1200

2

4000

object label

3D ShapeNets

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 91: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Back Propagation Fine-tuning

31

48 filters of stride 2

160 filters of stride 2

512 filters of stride 1

30

13

5

1200

2

4000

object label

48 filters of stride 2

160 filters of stride 2

512 filters of stride 1

30

13

5

1200

2

4000

object label

3D ShapeNets

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 92: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Back Propagation Fine-tuning

31

48 filters of stride 2

160 filters of stride 2

512 filters of stride 1

30

13

5

1200

2

4000

object label

48 filters of stride 2

160 filters of stride 2

512 filters of stride 1

30

13

5

1200

2

4000

object label

3D ShapeNets

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 93: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Back Propagation Fine-tuning

31

48 filters of stride 2

160 filters of stride 2

512 filters of stride 1

30

13

5

1200

2

object label

48 filters of stride 2

160 filters of stride 2

512 filters of stride 1

30

13

5

1200

2

4000

object label

3D ShapeNets

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 94: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Back Propagation Fine-tuning

31

48 filters of stride 2

160 filters of stride 2

512 filters of stride 1

30

13

5

1200

2

object label

48 filters of stride 2

160 filters of stride 2

512 filters of stride 1

30

13

5

1200

2

4000

object label

3D ShapeNets

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 95: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Back Propagation Fine-tuning

31

48 filters of stride 2

160 filters of stride 2

512 filters of stride 1

30

13

5

1200

2

object label

48 filters of stride 2

160 filters of stride 2

512 filters of stride 1

30

13

5

1200

2

4000

object label

3D ShapeNets

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 96: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Back Propagation Fine-tuning

31

48 filters of stride 2

160 filters of stride 2

512 filters of stride 1

30

13

5

1200

2

object label

48 filters of stride 2

160 filters of stride 2

512 filters of stride 1

30

13

5

1200

2

4000

object label

3D ShapeNets

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 97: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Back Propagation Fine-tuning

31

48 filters of stride 2

160 filters of stride 2

512 filters of stride 1

30

13

5

1200

2

object label

48 filters of stride 2

160 filters of stride 2

512 filters of stride 1

30

13

5

1200

2

4000

object label

3D ShapeNets 3D CNN

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 98: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

As a 3D Feature Extractor

32

Slide Credit: Wu et al

Page 99: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

As a 3D Feature Extractor

32

Mesh Classification & Retrieval

Slide Credit: Wu et al

Page 100: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

As a 3D Feature Extractor

32

Mesh Classification & Retrieval

[29] R. Socher, B. Huval, B. Bhat, C. D. Manning, and A. Y. Ng. Convolutional-recursive deep learning for 3d object classification. In NIPS 2012.

2.5D object recognition

Slide Credit: Wu et al

Page 101: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

33mesh retrieval

As a 3D Feature Extractor

Slide Credit: Wu, Song et al. 3D ShapeNets: A Deep Representation for Volumetric Shape Modeling, CVPR 2015

Page 102: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Extensions

34

Page 103: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

• Include RGB information in representation

• 3D Segmentation

• Improve for non-rigid 3D objects

35

Page 104: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Discussion Points

36

Page 105: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

• Is the network deep enough?

• 30x30x30 = 27000 vs 256x256 = 65000 for Image Net

• 150K training examples vs millions for Image Net

37

Page 106: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

• Won’t removal of max-pooling layers hurt performance on classification tasks?

http://3dshapenets.cs.princeton.edu/38

Page 107: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

• Any other systems that use binary units with approximate training and inference techniques rather than standard back-prop?

• Hinton, Geoffrey E., Simon Osindero, and Yee-Whye Teh. "A fast learning algorithm for deep belief nets." Neural computation 18.7 (2006): 1527-1554

• Salakhutdinov, Ruslan, Andriy Mnih, and Geoffrey Hinton. "Restricted Boltzmann machines for collaborative filtering." Proceedings of the 24th international conference on Machine learning. ACM, 2007.

39

Page 108: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

• Are there better ways for representing 3D Shapes. In particular, doesn’t the voxel representation have the bottleneck of cubic dependency on grid size?

• Yes. Su, Majhi et al that tries to recognize 3D shapes from multiple 2D views instead of voxel representation and get better results for classification .

H. Su, S. Maji, E. Kalogerakis, E. Learned-Miller. Multi-view Convolutional Neural Networks for 3D Shape Recognition. ICCV2015.40

Page 109: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

• Are there other 3D CAD model datasets

• 3D Warehouse. https://3dwarehouse.sketchup.com/

• Manually removing clutter from 3D CAD models a problem

• Did not address non-rigid objects sufficiently.

• Even the 40 model classification dataset seemed to contain only 4 non-rigid categories — persons, plant, sofas, curtains.

41

Page 110: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Appendix

42

Page 111: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

Contrastive divergence learning: A quick way to learn an RBM

0>< jihv 1>< jihv

i

j

i

j

t = 0 t = 1

)( 10 ><−><=Δ jijiij hvhvw ε

Start with a training vector on the visible units. Update all the hidden units in parallel Update all the visible units in parallel to get a “reconstruction”. Update all the hidden units again.

This is not following the gradient of the log likelihood. But it works well. It is approximately following the gradient of another objective function.

reconstructiondata

12

Slide Credit: Geoffery Hinton

43

Page 112: 3D ShapeNets: A Deep Representation for Volumetric Shape …vision.cs.utexas.edu/381V-spring2016/slides/sinha-paper.pdf · 2016-04-22 · Slide Credit: Wu, Song et al. 3D ShapeNets:

The wake-sleep algorithm for an SBN

• Wake phase: Use the recognition weights to perform a bottom-up pass.

– Train the generative weights to reconstruct activities in each layer from the layer above.

• Sleep phase: Use the generative weights to generate samples from the model.

– Train the recognition weights to reconstruct activities in each layer from the layer below.

h2

data

h1

h3

2W

1W1R

2R

3W3R

Slide Credit: Geoffery Hinton

44


Recommended