+ All Categories
Home > Documents > Neural Rendering - MIT Deep Learning...

Neural Rendering - MIT Deep Learning...

Date post: 31-May-2020
Category:
Upload: others
View: 10 times
Download: 0 times
Share this document with a friend
94
Neural Rendering Chuan Li Lambda Labs Collaborators: Thu Nguyen-Phuoc, Bing Xu, Yongliang Yang, Stephen Balaban, Lucas Theis, Christian Richardt, Junfei Zhang, Rui Wang, Kun Xu, Rui Tang
Transcript
Page 1: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Neural Rendering

Chuan Li

Lambda Labs

Collaborators: Thu Nguyen-Phuoc, Bing Xu, Yongliang Yang, Stephen Balaban, Lucas Theis, Christian Richardt, Junfei Zhang, Rui Wang, Kun Xu, Rui Tang

Page 2: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Model Pictures

Forward (Computer Graphics)

Page 3: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Model Pictures

Forward (Computer Graphics)

Inverse (Computer Vision)

Page 4: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels
Page 5: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels
Page 6: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Integral of the incident radians

Page 7: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

BRDF

Page 8: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels
Page 9: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels
Page 10: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

32K SPP Ray Tracing (90 mins 12 CPU Cores)The Tungsten Renderer

Page 11: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

P0

P1

Page 12: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

P0

P1

Page 13: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

R01 | T

01

P0

P1

Page 14: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Inverse (Computer Vision)

R01 | T

01

P0

P1

Page 15: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

P2

R12 | T12

Inverse (Computer Vision)

R01 | T

01

P0

P1

Page 16: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Building Rome in a DaySameer Agarwal, Noah Snavely, Ian Simon, Steven M. Seitz and Richard Szeliski

Page 17: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Model Pictures

Sub-module

End-2-End

Differentiable Rendering

Page 18: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

1 SPP

2048 SPP

Page 19: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Sub-modules

Mastering the game of Go with deep neural networks and tree searchDavid Silver et al.

Page 20: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Sub-modules

Value Network

Mastering the game of Go with deep neural networks and tree searchDavid Silver et al.

Page 21: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Sub-modules

Value Network

Policy Network

Mastering the game of Go with deep neural networks and tree searchDavid Silver et al.

Page 22: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

2^15 SPP4 SPP

Value Networks

Denoising

Page 23: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

2^15 SPP

Value Networks

Denoising

Policy Networks

Same SPP

4 SPP

Page 24: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

2^15 SPP

Value Networks

Denoising

Policy Networks

Same SPP

4 SPP

Page 25: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

4 SPP Denoised1 sec 2080 Ti

32K SPP Ray Tracing90 mins 12 cores CPU

Adversarial Monte Carlo denoising with conditioned auxiliary feature modulationB Xu et al. Siggraph Asia 2019

Page 26: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

loss

DecoderEncoderInput x

Ref

Output

Adversarial Monte Carlo denoising with conditioned auxiliary feature modulationB Xu et al. Siggraph Asia 2019

Page 27: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

L1 VGG Loss

L1 VGG Loss + GAN

Adversarial Monte Carlo denoising with conditioned auxiliary feature modulationB Xu et al. Siggraph Asia 2019

Page 28: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

loss

DecoderEncoderDiffuseInput x Diffuse

Output

DecoderEncoderSpecular

Input x SpecularOutput Ref

Output

Adversarial Monte Carlo denoising with conditioned auxiliary feature modulationB Xu et al. Siggraph Asia 2019

Page 29: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Auxiliary

loss

DecoderEncoderDiffuseInput x Diffuse

Output

DecoderEncoderSpecular

Input x SpecularOutput Ref

Output

Albedo, normal, depth

Page 30: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Auxiliary

Conv

LeakyReLU

Conv

x

Element-wiseBiasing

Page 31: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Conv

Auxiliary

LeakyReLU

Conv

Conv

LeakyReLU

Conv

x

Element-wiseBiasing

Element-wiseScaling

Page 32: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Conv

Auxiliary

LeakyReLU

Conv

Conv

LeakyReLU

Conv

x

Element-wiseBiasing (OR)

Element-wiseScaling (AND)

Page 33: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Denoise comparison 4 SPP

Adversarial Monte Carlo denoising with conditioned auxiliary feature modulationB Xu et al. Siggraph Asia 2019

Page 34: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

2^15 SPP

Value Networks

Denoising

Policy Networks

Same SPP

4 SPP

Page 35: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Neural Importance SamplingThomas Müller et al. ACM Transactions on Graphics 2019

Page 36: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

incidence radiance map

Neural Importance SamplingThomas Müller et al. ACM Transactions on Graphics 2019

Page 37: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels
Page 38: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Neural Importance SamplingThomas Müller et al. ACM Transactions on Graphics 2019

Page 39: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Neural Importance SamplingThomas Müller et al. ACM Transactions on Graphics 2019

Page 40: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Model Pictures

Sub-module

End-2-End

Differentiable Rendering

Page 41: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Ray TracingImage Centric

RasterizationObject Centric

Page 42: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Ray TracingImage Centric

RasterizationObject Centric

Visibility

Page 43: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Ray TracingImage Centric

RasterizationObject Centric

Shading

Page 44: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Depth Map Voxel Point Cloud Mesh

Memory Good Very Poor Poor Very Good

NN friendly Great Yes No Enemy

Page 45: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Depth Map Voxel Point Cloud Mesh

Memory Good Very Poor Poor Very Good

NN friendly Great Yes No Enemy

Page 46: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Depth Map Voxel Point Cloud Mesh

Memory Good Very Poor Poor Very Good

NN friendly Great Yes No Enemy

Page 47: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Depth Map Voxel Point Cloud Mesh

Memory Good Very Poor Poor Very Good

NN friendly Great Yes No Enemy

Page 48: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

RenderNet: A deep convolutional network for differentiable rendering from 3D shapesThu Nguyen-Phuoc et al. NeurIPS 2018

Page 49: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Neural Voxels

32 x 32 x 32 x 16

3D Encoder

Neural Voxels

RenderNet: A deep convolutional network for differentiable rendering from 3D shapesThu Nguyen-Phuoc et al. NeurIPS 2018

Page 50: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Neural Voxels

32 x 32 x 32 x 16

3D Encoder

3D-2D

32 x 32 x 512

Neural Pixels

VisibilityNeural Voxels

RenderNet: A deep convolutional network for differentiable rendering from 3D shapesThu Nguyen-Phuoc et al. NeurIPS 2018

Page 51: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Neural Voxels

32 x 32 x 32 x 16

3D Encoder

3D-2D

32 x 32 x 512

Neural Pixels

VisibilityNeural Voxels

RenderNet: A deep convolutional network for differentiable rendering from 3D shapesThu Nguyen-Phuoc et al. NeurIPS 2018

Page 52: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Neural Voxels

32 x 32 x 32 x 16

3D Encoder

3D-2D

32 x 32 x 512

Neural Pixels

2D Decoder

ShadingNeural Voxels Visibility

MSE pixel loss

RenderNet: A deep convolutional network for differentiable rendering from 3D shapesThu Nguyen-Phuoc et al. NeurIPS 2018

Page 53: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

RenderNet: A deep convolutional network for differentiable rendering from 3D shapesThu Nguyen-Phuoc et al. NeurIPS 2018

Page 54: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Contour

Toon

Ambient OcclusionRenderNet: A deep convolutional network for differentiable rendering from 3D shapes

Thu Nguyen-Phuoc et al. NeurIPS 2018

Page 55: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

RenderNet: A deep convolutional network for differentiable rendering from 3D shapesThu Nguyen-Phuoc et al. NeurIPS 2018

Page 56: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

RenderNet: A deep convolutional network for differentiable rendering from 3D shapesThu Nguyen-Phuoc et al. NeurIPS 2018

Page 57: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

3D Encoder

3D-2D Neural Pixels

2D Decoder

TextureNetwork

NeuralTexture Voxels

or

Neural Voxels

Channel-wise Concatenation

64 x 64 x 64 x 4

64 x 64 x 64 x 1

RenderNet: A deep convolutional network for differentiable rendering from 3D shapesThu Nguyen-Phuoc et al. NeurIPS 2018

Page 58: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

RenderNet: A deep convolutional network for differentiable rendering from 3D shapesThu Nguyen-Phuoc et al. NeurIPS 2018

Page 59: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Same shape, different textures

Same texture, different shapes

RenderNet: A deep convolutional network for differentiable rendering from 3D shapesThu Nguyen-Phuoc et al. NeurIPS 2018

Page 60: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Depth Map Voxel Point Cloud Mesh

Memory Good Very Poor Poor Very Good

NN friendly Great Yes No Enemy

Page 61: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Rasterization a RGB point cloud

Neural Point-Based GraphicsKA Aliev et al, arxiv 2019

Page 62: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Rasterization a neural point cloud

(First three PCA dimensions of the neural descriptor)

Neural Point-Based GraphicsKA Aliev et al, arxiv 2019

Page 63: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Rasterization a neural point cloud

(First three PCA dimensions of the neural descriptor)

Neural Point-Based GraphicsKA Aliev et al, arxiv 2019

Page 64: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Neural Point-Based GraphicsKA Aliev et al, arxiv 2019

RB

G ra

ster

izat

ion

Neu

ral r

aste

rizat

ion

Page 65: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Neural 3D Mesh RendererH Kato et al, CVPR 2018

Deferred Neural Rendering: Image Synthesis using Neural Textures

J Thies et al, Siggraph 2019

Page 66: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Model Pictures

Sub-module

End-2-End

Page 67: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

?

Page 68: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

TargetApproximation

Page 69: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

TargetApproximation RenderedApproximation

Page 70: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Loss

Back-propagate

TargetRenderedApproximation

Approximation

Page 71: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Loss

TargetRenderedApproximation

UpdatedApproximation

Back-propagate

Page 72: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Loss

TargetRenderedApproximation

UpdatedApproximation

Back-propagateFor Free

Page 73: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Loss

TargetRenderedApproximation

UpdatedApproximation

Back-propagate

Expensive

Page 74: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Loss

TargetRenderedApproximation

DecoderEncoder

Reconstruction Rendering

Page 75: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Human perception imposes coordinate frame on objects

Inductive Bias: Separate Appearance from Pose

Page 76: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Learning 3D representation from natural images without 3D supervision

HoloGAN: Unsupervised learning of 3D representations from natural imagesThu Nguyen-Phuoc et al, ICCV 2019

Page 77: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Conditional GANs

HoloGAN: Unsupervised learning of 3D representations from natural imagesThu Nguyen-Phuoc et al, ICCV 2019

Page 78: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Info GANs

Conditional GANs

HoloGAN: Unsupervised learning of 3D representations from natural imagesThu Nguyen-Phuoc et al, ICCV 2019

Page 79: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

HoloGAN: Unsupervised learning of 3D representations from natural imagesThu Nguyen-Phuoc et al, ICCV 2019

Page 80: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

RenderNet3D Generator

HoloGAN: Unsupervised learning of 3D representations from natural imagesThu Nguyen-Phuoc et al, ICCV 2019

Page 81: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

RenderNet3D Generator

3D StyleGAN

HoloGAN: Unsupervised learning of 3D representations from natural imagesThu Nguyen-Phuoc et al, ICCV 2019

Page 82: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

RenderNet3D Generator

3D StyleGAN

HoloGAN: Unsupervised learning of 3D representations from natural imagesThu Nguyen-Phuoc et al, ICCV 2019

Page 83: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

RenderNet3D Generator

3D StyleGAN

HoloGAN: Unsupervised learning of 3D representations from natural imagesThu Nguyen-Phuoc et al, ICCV 2019

Page 84: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

RenderNet3D Generator

HoloGAN: Unsupervised learning of 3D representations from natural imagesThu Nguyen-Phuoc et al, ICCV 2019

Page 85: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

RenderNet3D Generator

Real/Fake

HoloGAN: Unsupervised learning of 3D representations from natural imagesThu Nguyen-Phuoc et al, ICCV 2019

Page 86: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

RenderNet3D Generator

A representation that is unbreakable under 3D rigid-body transformations

HoloGAN: Unsupervised learning of 3D representations from natural imagesThu Nguyen-Phuoc et al, ICCV 2019

Page 87: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

HoloGAN: Unsupervised learning of 3D representations from natural imagesThu Nguyen-Phuoc et al, ICCV 2019

Page 88: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

HoloGAN: Unsupervised learning of 3D representations from natural imagesThu Nguyen-Phuoc et al, ICCV 2019

Page 89: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

HoloGAN: Unsupervised learning of 3D representations from natural imagesThu Nguyen-Phuoc et al, ICCV 2019

Page 90: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

z1 z2

Shape Controller Texture Controller

HoloGAN: Unsupervised learning of 3D representations from natural imagesThu Nguyen-Phuoc et al, ICCV 2019

Page 91: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

HoloGAN: Unsupervised learning of 3D representations from natural imagesThu Nguyen-Phuoc et al, ICCV 2019

Page 92: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Model Pictures

Forward (Computer Graphics)

Inverse (Computer Vision)

Page 93: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Model Pictures

Sub-module for Ray Tracing (Value / Policy Networks)

End-2-End Rasterization (Depthmap, Voxel, Point Cloud, Mesh)

Differentiable Rendering (Representation Learning)

Page 94: Neural Rendering - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L9.pdf · Neural Voxels 32 x 32 x 32 x 16 3D Encoder 3D-2D 32 x 32 x 512 Neural Pixels

Thu Nguyen-Phuoc Bing Xu Yongliang Yang Stephen Balaban

Lucas Theis Christian Richardt Junfei Zhang Rui Wang Kun Xu Rui Tang


Recommended