+ All Categories
Home > Documents > NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on...

NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on...

Date post: 26-Sep-2020
Category:
Upload: others
View: 3 times
Download: 0 times
Share this document with a friend
42
NIPS 2016 Ni Lao 2017/1/31
Transcript
Page 1: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

NIPS 2016Ni Lao

2017/1/31

Page 2: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

NIPS 2016Largest ML conference

Barcelona, Spain

6000 attendees

Dec 5 tutorials, posters

Dec 6,7 presentations, posters

Dec 8 presentations, symposiums

Dec 9,10 workshops

Page 3: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

AI is getting popular● A lot of industry presence

○ Facebook, Microsoft, Amazon, NVIDIA, most of Google Brain and most of DeepMind

○ Automotive, financial, e-commerce, and all kind of companies looking to grow their ML groups

○ Startup founders and CEOs of AI companies walking around

Credit http://blog.evjang.com/2017/01/nips2016.html

Page 4: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

The venue for big announcements● (2015) Google gave its introduction/tutorial on

TensorFlow, released its best model on ImageNet

● (2015) OpenAI announced its existence

● OpenAI released their Universe platform for developing a reinforcement learning agent

● DeepMind open-sourced their platform for experimenting with RL agent

● Uber announced they had acqui-hired Geometric Intelligence in order to expedite their own AI efforts

● Apple announced that it will publish it’s research and make source-code available

Credit http://blog.evjang.com/2017/01/nips2016.html

Page 5: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Plan● Keynote● Generative Adversarial Nets (GAN)● Deep RL● Meta-Learning

Page 6: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Keynote

-- Yann Lecun

Page 7: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

LeCun’s Cake● learning needs

to be mostly unsupervized

● for good data representations

Page 8: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

He Really Meant End-to-End Training

Page 9: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Learning Agents● Inspiration from human brain ● The essence of intelligence is

the ability to predict

Page 10: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Generative Adversarial Nets (GAN)

-- a tutorial by Ian Goodfellow

“the biggest breakthrough in Machine Learning in the last 1-2 decades.”

-- Yann Lecun

Page 11: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Why study generative models?

● Realistic generation tasks● Semi-supervised learning

“What I cannot create, I do not understand.”—Richard Feynman

[Vinyals, Le, 2015]

Page 12: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Super-Resolution[Ledig+, 2016]

Page 13: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Art

Figure 2: Images that combine the content of a photograph with the style of several well-known artworks. The images were created by finding an image that simultaneously matches the content representation of the photograph and the style representation of the artwork (see Methods).

[Gatys, Ecker, Bethge, 2015]

Page 14: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Graphics[Isola+, 2016]

Page 15: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform
Page 16: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform
Page 17: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

GAN as a way of regularization

G

D

data pointsdata points

● Less incentive to fit individual data points

Page 18: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Deep Convolutional (DC) GAN[Radford+ 2016]

Page 19: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Deep Convolutional (DC) GAN[Radford+ 2016]

Page 20: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Deep RL-- tutorials by Pieter Abbeel and John Schulman

Page 21: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Reinforcement Learning● Any ML problem can be formulated as a RL problem

Page 22: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Policy Optimization

Page 23: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

A relatively new field with recent successes

Page 24: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

The RL landscape ● Simple ● Stable

● Data efficient

Page 25: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Cross-Entropy Method● Can work surprisingly well● not data efficient

[Szita and Lorincz 2006]

Page 26: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Likelihood Ratio Policy Gradient

● Optimizing the expected utility

● Is almost the same as MLE except for a weight P(t;θ)R(t)

● Valid even if R and sample space are discrete!!

● Unstable, need good model initialization and ways to reduce gradient variances

Page 27: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

The Step Size Problem

● Bad stability

Page 28: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Surrogate Objective● Collect data with an old policy (for stability)● Reweight examples by importance sampling

○ The probability ratio between the new policy and the old policy

[Kakade and Langford 2002][Schulman+ 2015]

Page 29: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Experience Replay● Keep a set of (hard to find, or human generated) good examples ● Repeatedly use them for training (together with recent bad examples)

○ E.g., 1M replay buffer for DQN Artari training○ E.g., Neural symbolic machines keep track of the best program for each query

[Mnih+ 2015, 2016][Chen+ 2016]

Page 30: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

The Delayed Reward Problem

● Bad data efficiency

Page 31: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Bootstrapping● use the value function to estimate future rewards

Page 32: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Advantage Actor-Critic[Mnih+ 2015, 2016]

● Minimize reward loss and value function error at the same time

Page 33: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Meta-Learning

Page 34: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Meta-Generative Models● Two models works better than one

[Radford+ 2016]

Page 35: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Meta-RL Models[Wang+ 2016]

● Two learning systems:

○ one lower-level system that learns relatively quickly, and which is primarily responsible for adapting to each new task;

○ and a slower higher-level system that works across tasks to tune and improve the lower-level system

Page 36: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Meta-RL Models[Kulkarni+ 2016]

Page 37: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Meta-Optimizer[Andrychowicz+ 2016]

● Control NN parameter updates using LSTMs

Page 38: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Multiresolution Caption Model

[Yang+ 2016]

Page 39: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Multiresolution Dialogue Models

[Serban+ 2016]

● Sorry I can't find their poster online

Page 40: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

[Serban+ 2016]

Multiresolution Dialogue Models

Page 41: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Thanks

Page 42: NIPS 2016 · (2015) Google gave its introduction/tutorial on TensorFlow, released its best model on ImageNet (2015) OpenAI announced its existence OpenAI released their Universe platform

Reference1. Overview http://beamandrew.github.io/deeplearning/2016/12/12/nips-2016.html2. Overview http://blog.evjang.com/2017/01/nips2016.html3. Keynote: https://t.co/LDzqac7na14. GAN: https://arxiv.org/abs/1701.001605. GAN: http://www.slideshare.net/indicods/deep-advancements-in-generative-modeling6. RL: http://people.eecs.berkeley.edu/~pabbeel/nips-tutorial-policy-optimization-Schulman-Abbeel.pdf7. RL: http://rll.berkeley.edu/deeprlcourse/docs/nuts-and-bolts.pdf


Recommended