Advanced Techniques for Mobile Robotics Gaussian...

transcript

Wolfram Burgard, Cyrill Stachniss,

Kai Arras, Maren Bennewitz

Gaussian Processes in Robotics

Advanced Techniques for Mobile Robotics

Overview

§  Regression problem

§  Gaussian process models

§  Learning GPs

§  Applications

§  Summary

The Regression Problem §  Given n observed points

§  Assuming the dependency

§  How to predict new points

The Regression Problem §  Solution 1: Parametric models

§  Linear

§ Quadratic

§ Higher order polynomials

§ …

§  Learning: optimizing the parameters

The Regression Problem §  Solution 1: Parametric models

The Regression Problem §  Solution 2: Non-parametric models

§  Radial Basis functions

§ Histograms, Splines, Support Vector Machines …

§  Learning: finding the structure of the model and optimize its parameters

The Regression Problem §  Solution 3: Express

directly in terms of the data points

§  Idea: Any finite set of values sampled from has a joint Gaussian distribution with a covariance matrix given by

Gaussian Process Models §  Then, the n+1 dimensional vector

which includes the new target to be predicted , comes from an n+1 dimensional Gaussian

§  The predictive distribution for the new target is a 1-dimensional Gaussian

§  Given the n observed points §  Squared exponential covariance

function

§  with §  and a noise level

Gaussian Process Model

Gaussian Process Models §  GP model

Learning GPs §  The squared exponential

covariance function:

§  Easy to interpret parameters

amplitude

index/input distance

characteristic lengthscale

noise level

Learning GPs §  Example: low noise

Learning GPs §  Example: medium noise

Learning GPs §  Example: high noise

Learning GPs §  Example: small lengthscale

Learning GPs §  Example: large lengthscale

Learning GPs §  Covariance function specifies the prior

prior posterior

Gaussian Process Models §  Recall, the n+1 dimensional vector

comes from an n+1 dimensional normal distribution

§  The predictive distribution for the new target is a 1-dimensional Gaussian.

§  Why?

The Gaussian Distribution §  Recall the 2-dimensional joint Gaussian:

§  The conditionals and the marginals are also Gaussians Figure taken from

Carl E. Rasmussen: NIPS 2006 Tutorial

The Gaussian Distribution §  Simple bivariate example:

marginal conditional

The Gaussian Distribution § Marginalization:

The Gaussian Distribution §  The conditional:

The Gaussian Distribution §  Slightly more complicated in the general

§  The conditionals and the marginals are also Gaussians

Figure taken from Carl E. Rasmussen: NIPS 2006 Tutorial

The Gaussian Distribution §  Conditioning the joint Gaussian in general

§  In case of zero mean:

Gaussian Process Models §  Recall the GP assumption

Gaussian Process Models §  Noise-free mean and variance of the

predictive distribution have the form

§  with

Gaussian Process Models §  Mean and variance of the predictive

distribution then lead to

§  with

Learning GPs §  Learning a Gaussian process means

§  choosing a covariance function §  finding its parameters and the noise level

§  What is the objective?

Learning GPs §  The hyperparameters

can be found by maximizing the likelihood of the training data e.g., using gradient methods

Learning GPs §  Objective: high data likelihood

§  Due to the Gaussian assumption, GPs have Occam’s razor built in

data fit complexity penalty

const.

Occam‘s Razor §  Use the simplest explanation that is

needed to describe the data

§  Data-fit favors overfitting §  Complexity penalty favors simplicity

too long just right too short

Advanced Topics / Extensions §  Classification/non-Gaussian noise §  Sparse GPs: Approximations for large

data sets §  Heteroscedastic GPs: Modeling non-

constant noise §  Nonstationary GPs: Modeling varying

smoothness (lengthscales) § Mixtures of GPs §  Uncertain inputs §  …

Advanced Techniques for Mobile Robotics Gaussian...

Documents