+ All Categories
Home > Documents > Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc ›...

Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc ›...

Date post: 30-May-2020
Category:
Upload: others
View: 4 times
Download: 0 times
Share this document with a friend
26
Lecture 12: RLSC - Prof. Sethu Vijayakumar 1 Lecture 12: Trajectory Formation Contents: Based on Heuristic Optimisation ZMP based walking Optimization Criterion Based Minimum Distance, Time Minimum Acceleration Change, Torque Change Minimum End Point Variance Multiple Model Learning Using Motor Redundancies Efficiently
Transcript
Page 1: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 1

Lecture 12: Trajectory Formation

Contents: • Based on Heuristic Optimisation

• ZMP based walking

• Optimization Criterion Based • Minimum Distance, Time

• Minimum Acceleration Change, Torque Change

• Minimum End Point Variance

• Multiple Model Learning

• Using Motor Redundancies Efficiently

Page 2: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 2

Trajectory Planning Phases

Trajectory generation Involves computation of the best trajectory for the object

Force Distribution Involves determining the force distribution between different actuators

(a.k.a. resolving actuator redundancy)

Some of the approaches solve the trajectory generation and force distribution problems separately in two phases.

It has been argued that solving the two issues simultaneously (as a global optimization problem) is superior in many cases.

Kinematics: refers to geometrical and time-based properties of motion; the

variables of interest are positions (e.g. joint angles or hand Cartesian coordinates) and their corresponding velocities, accelerations and higher derivatives.

Dynamics: refers to the forces required to produce motion and is therefore intimately linked to the properties of the object such as it’s mass, inertia and stiffness.

Page 3: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 3

Question: how to generate good trajectories?

• (Very) simple options: • Provide a few desired positions (angles) over time

(controller with step functions) • Provide smooth hand-tuned trajectories (e.g. with spline

fitting) • Use a sinusoidal controller

• These are open-loop solutions, i.e. no feedback to the

trajectory planner

Traj. Planning: Simple methods

Page 4: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 4

Simply provide a new desired angle every so often:

DOF i

DOF j

Desired angle Actual angle

Controller with step function

Problems: • very saccadic motion, • depends strongly on PID gains

Page 5: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 5

• Provide a trajectory as a vector • Hand-tuned trajectory: usually only give a few via points • Use of spline-fitting to interpolate

DOF i

Hand-tuned trajectories

Problems: • How to be sure that the locomotion is stable? • Need for mathematical tools to prove stability,

c.f. ZMP control in a few slides

Page 6: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 6

• Desired angle for each DOF:

• Problem: finding, for each DOF i suitable

)sin(0

iiiii tA

iiii A and ,,0

Ai

Ti ji

jj Tj

Aj

DOF i

DOF j

qi0

qj0

Open-loop sinus-based controller

Page 7: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 7

• Main idea: design walking kinematic trajectories, and use the dynamic equations to test and prove that locomotion is stable

• Trajectories are designed by trial-and-error, or from human recordings

• Most successful approach: Zero Moment Point (ZMP) method (Vukobratovic 1990)

Walking based on Trajectory Methods

Page 8: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 8

• Method for proving that a trajectory is stable

• Zero Moment Point (ZMP): point on the ground about which the net moment of the inertial forces and the gravity forces has no component along the horizontal planes (a.k.a. center of pressure, CoP).

• ZMP is different from the projection of the center of mass (CoM) on the ground

• ZMP ~ projection on the ground of the point around which the robot is rotating

Zero Moment Point Approach

Page 9: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

9

PZMP is such that:

TN

i

iiiiiiiiii mm ,*)0,0()( grωIωαIar

ZMPii ppr

gravity

velocity angular

onaccelerati angular

(matrix) inertia of moment

onaccelerati external:

i link of mass

i link of position

links of number

:

:

:

:

:

:

:

g

ω

α

I

a

i

i

i

i

i

i

m

p

N

Two independent equations, Two unknowns: pZMP,X pZMP,Y

Moment due to ext. acc.

M. due to Coriolis forces M. due to gravity

ZMP Approach

M. due to angular acceleration

Lecture 12: RLSC - Prof. Sethu Vijayakumar

Page 10: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 10

Locomotion is stable if the ZMP remains within the foot-print polygons

Foot-print polygon

ZMP Approach

Page 11: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 11

Honda, SONY and some HRP robots use ZMP control

Page 12: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 12

Most used method: 1. Human motion capture for getting trajectories, 2. Modify trajectories such that locomotion is stable

according to the ZMP criterion 3. Add online stabilization to deal with perturbations.

Example of online stabilization: • Use of hip actuators to manipulate the ZMP

ZMP Approach

Page 13: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 13

Some Simple Cost Functions

Shortest Distance Refer: F.C.Park and R.W.Brockett, Kinematic Dexterity

of robot mechanisms, Int. Journal of Robotic Research, 1391), pp. 1-15, 1994

Minimum Acceleration Refer: L. Noakes, G. Heinzinger, and B. paden, Cubic

splines on curved surfaces, IMA Journal of Mathematical Control & Information, 6, pp. 465-473, 1989.

Minimum Time (Bang-Bang Control) Refer: Z. Shiller, Time Energy optimal control of

articulated systems with geometric path constraints, In. Proc. of 1994 Intl. Conference on Robotics and Automation, pp. 2680-2685, San Diego, CA, 1994.

t

x

t

x

t

x

.

..

Page 14: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 14

Minimum Jerk Trajectory Planning

Proposed by Flash & Hogan (1985)

Optimization Criterion minimizes the jerk in the trajectory

The minimum-jerk solution can be written as:

Depends only on the kinematics of the task and is independent of the physical structure or dynamics of the plant

Predicts bell shaped velocity profiles

dtdt

yd

dt

xdC

T

J

0

2

3

32

3

3

2

1

.0),(/ˆ

)ˆ10ˆ6ˆ15)(()(

)ˆ10ˆ6ˆ15)(()(

00

354

00

354

00

tatscoordinateinitialareyxandtttwhere

tttyyyty

tttxxxtx

f

f

f

For movement in

x-y plane

Page 15: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 15

Minimum Torque Change Planning

Proposed by Uno, Kawato & Suzuki (1989)

Optimization Criterion minimizes the change of torque

The Min. Jerk and Min. Torque change cost functions are closely related since acceleration is proportional to torque at zero speed.

No Analytical solution possible for Min. Torque change criterion but iterative solution is possible.

Like Min. Jerk, predicts bell shaped velocity profiles.

But also predicts that the form of the trajectory should vary across the arm’s workspace.

dtdt

d

dt

dC

T

T

0

2

2

2

1

2

1 For two joint

arm or robot

Page 16: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 16

Min. Jerk vs Min. Torque Change

One way of resolving how humans plan their movement is by setting up a experiment which can distinguish between the kinematic vs dynamic plans

Page 17: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 17

Min. Jerk vs Min. Torque Change (II)

Most studies suggest that trajectories are planned in visually-based kinematic

coordinates

No perturbations at the start & end.

Page 18: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 18

Minimum Endpoint Variance Planning

Proposed by Harris & Wolpert (1998)

Also called TOPS (Trajectory Optimization in the Presence of Signal dependent noise)

Refer: Harris & Wolpert, Signal-dependent noise determines motor planning, Nature, vol. 394, 780-784

Basic Theory:

Single physiological assumption that neural signals are corrupted by noise whose variance increases with the size of the control signal.

In the presence of such noise, the shape of the trajectory is selected to minimize the variance of the final end-point position.

Biologically more plausible since we do have access to end point errors as opposed to complex optimization processes (min. jerk and min. torque change integrated over the entire movement) that other optimization criterion suggest the brain has to solve.

Page 19: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 19

Testing the Internal Model Learning Hypothesis

Page 20: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 20

Learning of Internal Models

Using the force manipulandum, one can create an interesting experiment in which you modify the dynamics of your arm movement in only the x-y plane.

Humans are very adept at learning these changed dynamic fields and adapt at a relatively short time scale.

When we remove these effects, after effects of learning can be felt for some time before re-adaptation, providing evidence that we learn internal models and use them in a predictive feedforward fashion

Page 21: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 21

Multiple Model Hypothesis

Page 22: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 22

Recursive Bayes Estimation

dpDp

pDpDp

pDpn

k

k

)()|(

)()|()|(

)|()|(1

x

If we explicitly index the individual experiences by n, i.e., Ðn = {x1,…,xn},

)|()|()|( 1 n

n

n DppDp x

)()|(

)|()|(

)|()|()|(

0

1

1

pDpwhere

dDpp

DppDp

n

n

n

nn

x

x

Using … We get …

P y | x P(x | y)P(y)

P(x)

Page 23: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 23

Multiple model hypothesis (II)

Multiple Paired Forward-Inverse Models (MPFIM)

Georgios Petkos and Sethu Vijayakumar, Context Estimation and Learning Control through Latent Variable Extraction: From discrete to continuous contexts, Proc. IEEE Intl Conf on Robotics and Automation, 2007).

Page 24: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 24

74:θ)(x)( headeyef

TEyeRTEyeRPEyeLTEyeLPHeadHeadHead

T

LRLR yyxx

321

θ

x

Inverse Kinematics

Eye & Head displacements

Retinal Displacement

Resolving Motor Redundancies

Page 25: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 25

Resolving kinematics with RMRC

θθ)(x J

Resolved Motion Rate Control with locality in joint positions

Integrate over small increments (where linearity holds) to get complete trajectory

θx JBased on collected training data forward kinematics in velocity space was almost completely linear, irrespective of joint position

Hence, in this case, we can use pseudo-inversion with the constant Jacobian !!

Aaron D'Souza, Sethu Vijayakumar and Stefan Schaal,Learning Inverse Kinematics, Proc. International Conference on Intelligence in Robotics and Autonomous Systems (IROS 2001), pp. 298-303 (2001)

Page 26: Lecture 12: Trajectory Formation - wcms.inf.ed.ac.ukwcms.inf.ed.ac.uk › ipab › rlsc › lecture-notes › RLSC_Lec12_Traj.pdf · Lecture 12: RLSC - 2Prof. Sethu Vijayakumar Trajectory

Lecture 12: RLSC - Prof. Sethu Vijayakumar 26

?θ , θx what isGiven J

Null Space Manipulation

nullkJJJ ) -(Ixθ ##

icurrent

opt

inull

Lk

,

,

2

,, )(2

1min idefaulticurrentiopt wL Optimization criterion

1# )( TTJJJJ : Pseudoinverse

Optimization based on gradient descent in Null space

)( ,, idefaulticurrentiw

Inertial Weighting for each joint


Recommended