Lecture 12: RLSC - Prof. Sethu Vijayakumar 1
Lecture 12: Trajectory Formation
Contents: • Based on Heuristic Optimisation
• ZMP based walking
• Optimization Criterion Based • Minimum Distance, Time
• Minimum Acceleration Change, Torque Change
• Minimum End Point Variance
• Multiple Model Learning
• Using Motor Redundancies Efficiently
Lecture 12: RLSC - Prof. Sethu Vijayakumar 2
Trajectory Planning Phases
Trajectory generation Involves computation of the best trajectory for the object
Force Distribution Involves determining the force distribution between different actuators
(a.k.a. resolving actuator redundancy)
Some of the approaches solve the trajectory generation and force distribution problems separately in two phases.
It has been argued that solving the two issues simultaneously (as a global optimization problem) is superior in many cases.
Kinematics: refers to geometrical and time-based properties of motion; the
variables of interest are positions (e.g. joint angles or hand Cartesian coordinates) and their corresponding velocities, accelerations and higher derivatives.
Dynamics: refers to the forces required to produce motion and is therefore intimately linked to the properties of the object such as it’s mass, inertia and stiffness.
Lecture 12: RLSC - Prof. Sethu Vijayakumar 3
Question: how to generate good trajectories?
• (Very) simple options: • Provide a few desired positions (angles) over time
(controller with step functions) • Provide smooth hand-tuned trajectories (e.g. with spline
fitting) • Use a sinusoidal controller
• These are open-loop solutions, i.e. no feedback to the
trajectory planner
Traj. Planning: Simple methods
Lecture 12: RLSC - Prof. Sethu Vijayakumar 4
Simply provide a new desired angle every so often:
DOF i
DOF j
Desired angle Actual angle
Controller with step function
Problems: • very saccadic motion, • depends strongly on PID gains
Lecture 12: RLSC - Prof. Sethu Vijayakumar 5
• Provide a trajectory as a vector • Hand-tuned trajectory: usually only give a few via points • Use of spline-fitting to interpolate
DOF i
Hand-tuned trajectories
Problems: • How to be sure that the locomotion is stable? • Need for mathematical tools to prove stability,
c.f. ZMP control in a few slides
Lecture 12: RLSC - Prof. Sethu Vijayakumar 6
• Desired angle for each DOF:
• Problem: finding, for each DOF i suitable
)sin(0
iiiii tA
iiii A and ,,0
Ai
Ti ji
jj Tj
Aj
DOF i
DOF j
qi0
qj0
Open-loop sinus-based controller
Lecture 12: RLSC - Prof. Sethu Vijayakumar 7
• Main idea: design walking kinematic trajectories, and use the dynamic equations to test and prove that locomotion is stable
• Trajectories are designed by trial-and-error, or from human recordings
• Most successful approach: Zero Moment Point (ZMP) method (Vukobratovic 1990)
Walking based on Trajectory Methods
Lecture 12: RLSC - Prof. Sethu Vijayakumar 8
• Method for proving that a trajectory is stable
• Zero Moment Point (ZMP): point on the ground about which the net moment of the inertial forces and the gravity forces has no component along the horizontal planes (a.k.a. center of pressure, CoP).
• ZMP is different from the projection of the center of mass (CoM) on the ground
• ZMP ~ projection on the ground of the point around which the robot is rotating
Zero Moment Point Approach
9
PZMP is such that:
TN
i
iiiiiiiiii mm ,*)0,0()( grωIωαIar
ZMPii ppr
gravity
velocity angular
onaccelerati angular
(matrix) inertia of moment
onaccelerati external:
i link of mass
i link of position
links of number
:
:
:
:
:
:
:
g
ω
α
I
a
i
i
i
i
i
i
m
p
N
Two independent equations, Two unknowns: pZMP,X pZMP,Y
Moment due to ext. acc.
M. due to Coriolis forces M. due to gravity
ZMP Approach
M. due to angular acceleration
Lecture 12: RLSC - Prof. Sethu Vijayakumar
Lecture 12: RLSC - Prof. Sethu Vijayakumar 10
Locomotion is stable if the ZMP remains within the foot-print polygons
Foot-print polygon
ZMP Approach
Lecture 12: RLSC - Prof. Sethu Vijayakumar 11
Honda, SONY and some HRP robots use ZMP control
Lecture 12: RLSC - Prof. Sethu Vijayakumar 12
Most used method: 1. Human motion capture for getting trajectories, 2. Modify trajectories such that locomotion is stable
according to the ZMP criterion 3. Add online stabilization to deal with perturbations.
Example of online stabilization: • Use of hip actuators to manipulate the ZMP
ZMP Approach
Lecture 12: RLSC - Prof. Sethu Vijayakumar 13
Some Simple Cost Functions
Shortest Distance Refer: F.C.Park and R.W.Brockett, Kinematic Dexterity
of robot mechanisms, Int. Journal of Robotic Research, 1391), pp. 1-15, 1994
Minimum Acceleration Refer: L. Noakes, G. Heinzinger, and B. paden, Cubic
splines on curved surfaces, IMA Journal of Mathematical Control & Information, 6, pp. 465-473, 1989.
Minimum Time (Bang-Bang Control) Refer: Z. Shiller, Time Energy optimal control of
articulated systems with geometric path constraints, In. Proc. of 1994 Intl. Conference on Robotics and Automation, pp. 2680-2685, San Diego, CA, 1994.
t
x
t
x
t
x
.
..
Lecture 12: RLSC - Prof. Sethu Vijayakumar 14
Minimum Jerk Trajectory Planning
Proposed by Flash & Hogan (1985)
Optimization Criterion minimizes the jerk in the trajectory
The minimum-jerk solution can be written as:
Depends only on the kinematics of the task and is independent of the physical structure or dynamics of the plant
Predicts bell shaped velocity profiles
dtdt
yd
dt
xdC
T
J
0
2
3
32
3
3
2
1
.0),(/ˆ
)ˆ10ˆ6ˆ15)(()(
)ˆ10ˆ6ˆ15)(()(
00
354
00
354
00
tatscoordinateinitialareyxandtttwhere
tttyyyty
tttxxxtx
f
f
f
For movement in
x-y plane
Lecture 12: RLSC - Prof. Sethu Vijayakumar 15
Minimum Torque Change Planning
Proposed by Uno, Kawato & Suzuki (1989)
Optimization Criterion minimizes the change of torque
The Min. Jerk and Min. Torque change cost functions are closely related since acceleration is proportional to torque at zero speed.
No Analytical solution possible for Min. Torque change criterion but iterative solution is possible.
Like Min. Jerk, predicts bell shaped velocity profiles.
But also predicts that the form of the trajectory should vary across the arm’s workspace.
dtdt
d
dt
dC
T
T
0
2
2
2
1
2
1 For two joint
arm or robot
Lecture 12: RLSC - Prof. Sethu Vijayakumar 16
Min. Jerk vs Min. Torque Change
One way of resolving how humans plan their movement is by setting up a experiment which can distinguish between the kinematic vs dynamic plans
Lecture 12: RLSC - Prof. Sethu Vijayakumar 17
Min. Jerk vs Min. Torque Change (II)
Most studies suggest that trajectories are planned in visually-based kinematic
coordinates
No perturbations at the start & end.
Lecture 12: RLSC - Prof. Sethu Vijayakumar 18
Minimum Endpoint Variance Planning
Proposed by Harris & Wolpert (1998)
Also called TOPS (Trajectory Optimization in the Presence of Signal dependent noise)
Refer: Harris & Wolpert, Signal-dependent noise determines motor planning, Nature, vol. 394, 780-784
Basic Theory:
Single physiological assumption that neural signals are corrupted by noise whose variance increases with the size of the control signal.
In the presence of such noise, the shape of the trajectory is selected to minimize the variance of the final end-point position.
Biologically more plausible since we do have access to end point errors as opposed to complex optimization processes (min. jerk and min. torque change integrated over the entire movement) that other optimization criterion suggest the brain has to solve.
Lecture 12: RLSC - Prof. Sethu Vijayakumar 19
Testing the Internal Model Learning Hypothesis
Lecture 12: RLSC - Prof. Sethu Vijayakumar 20
Learning of Internal Models
Using the force manipulandum, one can create an interesting experiment in which you modify the dynamics of your arm movement in only the x-y plane.
Humans are very adept at learning these changed dynamic fields and adapt at a relatively short time scale.
When we remove these effects, after effects of learning can be felt for some time before re-adaptation, providing evidence that we learn internal models and use them in a predictive feedforward fashion
Lecture 12: RLSC - Prof. Sethu Vijayakumar 21
Multiple Model Hypothesis
Lecture 12: RLSC - Prof. Sethu Vijayakumar 22
Recursive Bayes Estimation
dpDp
pDpDp
pDpn
k
k
)()|(
)()|()|(
)|()|(1
x
If we explicitly index the individual experiences by n, i.e., Ðn = {x1,…,xn},
)|()|()|( 1 n
n
n DppDp x
)()|(
)|()|(
)|()|()|(
0
1
1
pDpwhere
dDpp
DppDp
n
n
n
nn
x
x
Using … We get …
P y | x P(x | y)P(y)
P(x)
Lecture 12: RLSC - Prof. Sethu Vijayakumar 23
Multiple model hypothesis (II)
Multiple Paired Forward-Inverse Models (MPFIM)
Georgios Petkos and Sethu Vijayakumar, Context Estimation and Learning Control through Latent Variable Extraction: From discrete to continuous contexts, Proc. IEEE Intl Conf on Robotics and Automation, 2007).
Lecture 12: RLSC - Prof. Sethu Vijayakumar 24
74:θ)(x)( headeyef
TEyeRTEyeRPEyeLTEyeLPHeadHeadHead
T
LRLR yyxx
321
θ
x
Inverse Kinematics
Eye & Head displacements
Retinal Displacement
Resolving Motor Redundancies
Lecture 12: RLSC - Prof. Sethu Vijayakumar 25
Resolving kinematics with RMRC
θθ)(x J
Resolved Motion Rate Control with locality in joint positions
Integrate over small increments (where linearity holds) to get complete trajectory
θx JBased on collected training data forward kinematics in velocity space was almost completely linear, irrespective of joint position
Hence, in this case, we can use pseudo-inversion with the constant Jacobian !!
Aaron D'Souza, Sethu Vijayakumar and Stefan Schaal,Learning Inverse Kinematics, Proc. International Conference on Intelligence in Robotics and Autonomous Systems (IROS 2001), pp. 298-303 (2001)
Lecture 12: RLSC - Prof. Sethu Vijayakumar 26
?θ , θx what isGiven J
Null Space Manipulation
nullkJJJ ) -(Ixθ ##
icurrent
opt
inull
Lk
,
,
2
,, )(2
1min idefaulticurrentiopt wL Optimization criterion
1# )( TTJJJJ : Pseudoinverse
Optimization based on gradient descent in Null space
)( ,, idefaulticurrentiw
Inertial Weighting for each joint