Click here to load reader
Click here to load reader
Stability of Time-varying Control for an Underactuated Biped RobotBased on Choice of Controlled Outputs
Ting Wang and Christine Chevallereau
Abstractβ This paper studies the effect of controlled outputsselection on the walking stability for an under-actuated planarbiped robot. The control is based on tracking reference motionsexpressed as function of time. First, the reference motions areadapted at each step in order to create an hybrid zero dynamicsystem. Second, the stability of the walking gait under closed-loop control is evaluated with the linearization of the restrictedPoincare map of the hybrid zero dynamics. It shows that for thesame robot, and for the same reference trajectory, the stabilityof the walking can be modified by some pertinent choices ofcontrolled outputs. Third, we find that, at the desired momentof impact for one step, the height of swing foot is nearly zero forall the stable walking, though the configurations of the robotare not the desired configurations. Based on this, we proposea new method to choose the controlled outputs to obtain thestable walking for robot. As a result, two stable domains forthe controlled outputs selection are obtained. Furthermore, wepoint out that, this kind of control law, which based on referencemotion as a function of time, can produce better convergentproperty than that based on reference motion as a function ofa state of robot via pertinent choices of controlled outputs.
I. INTRODUCTION
The primary objective of this paper is to present a feedbackcontroller that achieve an asymptotically stable, periodicwalking gait for an under-actuated planar biped robot. Thebiped studied consists of five links, connected to form twolegs with knees and a torso. It has point feet withoutactuation between the feet and ground, so the ZMP heuristicis not applicable, and thus under actuation must be explicitlyaddressed in the feedback control design.
The control of this robot is based on tracking referencemotions. There are two groups of method which depend onthe differences of reference trajectory. The first one is basedon reference trajectory as function of time and the other oneis not. In the second method, for example, the method ofvirtual constraints, which has been proved very successful indesigning feedback controllers for stable walking in planarbipeds [9], [5], [13], [15]. A recent paper [7] extended it tothe case of spatial robots. In this method, a state quantity ofthe biped, which is strictly monotonic (i.e., strictly increasingor decreasing) along a typical walking gait, is used to replacetime in parameterizing a periodic motion of the biped. Whensuch a control has converged, the configuration of the planarrobot at the impact is the desired one but this approachinvolves parameterized reference trajectories. In fact, thisparametrization method is not usual in robotics. In addition,
Ting Wang and Christine Chevallereau are with the IRCCyN,CNRS, Ecole centrale de Nantes, 1 rue de la Noe, 44321Nantes, cedex 03, France, [email protected]@irccyn.ec-nantes.fr
it is impossible to find the strictly monotonic state for somerobots, for example, robot-semiquad [1], quadruped with acurve gait, or biped with frontal motion [11], [8].
Based on these observations, we want to propose a tool toanalyze the stability of a control based on tracking referencemotion as function of time and to propose solution to obtainstable walking. It has been observed that for the same robot,and for a same known cyclic motion, a control law basedon a reference trajectory as function of the state of a robotproduces a stable walking, while a control law based onreference motion as a function of time produces an unstablewalking [4]. In fact, for the control law based on referencemotion as a function of time, most periodic walking gaits areunstable when the controlled outputs are selected to be theactuated coordinates. In [7], the effect of output selection onthe zero dynamics is discussed first time, but this is basedon parameterized reference trajectory. Next, this approachis used successfully for the control based on time-variantreference trajectory in [14], in which a pertinent choice ofoutputs is proposed, leading to stabilization without the useof a supplemental event-based controller.
This paper focus on how to choose the control outputspertinently to improve the walking stability of biped, whichis extended from [14]. Poincare method is used to analyzethe stability of limit cycles for hybrid zero dynamics. Thenumerical simulation shows the effect of controlled outputsselection on the walking stability. A new method to choosethe control outputs is proposed based on a condition ofthe swing foot height at the desired moment of impact.As a result, two stable domains for the controlled outputsselection are obtained. We compared the control property ofthis method with the method of virtual constraints [9], [5],[13], [15]. It shows the velocity converge more quickly withour method.
II. MODEL
A. Description of the robot
The biped considered walks in a vertical xz plane. It iscomposed of a torso and two identical legs. In the simulation,the robot Rabbit is considered [16]. Each leg is composedof two links articulated by a knee. The knees and the hipsare one-degree-of-freedom rotational joints.
The gait is composed of single support phases separatedby impact phases. During the single support phase, the vectorπ = [π1, . . . , π5]
β² (Fig. 1) describes the configuration of thebiped. The knee and hip relative angles are actuated, but theankle joint is not actuated. We define the vector of actuatedvariables ππ = [π2, . . . , π5]
β² and unactuated variable ππ’ = π1.
The 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems October 18-22, 2010, Taipei, Taiwan
978-1-4244-6676-4/10/$25.00 Β©2010 IEEE 4083
z
xy
π1
π2
π3 π4
π5
Fig. 1. The studied biped
B. Dynamic model
The dynamic models for single support and impact (i.e.,double support) are derived here by assuming support on leg1. The models for support on leg 2 can be written in a similarway. The Euler-Lagrange equations yield the dynamic modelfor the robot in the single support phase as
π·(π)π +π»(π, π) = π΅ Ξ =
[01Γ4
πΌ4Γ4
]π’, (1)
where π·(π) is the positive-definite (5Γ 5) mass-inertiamatrix, π»(π, π) is the (5Γ 1) vector of Coriolis and gravityterms, π΅ is an (5Γ 4) full-rank, constant matrix indicatingwhether a joint is actuated or not, and π’ is the (4Γ 1) vectorof input torques. The double support phase is assumed to beinstantaneous. During the impact, the bipedβs configurationvariables do not change, but the generalized velocities un-dergo a jump. Defining β and + denotes the moment justbefore and after impact respectively. As shown in [16], thisjump is linear with respect to the joint velocity before theimpact πβ.
π+ = πΌ(πβ)πβ, (2)
Considering the exchange of legs, the configuration afterimpact becomes :
π+ = πΈπβ, (3)
where E is a (5Γ 5) matrix which describes the transforma-tion of two legs.
Define state variables as π₯ = [π, π]β², and let π₯+ = [π+, π+]β²
and π₯β = [πβ, πβ]β². Then a complete walking motion of therobot can be expressed as a nonlinear system with impulseeffects, and written as
Ξ£ :
{οΏ½οΏ½ = π(π₯) + π(π₯)π’ π₯β /β ππ₯+ = Ξ(π₯β) π₯β β π
, (4)
where π = {(π, π)β£π§π π€(π) = 0, π₯π π€(π) > 0} is theswitching surface, π§π π€, π₯π π€ describe coordinates of swingfoot, and π’ denotes the input torques.
π(π₯) =
[π
βπ·β1(π)π»(π, π)
], π(π₯) =
[05Γ4
π·β1(π)π΅
],
and
π₯+ = Ξ(π₯β) =[πΈπΈπΌ(πβ)
]π₯β.
III. CONTROL LAW
A. The output
Since the robot is equipped for π actuators, only πoutputs can be controlled. In many papers, the controlledvariables are simply the actuated variables. In fact, the choiceof the controlled variables directly affects the behavior of therobots [7], [14]. For simplicity, we limit our analysis to thecase that the controlled variables π£ are linear expression ofthe configuration variables. They are expressed as:
π£ = π
[ππ’ππ
]=
[π1 π2
] [ ππ’ππ
], (5)
where π is a (4 Γ 5) constant matrix. To obtain a simpleexpression of π with respect to π£ and ππ’, we impose π2 beinvertible. The π outputs that must be zeroed by the controllaw are :
π¦ = π£ β π£π(π‘) (6)
where π£π(π‘) is the desired evolution of the controlled vari-ables. The function ππ(π‘) corresponding to a cyclic motion ofthe robot, which has been obtained by [6]. The cyclic motionhas been defined for one cyclic step from π‘ = 0 to π‘ = π .In the control strategy this desired trajectory is restarted ateach impact. A more precise definition of the output is :β§β¨
β©π¦ = π£ β π£π(π) π₯β /β ππ = 1 π₯β /β ππ+ = 0 π₯β β π
. (7)
For this cyclic motion, the reference motion π£π is defined by
π£π(π) = πππ(π) = π1πππ’ +π2π
ππ (8)
Since the reference trajectory is expressed as the functionof time, the torques depend on the state of the robot and oftime π . To be able to consider the closed loop state as anautonomous system, we will extend the state as π = [π₯, π ]β²,and the studied system can be written as:
Ξ£ :
{οΏ½οΏ½ = ππ(π) + ππ(π)π’(π) π₯β /β ππ+ = Ξπ(π
β) π₯β β π, (9)
where
ππ(π) =
[π(π₯)1
], ππ(π) =
[π(π₯)01Γ4
], Ξπ =
[Ξ0
].
Considering ππ’ = π1 and (5), the current configuration ofthe robot can be expressed as:
π =
[1 01Γ4
βπ2β1π1 π2
β1
] [ππ’π£
], (10)
In equation (1), π· and π» are defined as:
π· =
[π·11(1Γ1) π·12(1Γ4)
π·21(4Γ1) π·22(4Γ4)
], π» =
[π»1(1Γ1)
π»2(4Γ1)
](11)
Then dynamic model in single support (1) is rewritten as:{(π·11 βπ·12π2
β1π1)ππ’ = βπ·12π2β1π£ βπ»1(π, π)
π’ = (π·21 βπ·22π2β1π1)ππ’ +π·22π2
β1π£ +π»2(π, π)(12)
4084
To zero the output in (6), the control input is defined as:
π£ = π£π(π)β πΎπ
π(οΏ½οΏ½ β οΏ½οΏ½π)β πΎπ
π2(π£ β π£π), (13)
where πΎπ > 0, πΎπ > 0, and π > 0. Using the control (13),the desired torques π’ in (12) can be calculated.
B. The Zero dynamics
The zero dynamics [10] is defined to describe the behaviorof the system when the outputs are assumed to be zero. Thereare two objectives of introducing the zero dynamics. Firstly,we want to study the effect of the uncontrolled variable ππ’on the property of the closed-loop system with a perfectcontrol law. Our second objective is to analyze the stabilityof walking in a reduced space. If the output is zero then using(6) and the first line of (12), the zero dynamics is definedby:{
π£(π‘) = π£π(π)(π·11 βπ·12π2
β1π1)ππ’ = βπ·12π2β1π£π(π)βπ»1(π, π)
(14)The zero dynamics can also be expressed with the vari-
ables ππ’ and ππ. Using the equation (8), we have:
π1ππ’ +π2ππ = π£ = π1πππ’ +π2π
ππ (15)
Since π2 is invertible, there exists:β§β¨β©
ππ(π) = πππ +πβ12 π1(π
ππ’ β ππ’)
(π·11 βπ·12π2β1π1)ππ’ = βπ·12(π2
β1π1πππ’
+πππ)βπ»1(π, π)(16)
This equation clearly shows that the behavior of the robotwill be affected by the value of π2
β1π1. In fact, accordingto the definition of the controlled outputs π£ in (5), theintroduction of π1 permits to take into account the trackingerror of the uncontrolled variable ππ’. In the following weimpose that π2 = πΌ4Γ4, which is a identity matrix. Whenπ1 = [0, 0, 0, 0]β², the controlled variables are simply theactuated variables ππ (see (5)).
We can clearly see that the dynamic properties of theswing phase zero dynamics depend on the particular choiceof the reference motion π£π(π) or ππ(π). For the same de-sired periodic motion, the choice of the controlled variablesdirectly affects the zero dynamics in (16). It will be provedwith the simulation results in section V-A.
C. The Hybrid Zero dynamics
According to [16, Chap. 5], while the feedback controllaw (12) and (13) have created a zero dynamics of the stancephase dynamics, it has not created a hybrid zero dynamics,that is, the zero dynamics considering the impact model(2). If the control law could be modified so as to create ahybrid zero dynamics, then the study of the swing phase zerodynamics (14) and the impact model would be sufficient todetermine the stability of the complete system of the robot,thereby leading to a reduced-dimension stability test [12].
The reference motions are modified stride to stride so thatthey are compatible with the initial state of the robot at the
beginning of each step [7]. The new output for the feedbackcontrol design is
π¦π = π£(π)β π£π(π)β π£π(π, π¦π, οΏ½οΏ½π). (17)
This output consists of the previous output (6), and acorrection term π£π that depends on (6) evaluated at thebeginning of the step, specifically, π¦π = π£(0) β π£π(0) andοΏ½οΏ½π = οΏ½οΏ½(0) β οΏ½οΏ½π(0). The values of π¦π and οΏ½οΏ½π are updated atthe beginning of each step (or at impact) and held constantthroughout the step. The function π£π is taken to be a three-times continuously differentiable function of π such that1β§β¨
β©π£π(0, π¦π, οΏ½οΏ½π) = π¦ποΏ½οΏ½π(0, π¦π, οΏ½οΏ½π) = οΏ½οΏ½ππ£π(π, π¦π, οΏ½οΏ½π) β‘ 0, π β₯ π
2 .(18)
With π£π designed in this way, the initial errors of the outputand its derivative are smoothly joined to the original virtualconstraint at the middle of the step, and π£π doesnβt introduceany discontinuity on the desired trajectory. In particular, forany initial error, the initial reference motion π£π is exactlysatisfied by the second part of the step : π β₯ π
2 .Under the new control law defined by (17), the behavior
of the robot is completely defined by the impact map andthe swing phase zero dynamics (14), where π£π is replacedby π£π + π£π, since π2 = πΌ4Γ4, this equation becomes :{
π£(π) = π£π(π) + π£π(π, π¦π, οΏ½οΏ½π)(π·11 βπ·12π1)ππ’ = βπ·12(π£
π + π£π)βπ»1(π, π)(19)
The zero dynamics manifold is defined by π ={(π, π, π)β£π¦π(π) = 0, οΏ½οΏ½π(π) = 0}, this manifold can beparametrized by a vector of dimension 3: (π, ππ’, ππ’). Whenthe reference trajectory is a function of the state variable[16], the zero dynamics manifold is of dimension 2, in ourcase, the supplementary variable π must be considered. Byintroducing π£π, the resulted walking motion can remain inthe manifold π in the presence of impact phase.
IV. STABILITY ANALYSIS
The stability analysis of walking can be done with thePoincare method. Since with the chosen control law, the stateof the robot remains on the zero dynamics manifold, thestability analysis can be done in a reduced space.
Different Poincare section can be considered. Usually, forbiped, the Poincare section is defined just before the impact.This choice implies that the perturbation of the state thatare introduced for the calculation of the Jacobian of thePoincare map is such that the two legs touch the ground.As a consequence the determination of the perturbation isnot obvious. To avoid this, we will consider the Poincaresection at π = 0.75π , at this instant the swing leg tip doesnot touch the ground and since π = 0.75π β₯ π
2 the value ofthe controlled variable are not affected by π£π.
A restricted Poincare map is defined from ππ β© π toππ β© π, where π = {(π, π, π)β£π¦π(π) = 0, οΏ½οΏ½π(π) = 0} and
1In our specific application, we used a four order polynomial for 0 β€ π β€π2
; continuity of position, velocity and acceleration is ensured at π = π2
.
4085
ππ = {(π, π, π)β£π = 0.75π} is the Poincare section. Thekey point is that since in π the state of the robot can beparametrized by three independents variables, in ππ β© π,the state of the robot can be represented using only twoindependent variables, π₯π§ = [ππ’(0.75π ), ππ’(0.75π )]
β², whereππ’ denotes the unactuated joint.
The known cyclic motion ππ(π) gives a fixed point π₯π§β =(ππ’
π(0.75π ), πππ’(0.75π )) for the proposed control law forany value of π1.
The restricted Poincare map π π§ : ππ β© π β ππ β© πinduces a discrete-time system π₯π§π+1 = π π§(π₯π§π). From [12],for π sufficiently small in (13), the linearization of π π§
about a fixed-point determines exponential stability of thefull order closed-loop robot model. Define πΏπ₯π§π = π₯π§π β π₯π§β,the Poincare map linearized about the fixed-point π₯π§β givesrise to a linearized system,
πΏπ₯π§π+1 = π΄π§πΏπ₯π§π, (20)
where the (2 Γ 2) square matrix π΄π§ is the Jacobian of thePoincare map [14].
A fixed-point of the restricted Poincare map is locallyexponentially stable, if, and only if, the eigenvalues of π΄π§
have magnitude strictly less than one [16, Chap. 4].
V. EXAMPLES
Using optimization techniques developed in [6], an op-timal cyclic motion has been defined for the robot Rabbitdescribed in section II-A. The corresponding stick-diagramof the walking gait and the joint profiles of each angle havebeen presented in [14]. To investigate the influence of thechoice of the controlled outputs (via the matrix π1) on thestability of the control law for a particular desired cyclicmotion, π£π(π), οΏ½οΏ½π(π) and π£π(π) have to be known. Since ππ
are known, according to (8), the desired reference motionare defined by : β§β¨
β©π£π(π) = πππ(π)οΏ½οΏ½π(π) = πππ(π)π£π(π) = πππ(π)
(21)
A. Stability Analysis with Different Choices of π1
It is shown that the hybrid zero dynamics depends on thechoice of the output (19). We will explore the effect of π1
on the stability of the control law.To study the stability of this control law around the
periodic motion, we need to compute the eigenvalues of π΄π§
in (20), which are noted as π1,2. Generally, it is possible touse an optimization technique to find a vector π1 such thatthe maximal norm eigenvalues of π΄π§ is less than one, as:
πππ₯ β£π1,2β£ < 1, (22)
Here we use an exploration technique to illustrate the effectof the choice of the output. We fix arbitrary three componentsof π1 to zero and πππ₯ β£π1,2β£ are drawn as function of thefourth component of π1. The results are shown in Fig. 2,where the red points note that πππ₯ β£π1,2β£ < 1. In order tofind these stable points, we search more precisely of π1(π),
β10 β5 0 5 100
50
100
150
π1(1)
πππ₯β£π
1,2β£
β2 0 2 40
100
200
300
400
π1(2)
πππ₯β£π
1,2β£
β5 0 50
20
40
60
80
100
120
π1(3)
πππ₯β£π
1,2β£
β10 β5 0 50
10
20
30
40
50
60
π1(4)
πππ₯β£π
1,2β£
Fig. 2. πππ₯ β£π1,2β£ versus π1(π), π = 1, 2, 3, 4, when the other threecomponents of π1 are zero.
π = 1, 2, 3, 4 near the πππ₯ β£π1,2β£ = 1. We can find threemain results from Fig. 2.1) When π1 get into some domains, the eigenvalues of
π΄π§ can change to infinity, as a result, the swing foot canβttouch the ground, but when π1 get out of it, the eigenvaluesof π΄π§ diminish instantaneously, as shown in π1(1) of Fig. 2.We observed that these domains are due to the singularity ofthe controller.πππππ : If we want to calculate the required torques π’ in
(12), we must obtain ππ’ using the first line of (12), whichcan be rewritten as:
ππ’ =βπ·12π£ βπ»1(π, π)
π·11 βπ·12π1, (23)
If there exist values of π1 such that:
π·11 βπ·12π1 = 0, (24)
the controller is singularity.During the swing phase, π·11 and π·12 are almost constant.
At the desired impact moment π π, there are π·11 β 24.2445,π·12 β [13.4347, 3.6283, 0.5375, 0.0410]. When π1 =[π1(1), 0, 0, 0], (24) is satisfied for π1(1) = 1.8046. Asshown in π1(1) of Fig. 2, there is a jump of πππ₯ β£π1,2β£near this value. Using the same method, we can deduce thatwhen π1 = [0, 6.6821, 0, 0]β², π1 = [0, 0, 45.1060, 0]β² andπ1 = [0, 0, 0, 592.7751]β², the controller will be singular too.2) For some π1, there exists a minimum πππ₯ β£π1,2β£ which
leads to stable walking.3) The πππ₯ β£π1,2β£ is very sensitive to some change of π1.
When π1 is modified a little, the stable walking gait maybecome unstable.
Here points 2) and 3) show that it is possible but difficultto choose π1 leading to stable walking. In order to furtherillustrate that, we choose different values of π1 to teststability. Each component of π1 is sampled between β10and 10 with a step of 2.5 to build 94 vectors π1. For eachvector π1, the effective eigenvalues of the Poincare map,πππ₯ β£π1,2β£, is calculated. Except some cases in which theswing foot canβt touch the ground because of the singularityof the controller, there are only 19 stable cases and 5363
4086
unstable cases. The vectors π1 leading to stable walkingare scattered and there is no obvious condition of stabilitycan be determined. Is there a method to help in the choiceof π1? In the next subsection, we will try to find a way ofsettling this problem.
B. A Method for Pertinent Choice of π1
For all cases corresponding to different π1 which werepresented in the previous paragraph, we choose some walk-ing characteristics to observe whether there exists differencebetween the stable and unstable cases. They are:
β position of πΆππ (Center of Mass) at the desired impactmoment π π,
β kinetic energy just before impact and after impact (see[2] and [3]),
β errors of uncontrolled variable ππ’ during the swingphase and at impact moment,
β height of swing foot ππ π€ at π π.We have not observed any special relations between the
effective eigenvalues of the Poincare map πππ₯ β£π1,2β£ and thefirst three walking characteristics, but we found ππ π€ β 0 atπ π for all stable cases, that is, πππ₯ β£π1,2β£ < 1, see Fig. 3.
To calculate ππ π€ at π π, we use the equation (16) to write:
π£(π π) = π£π(π π), (25)
and we suppose that there exists an error Ξππ’ on theunderactuated variable of biped ππ’, Ξππ’ = ππ’(π )β ππ’
π(π ).Considering (25), (10) and π2 = πΌ4Γ4, there is:
π(π π) = ππ(π π) +ππΞππ’ (26)
where ππ = [1,βπ1]β². We can compute ππ π€(π
π) withoutsimulation of the biped walking. We suppose Ξππ’ = 10β2,which is adequate to estimate the real Ξππ’ in the walkingsimulation. Then we get ππ π€(π
π) for all the case of π1(π) β[β10, 10], π = 1, 2, 3, 4, which was presented in the previoussubsection. The result is shown in Fig. 3 (we only presentthe cases of πππ₯ β£π1,2β£ < 50 ), where the red points denoteπππ₯ β£π1,2β£ < 1, that is, the stable cases.
0 10 20 30 40 50
β3
β2
β1
0
1
2
3
x 10β3
πππ₯ β£π1,2β£
ππ π€(π
π)
Fig. 3. The height of the swing foot at the end of the step ππ π€(π π) withrespect to the effective eigenvalues of the Poincare map, πππ₯ β£π1,2β£.
Based on Fig. 3, we conjecture that:A necessary condition for stable walking is ππ π€(π
π) β 0.This condition implies that the swing foot still can touch
ground at π π with an error of the unactuated variable. Ifthere is an error on the impact moment, π β= π π, there
0 20 40 60β10
0
10
20
30
40
50
60
70
π1(3)
π1(2)
β60 β40 β20 0β70
β60
β50
β40
β30
β20
β10
0
π1(3)
π1(2)
Fig. 4. The stable domain of π1(2) and π1(3) for two groups of solution,which is described with contour line πππ₯ β£π1,2β£ = 1
will exist not only the error of configuration and velocityof unactuated variable Ξππ’, Ξππ’, but also that of controlledvariables Ξπ£, ΞοΏ½οΏ½. These errors can lead to unstable walking.The necessary condition ππ π€(π
π) β 0 avoids the error onthe impact moment, so the errors Ξπ£ and ΞοΏ½οΏ½ are avoided.
Next is how to choose π1 with this necessary condition.According to (26), ππ π€ can be written with Taylor series:
ππ π€(π(ππ)) β ππ π€(π
π(π π)) + πΞππ’ + πΞππ’2 (27)
where π = βππ π€(ππ(ππ))βπ ππ and π =
12ππ
β² β2ππ π€(ππ(ππ))βπ2 ππ . Since the function of ππ π€(π(π
π))is highly nonlinear, here we considered the first two termsof the Taylor series, not only the first one.
If π and π satisfy :
π = π = 0 (28)
ππ π€(π(ππ)) is close to zero for any Ξππ’.
Two constraint equations exist in (28) for four componentsof π1, so only two components can be chosen. The condition(28) leads to two groups of solution of π1, because theequation π = 0 is a second order equation as function ofπ1. For each group of solution, π1(1) and π1(4) arededuced from π1(2) and π1(3). Then we search differentπ1(2) and π1(3) to analyze the stability of the system ,we can obtain large stable domain of π1 for each group ofsolution, as shown in Fig. 4, it is described with contour lineπππ₯ β£π1,2β£ = 1. The system is always stable as long as π1
is chosen in these domains.
C. Compare with The Method of Virtual Constraints
Here we will compare our method with the method ofvirtual constraints, which has been proved effectively indesigning feedback controllers for stable walking in planarbipeds [9], [5], [13], [15]. It is worth mentioning thatexperimental tests based on the method of virtual constraintshas been carried out successfully with Rabbit by the researchteam of J. Grizzle [16]. Here the compare of two method isbased on the same model of the robot, the same desiredtrajectory and the same method of stability analysis. It isshown that stability of an orbit is independent of the choiceof the output, as long as the constraints yield a nonsingularcontroller [16, Chap. 6]. As a result, the control propertiescan not be modified in this way. On the contrary, it can beimproved by a good choice of controlled outputs with ourmethod.
4087
In general, the effective eigenvalues of the Poincare map isthe smaller the better for the stability, so we can choose π1
according to the desired property. For example, we chooseπ1 = [1.9121,β4,β3,β1.4090] from the second solutionof Fig. 4, the result of stability analysis is πππ₯ β£π1,2β£ =0.2392. Then the planar bipedβs model in closed-loop issimulated with this π1 and the method of virtual constraints.The initial errors 0.01πππ and 0.1πππ/π are introduced oneach joint and itβs velocity respectively. As shown in Fig. 5,for the method of virtual constraints, the configuration of therobot at the impact is the desired one but the velocity con-verge more slowly. With our method, the velocity convergeto zero after walking four steps, and the control input π’ alsofollows the desired torque after these four steps. The torquesof the swing leg (see Fig. 1) are shown in Fig. 6.
0 1 2 3 4 5 6 7 8 9 10β0.02
0
0.02
0.04
0.06
0 1 2 3 4 5 6 7 8 9 10β0.05
0
0.05
0.1
0.15
the method of output selectionthe method of virtual constraints
the method of output selectionthe method of virtual constraints
Ξπ π’
(rad
)Ξπ π’
(rad
/s)
step number
step number
Fig. 5. The difference of the real values and desired values of ππ’ at theend of each step.
0 0.5 1 1.5 2 2.5 3 3.5 4
β40
β20
0
20
0 0.5 1 1.5 2 2.5 3 3.5 4
β20
β10
0
10
20
30
the real torque the disired torque
the real torque the disired torque
π’(4)(π
β π)
π’(3)(π
β π)
time (s)
time (s)
Fig. 6. The torques of the swing leg, where π’(3) notes the torque of torsoand π’(4) notes the torque of knee.
As stated above, the stability of walking can be improvedby pertinent choice of controlled outputs, furthermore, itis possible to produce better convergent property than theprevious method for this planar bipedal model.
VI. CONCLUSIONS
In this paper, a simple planar bipedal model has beenstudied, with the objective of developing a time-variant feed-
back control law that induces asymptotically stable walking,without relying on the use of large feet. We showed that theproperty of zero dynamics of the walking model is affectedby the choice of the controlled outputs. In addition, based onthe numerical results, we conjectured a necessary conditionfor stable walking is that the height of the swing foot at thedesired impact moment is close to zero. With this necessarycondition, two large stable domains are obtained. Finally, thesimulation results proved the validity and superiority of thismethod.
VII. ACKNOWLEDGMENT
The authors want to thank J. Grizzle and C.-L. Shih fortheir contribution to this study.
REFERENCES
[1] Y. Aoustin, C. Chevallereau, and A. Formalβsky, βNumerical andexperimental study of a virtual quadrupedal walking robot - semiquad,βMultibody System Dynamic, 2005.
[2] F. Asano and Z.-W. Luo, βAsymptotically stable gait generation forbiped robot based on mechanical energy balance,β in Proceedings ofthe 2007 IEEE/RSJ International Conference on Intelligent Robots andSystems, San Diego, USA, October 2007, pp. 3328β3333.
[3] ββ, βAsymptotic stability of dynamic bipedal gait with constraint onimpact posture,β in Proceedings of 2008 IEEE International Confer-ence on Robotics and Automation, USA, May 2008, pp. 1246β1251.
[4] C. Chevallereau, Modern Control Theory. Esculapio, Bologna (Italy),1999, ch. 7,βUnder actuated biped robotβ.
[5] C. Chevallereau, G. Abba, Y. Aoustin, F. Plestan, E. Westervelt, andet al., βRabbit: A testbed for advanced control theory,β IEEE ControlSystems, vol. 23, no. 5, pp. 57β78, 2003.
[6] C. Chevallereau and Y. Aoustin., βOptimal reference trajectories forwalking and running of a biped robot,β Robotica, vol. 19, no. 5, pp.557β569, 2001.
[7] C. Chevallereau, J. Grizzle, and C. Shih, βAsymptotically stablewalking of a five-link underactuated 3d bipedal robot,β IEEE Trans.on Robotics, vol. 25, no. 1, pp. 37β50, February 2009.
[8] T. Fukuda, M. Doi, Y. Hasegawa, and H. Kajima, Fast Motions inBiomechanics and Robotics. Springer-Verlag, 2006, chapter Multi-Locomotion Control of Biped Locomotion and Brachiation Robot.
[9] J. Grizzle, G. Abba, and F. Plestan, βAsymptotically stable walkingfor biped robots: Analysis via systems with impulse effects,β IEEETransactions on Automatic Control, vol. 46, pp. 51β64, January 2001.
[10] A. Isidori, Nonlinear Control Systems, 3rd ed. Berlin: Springer-Verlag, 1995.
[11] A. Kuo, βStabilization of lateral motion in passive dynamic walking,βInternational Journal of Robotics Research, vol. 18, no. 9, pp. 917β930, 1999.
[12] B. Morris and J. Grizzle, βHybrid invariant manifolds in systems withimpulse effects with application to periodic locomotion in bipedalrobots,β IEEE Transactions on Automatic Control, vol. 54, no. 8, pp.1751β1764, 2009.
[13] F. Plestan, J. Grizzle, E. Westervelt, and G. Abba, βStable walk-ing ofa 7-dof biped robot,β IEEE Transactions on Robotics and Automation,vol. 19, no. 4, pp. 653β668, 2003.
[14] T. Wang, C. Chevallereau, and C.-L. Shih, βChoice of output fortime-variant walking control for a five-link underactuated planar bipedrobot,β in Proceedings of 2009 IEEE-RAS International Conference onHumanoid Robots, Paris, France, December 2009.
[15] E. Westervelt, J. Grizzle, and D. Koditschek, βHybrid zero dynamicsof planar biped walkers,β IEEE Transactions on Automatic Control,vol. 48, no. 1, pp. 42β56, 2003.
[16] E. Westervelt, J. Grizzle, C. Chevallereau, J. Choi, and B. Morris,Feedback Control of Dynamic Bipedal Robot Locomotion, ser. Controland Automation. Boca Raton: CRC Press, June 2007.
4088