robustly stable adaptive control of a tandem of master\u0026amp;#8211;slave robotic manipulators...

19
IEEE Proof IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 36, NO. 6, DECEMBER 2006 1 Robustly Stable Adaptive Control of a Tandem of Master–Slave Robotic Manipulators With Force Reflection by Using a Multiestimation Scheme 1 2 3 Asier Ibeas and Manuel de la Sen 4 Abstract—The problem of controlling a tandem of robotic ma- 5 nipulators composing a teleoperation system with force reflection 6 is addressed in this paper. The final objective of this paper is 7 twofold: 1) to design a robust control law capable of ensuring 8 closed-loop stability for robots with uncertainties and 2) to use the 9 so-obtained control law to improve the tracking of each robot to 10 its corresponding reference model in comparison with previously 11 existing controllers when the slave is interacting with the obstacle. 12 In this way, a multiestimation-based adaptive controller is pro- 13 posed. Thus, the master robot is able to follow more accurately 14 the constrained motion defined by the slave when interacting with 15 an obstacle than when a single-estimation-based controller is used, 16 improving the transparency property of the teleoperation scheme. 17 The closed-loop stability is guaranteed if a minimum residence 18 time, which might be updated online when unknown, between 19 different controller parameterizations is respected. Furthermore, 20 the analysis of the teleoperation and stability capabilities of the 21 overall scheme is carried out. Finally, some simulation examples 22 showing the working of the multiestimation scheme complete 23 this paper. 24 Index Terms—Adaptive control, force reflection, multiple 25 model, stability, switching, teleoperation. 26 I. I NTRODUCTION 27 T ELEOPERATION systems have been widely studied dur- 28 ing the last decades because they can extend the human 29 reach and manipulation capabilities to remote places and/or 30 hazardous locations. Teleoperation strategies can be found in 31 a broad variety of applications such as in space and undersea 32 explorations, nuclear operations, or medical applications [1]. 33 Recently, as the development of the virtual reality is in 34 progress, related application areas have been extended to the 35 entertainment and training fields [2]. A teleoperation system 36 consists of a human operator (which classifies this kind of 37 systems as those named as “man in the loop systems” [3], [4]), 38 a master manipulator, a remotely located slave manipulator, 39 and an environment/obstacle situated in the slave manipulator’s 40 Manuscript received May 10, 2005; revised November 8, 2005 and January 18, 2006. This work was supported in part by the MEC and the UPV-EHU under Projects DPI 2003-00164 and 9/UPV 00I06.I06-15263/2003, respectively. The work of A. Ibeas was supported by MECD under FPU Grant P2002-2727. This paper was recommended by Associate Editor M. S. De Queiroz. The authors are with the Departamento de Ingeniería de Sistemas y Au- tomática, University of the Basque Country, Bilbao 48080, Spain (e-mail: [email protected]; [email protected]). AQ1 Digital Object Identifier 10.1109/TSMCB.2006.874693 location. The human operator applies his/her intentional force 41 to the master manipulator to make the slave to perform a series 42 of commands. If the slave manipulator exactly reproduces the 43 master’s motions and the master manipulator accurately reflects 44 to the master the measured slave force, then the operator should 45 experience the same sensation as the slave manipulator does. 46 In other words, the teleoperation system would be completely 47 “transparent” [5], [6]. In this way, the final objective of the 48 teleoperated system is to achieve a good transparency condition 49 while remaining stable. Hence, the control of a tandem of 50 master–slave manipulators with force reflection constitutes a 51 challenge for control theorist because it requires not only to 52 face the problem of controlling robot manipulators but also 53 to achieve an adequate position and force tracking proper- 54 ties. Thus, “adaptive controllers” are an interesting alterna- 55 tive for the design of high-performance robotic manipulator 56 controllers for teleoperation systems due to their capabilities 57 for adapting to changing operation conditions. Much effort 58 has been done in this direction (see, for instance, [7]–[11]). 59 However, adaptive controllers may lead to a poor closed-loop 60 response in terms of reference tracking deviations, [12]–[17], 61 especially when the slave is interacting with an environment. 62 This situation leads to a poor transparency in the teleopera- 63 tion system. In this paper, a multiestimation-based adaptive 64 controller architecture is proposed to improve the position 65 and force tracking properties of teleoperation systems with 66 force reflection. 67 Thus, the idea of multiestimation is linked in this paper 68 with that of teleoperation for nonlinear uncertain dynamical 69 systems, which have not been before addressed in a unified 70 way in the literature. In particular, the multiestimation scheme 71 improves the behavior of the adaptive impedance-based control 72 law when the slave robot interacts with the environment. In 73 this way, the master robot is able to track more accurately 74 the motion and force of the slave robot when interacting with 75 an environment while improving the transparency condition 76 in comparison with existing single-model-based adaptive con- 77 trollers. Thus, the multiestimation scheme is a powerful tool 78 to improve the contact process performance for interactions 79 ranging between smooth impacts, associated with relatively 80 soft environments, and nonsmooth impacts, associated with 81 environments with a large stiffness, with the stability be- 82 ing guaranteed. Hence, a framework where teleoperated ro- 83 bot controllers capacities are enhanced in comparison with 84 existing adaptive controllers described in the literature [2], 85 1083-4419/$20.00 © 2006 IEEE

Upload: independent

Post on 20-Nov-2023

0 views

Category:

Documents


0 download

TRANSCRIPT

IEEE

Proo

f

IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 36, NO. 6, DECEMBER 2006 1

Robustly Stable Adaptive Control of a Tandem ofMaster–Slave Robotic Manipulators With ForceReflection by Using a Multiestimation Scheme

1

2

3

Asier Ibeas and Manuel de la Sen4

Abstract—The problem of controlling a tandem of robotic ma-5nipulators composing a teleoperation system with force reflection6is addressed in this paper. The final objective of this paper is7twofold: 1) to design a robust control law capable of ensuring8closed-loop stability for robots with uncertainties and 2) to use the9so-obtained control law to improve the tracking of each robot to10its corresponding reference model in comparison with previously11existing controllers when the slave is interacting with the obstacle.12In this way, a multiestimation-based adaptive controller is pro-13posed. Thus, the master robot is able to follow more accurately14the constrained motion defined by the slave when interacting with15an obstacle than when a single-estimation-based controller is used,16improving the transparency property of the teleoperation scheme.17The closed-loop stability is guaranteed if a minimum residence18time, which might be updated online when unknown, between19different controller parameterizations is respected. Furthermore,20the analysis of the teleoperation and stability capabilities of the21overall scheme is carried out. Finally, some simulation examples22showing the working of the multiestimation scheme complete23this paper.24

Index Terms—Adaptive control, force reflection, multiple25model, stability, switching, teleoperation.26

I. INTRODUCTION27

T ELEOPERATION systems have been widely studied dur-28

ing the last decades because they can extend the human29

reach and manipulation capabilities to remote places and/or30

hazardous locations. Teleoperation strategies can be found in31

a broad variety of applications such as in space and undersea32

explorations, nuclear operations, or medical applications [1].33

Recently, as the development of the virtual reality is in34

progress, related application areas have been extended to the35

entertainment and training fields [2]. A teleoperation system36

consists of a human operator (which classifies this kind of37

systems as those named as “man in the loop systems” [3], [4]),38

a master manipulator, a remotely located slave manipulator,39

and an environment/obstacle situated in the slave manipulator’s40

Manuscript received May 10, 2005; revised November 8, 2005 and January18, 2006. This work was supported in part by the MEC and the UPV-EHU underProjects DPI 2003-00164 and 9/UPV 00I06.I06-15263/2003, respectively. Thework of A. Ibeas was supported by MECD under FPU Grant P2002-2727. Thispaper was recommended by Associate Editor M. S. De Queiroz.

The authors are with the Departamento de Ingeniería de Sistemas y Au-tomática, University of the Basque Country, Bilbao 48080, Spain (e-mail:[email protected]; [email protected]).AQ1

Digital Object Identifier 10.1109/TSMCB.2006.874693

location. The human operator applies his/her intentional force 41

to the master manipulator to make the slave to perform a series 42

of commands. If the slave manipulator exactly reproduces the 43

master’s motions and the master manipulator accurately reflects 44

to the master the measured slave force, then the operator should 45

experience the same sensation as the slave manipulator does. 46

In other words, the teleoperation system would be completely 47

“transparent” [5], [6]. In this way, the final objective of the 48

teleoperated system is to achieve a good transparency condition 49

while remaining stable. Hence, the control of a tandem of 50

master–slave manipulators with force reflection constitutes a 51

challenge for control theorist because it requires not only to 52

face the problem of controlling robot manipulators but also 53

to achieve an adequate position and force tracking proper- 54

ties. Thus, “adaptive controllers” are an interesting alterna- 55

tive for the design of high-performance robotic manipulator 56

controllers for teleoperation systems due to their capabilities 57

for adapting to changing operation conditions. Much effort 58

has been done in this direction (see, for instance, [7]–[11]). 59

However, adaptive controllers may lead to a poor closed-loop 60

response in terms of reference tracking deviations, [12]–[17], 61

especially when the slave is interacting with an environment. 62

This situation leads to a poor transparency in the teleopera- 63

tion system. In this paper, a multiestimation-based adaptive 64

controller architecture is proposed to improve the position 65

and force tracking properties of teleoperation systems with 66

force reflection. 67

Thus, the idea of multiestimation is linked in this paper 68

with that of teleoperation for nonlinear uncertain dynamical 69

systems, which have not been before addressed in a unified 70

way in the literature. In particular, the multiestimation scheme 71

improves the behavior of the adaptive impedance-based control 72

law when the slave robot interacts with the environment. In 73

this way, the master robot is able to track more accurately 74

the motion and force of the slave robot when interacting with 75

an environment while improving the transparency condition 76

in comparison with existing single-model-based adaptive con- 77

trollers. Thus, the multiestimation scheme is a powerful tool 78

to improve the contact process performance for interactions 79

ranging between smooth impacts, associated with relatively 80

soft environments, and nonsmooth impacts, associated with 81

environments with a large stiffness, with the stability be- 82

ing guaranteed. Hence, a framework where teleoperated ro- 83

bot controllers capacities are enhanced in comparison with 84

existing adaptive controllers described in the literature [2], 85

1083-4419/$20.00 © 2006 IEEE

IEEE

Proo

f

2 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 36, NO. 6, DECEMBER 2006

Fig. 1. Teleoperation system architecture.

[8], [10], [18]–[22] is proposed by taking advantage of the86

new design possibilities added by the switching and multiple-87

model-based techniques [13]–[15], [23], [24]. In the general88

problem setting, a number of estimation algorithms running89

in parallel are used, each one being initialized by different90

estimated parameter values. At each time instant, a higher91

level switching structure selects in real time the element from92

the convex hull of the whole set of estimated parameter vec-93

tors, which is then used to parameterize an adaptive interval94

controller. The values of the coefficients of such a convex95

linear combination are selected at each time instant based96

on the values of a set of performance indexes evaluating the97

identification performance of each estimator. In this way, the98

so selected estimated vector parameterizes an impedance-type99

adaptive control law. Furthermore, the proposed control law is100

“modular” in the sense that the stability properties of the101

scheme are maintained irrespective of the estimation algorithms102

used if they possess some basic asymptotic estimation proper-103

ties. This modularity is obtained by adding a novel correcting104

term to the standard feedback linearization impedance control105

law used in the literature [8], [10], [20], [21]. The stability of106

the teleoperation system is proved, and explicit conditions on107

the influence of the human operator in the stability properties108

are discussed.109

The paper is organized as follows. First, the problem descrip-110

tion and modeling issues are described in detail in Section II.111

Section III deals with the multiestimation scheme architecture.112

The design of the control law is addressed in Section IV.113

Section V discusses the stability properties of the overall114

scheme. In Section VI, the numerical examples of the teleoper-115

ation system are carried out. Finally, conclusions end the paper.116

II. PROBLEM DESCRIPTION AND MODELING ISSUES 117

A. Teleoperation System 118

The force–force teleoperation control scheme displayed in 119

Fig. 1 has been chosen to implement the teleoperation ar- 120

chitecture [5]. According to Fig. 1, the force made by the 121

operator fop is transmitted to the slave robot, and the force it 122

measures in its end effector is transmitted to the master one. 123

Thus, it is necessary to dispose a force sensor at the slave’s end 124

effector to transmit the interaction force with the environment 125

to the master robot. However, a model for the environment 126

and interacting force is considered in this paper for computer 127

simulation purposes. For each robot, a multiestimation-based 128

adaptive controller is synthesized. Each controller is parameter- 129

ized in an independent way according to its associate estimator 130

from the corresponding multiestimation scheme, and thus, they 131

can be tuned separately. In this paper, both robots will be of 132

revolute joints because the stability proof is devoted to this kind 133

of mechanical systems. 134

B. Plant Description 135

The scheme is composed of two robots, namely, master 136

and slave. The subscript = m, s will be used to denote the 137

master or slave manipulators, respectively. Thus, a general 138

robotic manipulator with n rigid degrees of freedom is char- 139

acterized by a set of n generalized coordinates collected in 140

the vector 141

qT = [q1 q2 · · · qn] .

IEEE

Proo

f

IBEAS AND DE LA SEN: ADAPTIVE CONTROL OF ROBOTIC MANIPULATORS WITH FORCE REFLECTION 3

Thus, the general dynamic equations of both the master and

AQ2

142

slave robotic manipulators are written in the joint space as143

M(q)q + C(q, q)q + F (q)+ G(q) + JT

(q)f ext = τ (1)

where M(q) ∈ Rn×n symmetric and positive definite is144

the so-called “inertia matrix,” and C(q, q)q ∈ Rn×1 is a145

vector containing the “Coriolis” and centripetal terms.146

Also, F (q) ∈ Rn×1 models the friction phenomena,147

G(q) ∈ Rn×1 is the gravity vector, and J ∈ R

n×r is the148

“Jacobian” of the manipulator, where r is the number of149

degrees of freedom of the manipulator in the task space. The150

external forces (and torques) are f ext ∈ Rr×1 (r ≤ 6), and151

τ ∈ Rn×1 is the joint torque vector. Note that it is possible152

to consider a different number of degree of freedoms in the153

master and in the slave manipulators. This allows the existence154

of teleoperation systems with redundancies in any master–slave155

robot. This possibility enhances the applications of telerrobotic156

systems allowing the avoidance of certain obstacles during the157

robot motion or adding robustness against actuator failures to158

the system (which is useful, for example, in space applications;159

see, for instance, [25], [26]). The general dynamic model (1)160

possesses a certain number of well-known structural properties,161

which can be found, for instance, in [10] and [20]. However, the162

following one, then used in the design of the adaptive control163

law, is formulated for convenience below (subscripts = m, s164

denoting each robot are omitted for the sake of simplicity).165

Property 1: The inertia matrix M and the vectors Cq and G166

depend linearly on a p′ × 1 vector P ′ of parameters containing167

the masses and inertias of the system [10], [20], so that (1)168

becomes169

M(q)q + C(q, q)q + F (q) + G(q) + JT (q)f ext

= W ′(q, q, q)P ′ + F (q) + JT (q)f ext = τ

where W ′(q, q, q) ∈ Rn×p′

is the “regressor” matrix. 170

C. Nominal Dynamical Model of the Robot171

Nevertheless, it is difficult to obtain an accurate model of a172

real robotic manipulator. This fact implies that the nominal dy-173

namic model of the robot, used for control law design purposes,174

is always inherently approximated, and thus, there always exists175

an “unmodeled dynamic component” in the closed-loop system176

due to the parametrical and same-order structural uncertainties.177

To illustrate the unmodeled dynamics effect, a nominal model178

for the real robot is considered in this section. In this way,179

the real robots are simulated by using (1), whereas the control180

law is generated by using the nominal model described below.181

Moreover, because we are designing a multiestimation-based182

controller, we will consider a set of models of the plant operat-183

ing in parallel, each described by the equation184

M(i)0 (q)q + C

(i)0 (q, q)q

+ F(i)0 (q) + G

(i)0 (q) + JT

(q)f ext

= W(q, q, q)P(i) + JT

(q)f ext = τ (2)

with i = 1, 2, . . . , Ne, Ne ≥ 1 being the number of different 185

plant models disposed for each robot, whereas the matrices 186

M(i)0 , C(i)

0 , F(i)0 , G

(i)0 stand for the nominal model descrip- 187

tion, W(q, q, q) ∈ Rn×p , and P

(i) ∈ R

p×1, where p is 188

the number of the unknown robotic parameters (which may be, 189

for example, masses or friction coefficients), is the nominal 190

parameter vector. It is supposed that the robot parameters for 191

the nominal model are unknown, and therefore, they will be 192

estimated leading to a set of estimated nominal parameter 193

vectors P(i)

. Note that the larger number of estimators we 194

dispose, the more the closed-loop response may be improved. 195

Thus, the practical number of estimators needed is determined 196

by the available outlay on hardware and the characteristics of 197

this. Also, in [27], a recent approach to the design of families of 198

estimation models is presented for multimodel controller design 199

from a theoretical point of view. However, a small number of 200

estimators can still achieve an important response improvement 201

as simulation examples in Section VI point out. 202

Furthermore, the dynamic equations of the robotic manip- 203

ulator (1) can be rewritten in terms of each reduced nominal 204

model (2) as 205[M

(i)0 (q) + Λ(i)

1 (q)]q + C

(i)0 (q, q)q + F

(i)0 (q)

+ G(i)0 (q) + JT

(q)f ext + Λ(i)2 (q, q)

= W · P (i) + JT

f ext︸ ︷︷ ︸nominal

+ Λ(i)1 (q)q + Λ(i)

2 (q, q)︸ ︷︷ ︸unmodelled

= τ (3)

where the error signals Λ(i)1 ,Λ(i)

2 of the real dynamics com- 206

pared with the nominal one are defined by 207

Λ(i)1 (q) =M(q)−M

(i)0 (q) (4.1)

Λ(i)2 (q, q) =C(q, q)q + F (q) + G(q)

− C(i)0 (q, q)q − F

(i)0 (q) − G

(i)0 (q).

(4.2)

Moreover, the following assumption concerning the dynam- 208

ics of the friction terms is made. 209

Assumption 1: There exist real constantsCF1, CF2, C(i)F01, 210

C(i)F02 ≥ 0, = m, s, such that the friction terms for both real 211

and nominal models satisfy 212

‖F (q)‖ ≤CF1‖q‖ + CF2‖q‖2∥∥∥F (i)0 (q)

∥∥∥ ≤C(i)F01‖q‖ + C

(i)F02‖q‖2

i = 1, 2, . . . , Ne.

213

Assumption 1 is feasible for most friction models used in 214

the literature, which include viscous, Coulomb, and Stribeck 215

effects [28], [29]. The following Property 2, then used in the 216

design of the estimation algorithm, can be directly stated using 217

standard robotic properties and Assumption 1. 218

IEEE

Proo

f

4 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 36, NO. 6, DECEMBER 2006

Property 2: There exist real constants χ(i)10, χ

(i)0 , χ

(i)1 ,219

χ(i)2 ≥ 0 for i = 1, 2, . . . , Ne such that Λ(i)

1 (q),Λ(1)2 (q, q)220

satisfy from (4.1) and (4.2), i.e.,221 ∥∥∥Λ(i)1 (q)

∥∥∥≤ Λ(i)1 =χ

(i)10∥∥∥Λ(i)

2 (q, q)∥∥∥≤ Λ(i)

2 (q) =χ(i)0 +χ(i)

1 ‖q‖+χ(i)2 ‖q‖2.

222

Remark 1: Note that a sole set of constants could be used for223

each robot defining, i.e.,224

(χ10, χ0, χ1, χ2) = max1≤i≤Ne

(i)10 , χ

(i)0 , χ

(i)1 , χ

(i)2

).

Moreover, the following assumption concerned with the pertur-225

bation signals is also made. 226

Assumption 2: A set of constants χ10, χ0, χ1, χ2 ≥ 0,227

= m, s, satisfying Property 2 is known. 228

Thus, according to Property 2 and Assumption 2, the unmod-229

eled dynamics is not supposed to be stable but only growing230

nonfaster that quadratically with ‖q‖ contrarily to the twin hy-231

pothesis in [16] and [30]. The knowledge of the upperbounds of232

the parametrical uncertainty signals (Assumption 2) is required233

to design a relative adaptation dead zone for the estimation al-234

gorithm to guarantee adequate estimation properties. However,235

this knowledge can be overcome by using large values for the236

constants involved or, even, including them in the estimation237

process as made in [31].238

D. Simulation Model for the Environment239

In this section, a model for the interaction force between the240

slave’s end effector and the obstacle is presented for simulation241

purposes. Thus, the interaction force between the robot and the242

obstacle will be decomposed as the superposition of two elastic243

forces. This is a feasible case to point out the stability properties244

because an elastic environment is one of the most critical cases245

dealt with for stabilization purposes [32], [33]. On the other246

hand, nonelastic interactions imply the dissipation of energy247

that contributes to stabilize the system.248

The environment modeling issue is summarized as follows.249

Assumption 3: The model for the normal force is given by250

f = fnn; n = ∇g/‖∇g‖; fn = Ken|∆x⊥| (5)

where Ken > 0 is the “normal stiffness” of the environment251

or interacting obstacle, |∆x⊥| is the normal penetration of the252

slave in the obstacle, and n is a unitary vector perpendicular to253

the object and outward-oriented [34]. Also, the tangential force254

is given by the “Salisbury” model [35], whose basic behavior is255

described in Fig. 2, where Ket > 0 is the “tangential stiffness”256

and t is a vector satisfying ‖t‖ = 1, t · n = 0, t · xs < 0 where257

x is the vector containing the position of the end effector of the258

manipulator, and xad is a vector called “adherence point.” 259

The above model includes two kinds of tangential interaction260

terms, namely, the “static,” µs, and the “dynamic,” µd, one [35]261

satisfying µs > µd > 0.262

Fig. 2. Simulation model for the tangential force.

E. Model for the Human Operator 263

Inasmuch as telemanipulation systems are considered as man 264

in the loop systems [3], [4], a model for the operator performing 265

the task must also be considered in the closed-loop defini- 266

tion (shown in the telemanipulation scheme, Fig. 1, the block 267

representing the operator). Several models for the operator 268

behavior in the control loop have been proposed in the literature 269

[5], [6], [36]–[39]. A typical model for the operator is given by 270

a second-order linear matrix differential equation formalized as 271

follows. 272

Assumption 4: The model for the operator is described by 273

Mopx+Bopx+Kopx = fman (6)

where the real positive definite matrices Mop, Bop,Kop define 274

the operator impedance, and fman is a force applied to the 275

extreme of the operator hand. 276

Remark 2: Different authors point out different values for 277

the inertia, damping, and elastic coefficients for the operator 278

model (see [5], [6], [36], [37], [39]–[41]). The values of the 279

operator impedance determine the allowable values for the 280

target matrices, which guarantee closed-loop stability according 281

to the stability theorem below. Thus, the use of different models 282

leads to different stability requirements. However, to design a 283

system that remains insensitive regardless of the operator, an 284

algorithm to online adjust the value of the target matrices is 285

then proposed in Section IV. 286

Nevertheless, one always has to be cautious about the ob- 287

tained results when a human operator is involved in the closed- 288

loop as Flemmer et al. have pointed out in [42] because the 289

second-order linear model is an approximation of the complex, 290

probably nonlinear, behavior of the user. To perform the simula- 291

tion examples with a simulated operator connected in the loop, 292

the Kazerooni and Moore data [37], (Mop = 4.54 kg, Bop = 293

6.83 N · s/m, Kop = 12.5 N/m) have been used to implement 294

the user. Hence, simulations describe a case study when a 295

certain human operator is manipulating the device despite the 296

fact that the system remain bounded for any operator due to the 297

implementation of the online updating algorithm for the target 298

matrices introduced then in Section IV. 299

F. Closed-Loop Robot Reference Model 300

The control algorithm is based on an impedance-type control. 301

Thus, under an external force f ext, we want the robot to behave 302

as the mass-damped reference system described by 303

Mt xd(t) +Bt xd(t) +Kt xd(t) = υf ext(t) (7)

IEEE

Proo

f

IBEAS AND DE LA SEN: ADAPTIVE CONTROL OF ROBOTIC MANIPULATORS WITH FORCE REFLECTION 5

where xd is a vector containing the end effector’s task space304

desired position as well as a parameterization for the orienta-305

tions for each robot, = m, s, and υ is a scaling factor that306

allows to resize the force transmitted between both manipula-307

tors. This property would be useful when both robots are of308

different kinds, sizes, or powers. In that case, the scaling of the309

corresponding workspaces allows to manage a robot of large310

size with another of reduced one and vice versa. The targeted311

matrices Mt = MTt > 0, Bt = BT

t ≥ 0, Kt = KTt ≥ 0 are312

chosen diagonal to decouple the robot dynamics in the task313

space. It is recommended in the literature to chose reference314

models for both robots with the same dynamic behavior, ex-315

cluding the value of the gain υ [43]. Thus, to then establish316

the stability theorem under this recommended situation, the317

following assumption concerning the reference model of the318

master and slave manipulator is made.319

Assumption 5: It is assumed that both the master and slave320

robot possess the same dynamic behavior, i.e.,321

Mtm = Mts = Mt; Btm = Bts = Bt; Ktm = Kts = Kt.

322

Furthermore, it is remarked in several papers ([43], [44], and323

references therein) that it is convenient to change the impedance324

reference model when the system contacts an obstacle. Thus,325

once the contact has been detected, according to a prescribed326

threshold for the measured force at the slave’s end effector,327

the reference impedance is modified along the directions of328

contact to maintain stability, whereas the same reference model329

is maintained for the rest of degrees of freedom [41], [45].330

The modification of the reference model should be gradual331

between both sets of parameters to reduce the effect of the332

switching transient. There is a diversity of opinion about what333

constitutes the best parameterization for a master–slave ref-334

erence model [46]–[48]. In general, the target impedance in335

free motion should be selected to minimize the fatigue of the336

operator handling the master. Thus, small values for the inertia337

and dumping matrices are the most appropriate option. On338

the other hand, the reference model in the contact directions339

should be selected to block the master and slave manipulators340

motion in the directions of contact. Hence, high values for341

the damping and matrix inertia are recommended. In this way,342

exhaustive simulation work has revealed that an increment of343

the value of the dumping matrix a factor of a thousand in the344

directions where a contact has been detected provides a good345

behavior serving this factor as a rule of thumb. Furthermore,346

Raju et al. [47] and Bejczy and Handylykken [46] report347

that there seem to be different best combinations of force-348

feedback gain υm for different tasks. A set of guidelines for349

selecting an adequate parameterization for the reference model350

(7) for different teleoperation tasks can be found in [5], [40],351

[45]–[47], and [61]–[64].352

III. PARALLEL MULTIESTIMATION SCHEME353

ARCHITECTURE AND UPDATING SUPERVISION354

RULES OF THE ACTIVE ESTIMATOR355

The architecture of the parallel multiestimation scheme is356

depicted in Fig. 3.357

Fig. 3. Multiestimation scheme architecture.

There exists Ne estimation algorithms running in parallel. 358

Each one is different from another in what is concerned with 359

the estimates parameter vector initialization and/or the kind 360

of estimation algorithm. Hence, a set of estimated parameter 361

vectors P(i)

k,, i = 1, 2, . . . , Ne is available at each sampling 362

instant k, whereas a continuous estimated parameter vector can 363

be defined by P(i)

(t) = P(i)

k,, t ∈ [kTs, (k + 1)Ts). At each 364

time interval, the controller is parameterized by a vector P k, 365

belonging to the convex hull of the estimated parameter vectors 366

of the parallel multiestimation scheme, i.e., 367

P k, ∈ Co

P(1)

k,, P(2)

k,, . . . , P(Ne)

k,

:=

P k, : P k,=

Ne∑i=1

α(i)k, P

(i)

k,,

Ne∑i=1

α(i)k, ≤1, α

(i)k, ≥0

.

(8)

Such a parameter vector should guarantee the closed-loop 368

stability and asymptotic reference tracking properties while 369

improving the transient contact closed-loop response of the 370

overall system compared with traditional single-model-based 371

adaptive control schemes. All the estimation algorithms are 372

chosen to be of the same type for both robots, initialized by 373

different estimated parameter vectors. 374

A. Estimation Algorithm 375

For the implementation of the parameter estimation, a dis- 376

crete algorithm will be used, although the problem is continu- 377

ous in its initial formulation. Denote the sampling period as Ts. 378

At each sampling time tk = kTs, the joint torque, joint posi- 379

tions, velocities, and accelerations are supposed to be available. 380

In particular, the measurable joint torques are the control input. 381

Furthermore, acceleration data should be obtained because real 382

values for the signals have to be included in the estimation 383

IEEE

Proo

f

6 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 36, NO. 6, DECEMBER 2006

algorithm. Due to the fact that acceleration measurements are384

not usually directly available from sensors in standard robot385

manipulators, a causal linear strictly stable filter (1/F(D)) in386

the time derivative D = d/dt (D0 = 1, Di+1 = DiD) will be387

used to avoid the problem of obtaining the higher order deriva-388

tives of the joint velocities. In Section V-C, an example of how389

the acceleration measurement is avoided during estimation is390

pointed out. In this way, only joint velocities and positions have391

to be measured. This assumption is not quite restrictive because392

they are usually available for measurement in typical robots. In393

this way, the filtered identification errors for each (master and394

slave) robot are defined as E(i)τf = (1/F(D))E(i)

τ , with E(i)τ395

defined as396

E(i)τ =τ −τ

(i) =WP

(i)

+Λ(i)2 +Λ(i)

1 q=WP(i)

+η(i)

(9)

where η(i) defines the unmodeled dynamic contribution as-397

sociated with the ith model, and τ(i) is the estimated joint398

torque vector given via (1) by399

M(i)0 (q)q + C

(i)0 (q, q)q

+ F(i)

0 (q) + G(i)

0 (q) + JT (q)f ext = τ

(i)0 (10)

and τ is the control torque for the robot defined by the control400

law described completely in the sequel. Also, the filtered signal401

η(i)f = (1/F(D))η(i)

represents an upperbound of the contri-402

bution of the parametrical uncertainties η(i) according to (4.1)403

and (4.2) and Assumption 2, i.e.,404 ∥∥∥η(i)

∥∥∥ =∥∥∥Λ(i)

2 + Λ(i)1 q

∥∥∥ ≤ Λ(i)2 + Λ(i)

1 ‖q‖

=χ(i)0 + χ

(i)1 ‖q‖+χ

(i)2 ‖q‖2+χ(i)

10 ‖q‖ = η(i) .

(11)

Let ak = ‖WkfΓkΓk+1ΓkWTkf‖ and δ

(i)k = ϑ

(i)k +405

‖WkfΓ(i)k W

Tkf‖ with Wkf = (1/F(D))Wk, so that the406

P(i)k estimate vector is given at each sampling time by407

(12.1)–(12.3), shown at the bottom of the page.408

Initial values for P(i)

0, and Γ(i)0 = Γ(i)T

0 > 0 are arbitrary 409

and bounded, for some real design constants µ(i) > 1, where 410

ξ(i)k, ∈ [ε(i)

, 1], ∀k ≥ 0, is the forgetting factor of the esti- 411

mation algorithm with ε(i) > 0, s(i)

k, ∈ [0, 1] and i = 1, 2, 412

. . . , Ne. The free-design scalar parameter ϑ(i)k ∈ (0,∞) acts 413

as a gain modifying the adaptation rate. The inclusion of a 414

relative adaptation dead zone in the estimation algorithm acts 415

freezing the adaptation process when the identification error 416

lies below a prescribed bound related to the unmodeled dy- 417

namics upperbound amplitude allowing to guarantee adequate 418

estimation properties [49]. Each matrix Γ(i)k must be reset to its 419

original positive definite value Γ(i)0 when its minimum eigen- 420

value lies down a prescribed positive threshold to avoid the con- 421

vergence of Γ(i)k to a positive semidefinite matrix. Moreover, 422

it is possible to use different estimation algorithms for each 423

estimation model. For instance, the use of Landau–Lozano-type 424

estimation algorithms [49], which possess a larger number of 425

free-design parameters, is recommended to get more control 426

by the designer on the converge rate and properties of the 427

estimates. However, in this paper, we have used the same 428

estimation algorithm for all the estimators to point out the 429

relevance of the switching process to improve the closed-loop 430

response. The estimation algorithm possesses the following 431

properties. 432

Lemma 1: The estimation algorithm defined by 433

(12.1)–(12.3) has the following properties: 434

(i) ‖P (i)

k ‖ <∞, ∀k ≥ 0, provided that ‖P (i)

0 ‖ <∞; 435

(ii) limN→∞∑N

k=1(s(i)k ‖E(i)

τkf‖2)/(ϑ(i)k + ‖WkfΓ

(i)k × 436

WTkf‖) <∞; 437

(iii) limk→∞(s(i)k ‖E(i)

τkf‖2)/(ϑ(i)k + ‖WkfΓ

(i)k W

Tkf‖) = 438

0; 439

(iv) limN→∞∑N

k=1(s(i)2k ‖E(i)

τkf‖2)/(ϑ(i)k + ‖Wkf × 440

Γ(i)k W

Tkf‖) <∞; 441

(v) limN→∞∑N

k=η ‖P(i)k − P

(i)k−β,‖2 <∞, for any finite 442

β > 0; 443

(vi) limk→∞ ‖P (i)k − P

(i)k−β,‖ = 0, for any finite β > 0. 444

The proof can be carried out following [49]. Basically, 445

Lemma 1 states that the estimation algorithm for each estimated 446

P(i)

k+1, = P(i)

k, +s

(i)k,Γ

(i)k,W

TkfE

(i)τkf

ϑ(i)k, +

∥∥∥WkfΓ(i)k,W

Tkf

∥∥∥ ; P(i) (t) = P

(i)k, , ∀t ∈ [kTs, (k + 1)Ts) (12.1)

Γ(i)k+1, =

1

ξ(i)k,

(Γ(i)k, − s

(i)k,

Γ(i)k,

WTkf

Wkf Γ(i)k,

ϑ(i)k,

+∥∥Wkf Γ

(i)k,

WTkf

∥∥)

s(i)k, = 0

Γ(i)k, s

(i)k, = 0

Γ(i) (t) = Γ(i)

k, , ∀t ∈ [kTs, (k + 1)Ts) (12.2)

s(i)k, =

0 if∥∥∥E(i)

τkf

∥∥∥ ≤ µ(i)k,

√(ξ

(i)k, δ

(i)k, + a

(i)k

)/ξ

(i)k, δ

(i)k, η

(i)kf

1 − µ(i)

√(ξ

(i)k, δ

(i)k, + a

(i)k,

)/ξ

(i)k, δ

(i)k,

η(i)kf∥∥E(i)τkf

∥∥ otherwise.

(12.3)

IEEE

Proo

f

IBEAS AND DE LA SEN: ADAPTIVE CONTROL OF ROBOTIC MANIPULATORS WITH FORCE REFLECTION 7

parameter vector possesses standard estimation properties [49],447

ensuring the boundedness of the estimated parameter values for448

all time and their convergence to a finite limit.449

Remark 3: The above properties also hold for an estimated450

parameter vector belonging to the convex hull of the estimated451

parameter vectors, P k, ∈ CoP (1)

k,, P(2)

k,, . . . , P(Ne)

k, , with452

constant coefficients as can be concluded from the proof of the453

single estimation algorithm properties [49]. 454

B. Identification Performance Index455

The aim of the subsequent performance index with forgetting456

factor λ ∈ (0, 1] is to quantify the goodness of each estimation457

algorithm at each time instant and to govern the switching rule458

in between the various estimators within the multiestimation459

scheme, i.e.,460

J(i)w (k) =

k∑ζ=k−NC

λk−ζ E(i)TτζfQζE

(i)τζf (13)

where Nc > 0 is the sliding constant horizon size of the super-461

visory index (13), = m, s, and tς = ςTs and Qζ = QTζ ≥ 0462

are weighting matrices that allow the design of different ad hoc463

weights for each component of the identification error through464

time within each sliding-time optimization horizon window.465

Note that the identification performance index (13) is a measure466

of the long-term accuracy of each identification algorithm. The467

switching rule for the adaptive controller reparameterization468

is obtained from the performance index (13) by one of the469

following proposed methods.470

C. Selection of the Parameter Vector Used to Parameterize471

the Adaptive Controller472

In this section, three different switching policies between473

elements of (8) are proposed.474

Switching Rule 1 (Convex Combination of Estimates): The475

α(i)k, coefficients from the convex linear combination (8) are476

modified according to the flowchart for Switching Rule 1. Each477

robot is running the above algorithm updating the weighting478

functions at discrete times, which translates into continuous-479

time weighting functions α(i) (t) = α

(i)k,, t ∈ [kTs, (k + 1)Ts).480

The supervisory rule works as follows: if the elapsed time481

from the last previous α updating is smaller than the “residence482

time” NDTs (with the residence number of samples being483

ND), then the parameterization of the adaptive controller484

is kept unchanged, and hence, the same values for the α485

coefficients as before are used. Otherwise, a new controller486

reparameterization is given according to the algorithm above487

based on the identification performance index (13). The resi-488

dence time is subject to a minimum lowerbound constraint to489

guarantee closed-loop stability, which is a typical requirement490

in switching systems [13], [15], [16]. Note that the adaptive491

controller reparameterization only takes place at certain time492

instants, whereas all the estimators in the parallel multiestima-493

tion scheme are running at all time.494

Switching Rule 2 (Selection of a Single Estimate Vector):495

The switching rule chooses online one single estimator from the496

parallel multiestimation scheme that leads to the best closed- 497

loop behavior according to the minimization of (13). This is a 498

particular case of the convex combination of estimates when 499

one of the coefficients is unity and the remaining ones are 500

zero. In this way, at each sampling time tk = kTs, there exists 501

an estimator ck such that α(i)ck,

= 1, whereas the remaining 502

coefficients are zero. Notice that this is a particular case of 503

Switching Rule 1. Thus, Switching Rule 2 is formally expressed 504

as the flowchart for Switching Rule 2. 505

This algorithm runs for both robots simultaneously and in an 506

independent way. Likewise, a minimum residence time between 507

consecutive controller reparameterizations has to be respected 508

to guarantee closed-loop stability. However, the said minimum 509

residence time may be unknown in advance despite its existence 510

being ensured in [59]. A possible way to select the residence 511

time is to tentatively initialize it with a small value to correct 512

any potential erroneous choice of the initial values of the 513

estimates as fast as possible. 514

However, the use of a very small value for the residence 515

time might lead to instability [15], [16]. Hence, an algorithm 516

to online tune the value of the residence time is proposed in 517

Section V to be calculated by the system online through its 518

operation if the selected initial value does not fit the stability 519

requirements. 520

Furthermore, the subsequent Switching Rule 3 supplies an 521

online valid residence time without requiring a priori knowl- 522

edge on the parameterizations of the master–slave tandem. 523

IEEE

Proo

f

8 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 36, NO. 6, DECEMBER 2006

Switching Rule 3 (Residence Time Autotuning): Define524

H(t) = [ eT eT ]M[ eT eT ]T , where M = MT > 0 is a525

prefixed design matrix. The switching rule is described by the526

flowchart for Switching Rule 3.527

The above algorithm is running for = m, s (i.e., both528

master and slave). It states that onceH has decreased a value of529

γ > 0 from the last controller reparameterization (i.e., it satis-530

fied H(t+lastSwitching + TD) ≤ H(t−lastSwitching) − γ with531

TD > 0), then the system is allowed to change the controller532

parameterization within the time interval [TD, t] in which533

H(t+lastSwitching+ t) ≤ H(t−lastSwitching

)− γ/2.534

IV. BASIC ROBOT CONTROL ALGORITHM535

The following control law for the master and the slave robots536

is proposed:537

τf =(1/F(D))τ

τ =1

1 − ρ

[M0J

−1

(xd + kv(xd − x)

+ kp(xd − x)− Jq

)+ V0 + F0 + G0 + JT

fext − ρτ

](14)

ρ(t) = 1 − C2

‖e(t)‖α + C2(15)

where 1/F(D) is a strictly stable linear filter, and538

M0, C0, F 0, G0 are the estimated nominal model dynamical539

matrices selected from the convex hull (7) of the parallel540

multiestimation scheme according to any Switching Rule (i.e.,541

1, 2, or 3) to parameterize the adaptive controller (14). J is 542

the manipulator Jacobian with kp = kTp > 0, kv = kTv > 0, 543

e = x − xd, α ≥ 2, and C2 > 0, and τ is the estimated 544

joint torque for the selected active estimator (9). In [50], and 545

references therein, some methods to avoid singularity proximity 546

during the robot operation and, thus, noninvertibility of the 547

Jacobian are discussed. Note that the control law (14) can be 548

equivalently rewritten in an implicit form as 549

τ =F(D)τf

= M0J−1

(xd + kv(xd − x) + kp(xd − x)

− Jq + ρJM−10 (τ − τ)

)+ C0q + F0 + G0 + JT

fext

which can be interpreted as an acceleration resolved control law 550

[8] with a correcting factor (ρJM−10 (τ − τ)) [59]. 551

This correcting factor allows to separate the estimation al- 552

gorithm properties from the closed-loop stability results in 553

comparison with classical techniques [8]. In these ones, the esti- 554

mation algorithm structure is derived from Lyapunov’s stability 555

theorem, so that the inconvenient contribution of positive terms 556

to the Lyapunov function time derivative is cancelled by the 557

updating algorithm structure. Fig. 4 displays the scheme for the 558

IEEE

Proo

f

IBEAS AND DE LA SEN: ADAPTIVE CONTROL OF ROBOTIC MANIPULATORS WITH FORCE REFLECTION 9

Fig. 4. Schematic representation of the basic robot control law.

proposed basic control law for each robot. As Fig. 4 shows,559

the feedback linearization loop is not able to cancel exactly the560

nonlinear robot dynamics due to the existence of unmodeled561

dynamics component. By applying the control law (14) and562

(15) to the robotic manipulator described by (1), we obtain the563

following closed-loop equation:564

1/F(D)(e + kve + kpe − (ρ − 1)JM−1

0

×(W(q, q, q)P + Λ2 + Λ1q

))= v(t) (16)

where P = P − P is the parametrical error vector, v(t)565

is the initial condition response of the filter 1/F(D) and566

vanishes exponentially because the filter is exponentially sta-567

ble because it is linear and strictly stable, and e = x − xd568

for = m, s. To ensure the nonsingularity of M0 during569

parameter adaptation, it may be necessary to restrict the al-570

lowable variation range of the unknown parameters by using571

projection over an appropriate set of parameter values when572

necessary [20].573

V. STABILITY PROPERTIES OF THE ADAPTIVE SCHEME574

A. Main Stability Result575

To establish the main stability result, the subsequent assump-576

tion is made.577

Assumption 6: The trajectory in the joint space q(t) satisfies578

det J(q(t)) = 0,∀t ≥ 0. 579

This assumption means that the Jacobian remains nonsingu-580

lar during the manipulator movement provided that for initial581

conditions det J(q(0)) = 0. The following theorem ensures the582

stability of the overall teleoperated system described generi-583

cally in Fig. 1.584Theorem 1 (Closed-Loop Stability): Consider a teleopera-585

tion system whose master and slave dynamics are described586

by (1) subject to Assumptions 1 and 2. Assume also that 587

the adaptive controller is given by (14) and (15), satisfying 588

the basic estimation algorithm Lemma 1(i)–(iii) properties and 589

Assumptions 3–6. Then, the following items hold: 590

(i) The single estimation scheme (14) and (15) for the tele- 591

operated system is robustly stable provided that either 592

Mt > υmsMop, Bt ≥ υmsBop

and Kt +R

[Ken 00 Ket

]≥ υmsKop (17.1)

or 593

Mt < υmsMop, Bt ≤ υmsBop

and Kt +R

[Ken 00 Ket

]≤ υmsKop (17.2)

hold, where Mop, Bop, Kop denotes the above defined 594

operator model, υms = υm/υs is the ratio of the ref- 595

erence model gains. and R denotes the rotation matrix 596

from the (t, n) axis to the base where the vector xs is 597

expressed. As a result, all the signals in the closed-loop 598

system are bounded for all t ≥ 0. 599

(ii) Assume that the target matrices satisfy (17). Then, the 600

current multiestimation-based closed-loop system is ro- 601

bustly stable under any Switching Rule (1, 2, or 3), 602

provided that a minimum positive residence time TD 603

between successive changes of the parameterization of 604

the adaptive controller is respected. 605

(iii) If there are no uncertainties in the system, [i.e., η = 0 in 606

(9)–(11)] with the designer’s knowledge, then the closed- 607

loop system is asymptotically stable, provided that 608

Theorem 1[(i) and (ii)] conditions hold. Thus, all the 609

tracking errors of the system tend to be zero for both the 610

single and the multiestimation schemes. 611

IEEE

Proo

f

10 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 36, NO. 6, DECEMBER 2006

The proof of this theorem is given in Appendix. Note that612

because the inverse of the Jacobian is used in (14), the stability613

is not global and has to be restricted to joint space trajectories614

avoiding Jacobian singularities (Assumption 6). This is a gen-615

eral drawback in robotic control strategies involving the inverse616

of the Jacobian. Nevertheless, there is extensive work in the lit-617

erature dealing with this problem while implementing strategies618

to avoid Jacobian singularities (see [50] and references therein).619

Thus, these results ensure the applicability of the proposed620

control law to a large variety of manipulators in a wide variety621

of working operations just incorporating any of the techniques622

proposed in the literature to the control algorithm. Also, the623

presence of an unmodeled dynamics component in the closed-624

loop prevents the system from possessing asymptotic stability.625

In the case when the designer knows that this unmodeled626

component is not present, the asymptotic stability can be stated627

according to Theorem 1(iii).628

Furthermore, Theorem 1 emphasizes the influence of the629

human operator behavior, which is crucial to guarantee the630

stability of the scheme. Also, the ratio υm/υs acts gaining631

the coupling between both robots. Note that the stability of632

the multiestimation scheme is subject to a minimum residence633

time between switchings to different controller parameteri-634

zations. This is a typical requirement in switching systems635

[13]–[16], [23], [51] being either adaptive or composed of636

fixed models. Moreover, note that the presented control law637

is modular in the sense that the closed-loop stability holds638

irrespectively of the estimation algorithm used while giving639

to a bounded estimated parameters through time (see [53] and640

references therein). In the subsequent section, a brief outline of641

the stability proof structure is given.642

B. Guidelines for the Proof of Theorem 1643

The proof of Theorem 1 is carried out through the following644

steps.645

1) First, asymptotic properties of the basic estimation algo-646

rithm are stated in Lemma 1. In particular, boundedness647

of the estimated parameter vector (for bounded initial648

conditions) and prediction error square summability are649

established. This property is needed to ensure the asymp-650

totic tracking properties of the schemes in the absence of651

uncertainties in the system [53].652

2) Then, the tracking errors for each robot e = x −653

xd, e, e, = m, s, are proved to be bounded when654

the single-model-based adaptive control scheme is used655

considering the fact that the estimated parameter vec-656

tors are bounded via Lemma 1. This result is stated657

in Lemma A1.1.658

3) Thus, to prove scheme’s signal boundedness, the refer-659

ence model for each robot is detailed with the inter-660

dependence between them clearly pointed out. Starting661

with these equations, boundedness of all signals is finally662

proved in the Theorem 1(i) proof using the boundedness663

of the tracking errors.664

4) Hence, the proof of Theorem 1(ii) is presented based665

on the single estimation scheme. With the basic results666

of stability of the coupled adaptive scheme, the proof is667

extended to the multiestimation scheme using the same 668

steps as those pointed out in [59], which highlights the 669

modularity and structure of the proof. 670

5) Finally, the (nominal) case when no uncertainties are 671

present is considered in the same way as in [59]. Asymp- 672

totic converge of the tracking errors to zero is then proved, 673

completing the Theorem 1 proof. In this way, Theorem 1 674

is fully proved. 675

Remark 4: The stability analysis can be extended to more 676

complex environments than that considered in Section II-D. In 677

this way, the contact model might be given by the more general 678

interaction force common in models like [52], [54], [55] 679

fcontact = kstxs + kvelxs + kacxs

with kst, kvel, kac being positive semidefinite matrices. In 680

this case, the stability of the system is also guaranteed (see 681

Remark 6 in Appendix). Furthermore, nonlinear models lin- 682

early upperbounded as ‖fcontact‖ ≤ kst‖xs‖+ kvel‖xs‖ with 683

kst, kvel ≥ 0 as proposed, for instance, in [56], also guarantee 684

the stability (see the extension of the stability proof for this kind 685

of environments in Appendix). 686

C. Tuning Process for the Residence Time and Target Matrices 687

Nevertheless, the residence time for Switching Rules 1 and 688

2 and target matrices satisfying conditions (17) are especially 689

difficult to set. Also, the residence time for Switching Rules 1 690

and 2 must be initialized while being larger than the minimum 691

residence time, which guarantees stability. Thus, a procedure to 692

overcome these difficulties is described above. 693

IEEE

Proo

f

IBEAS AND DE LA SEN: ADAPTIVE CONTROL OF ROBOTIC MANIPULATORS WITH FORCE REFLECTION 11

Fig. 5. Robot manipulator.

The algorithm works as follows: Starting with initial values694

for the matrices and the residence time, if the norm of the695

master’s state vector exceeds a prescribed threshold, then we696

consider that the system is unstable, and we proceed to modify697

the residence time and target matrices. The modification is698

applied sequentially. First, the residence time is increased by699

a small quantity. If the master’s state vector still lies above700

the threshold, then at the next step, we would increase the701

value of the target inertia matrix. In the case that the system702

was still unstable, at the next step, the target dumping would703

be the variable increased until the norm of the mater’s state704

reduced below the given threshold. Note that a positive time705

interval between modifications Tnorm TD is stated to allow706

the algorithm enough time to appreciate the effect of modifi-707

cations in the system behavior. Moreover, the algorithm stops708

in finite time because finite large values for the target matrices709

and the residence time stabilize the system from Theorem 1(i)710

and (ii). The modification values σ1, σ2, σ3, and σ4 are usually711

expressed as a small percentage (1%–5%) of the initial values712

of the variables.713

Remark 5: Inasmuch as Switching Rule 3 provides online714

an appropriate value for the residence time, this parameter can715

be removed from the above algorithm when using Switching716

Rule 3. 717

VI. ANALYSIS OF THE TELEOPERATION SYSTEM718

PERFORMANCE THROUGH SIMULATION719

A. Analysis of the Closed-Loop System Under the Parallel720

Multiestimation Scheme721

1) Real and Reduced-Order Nominal Robot Models: The722

following specific two-link robotic manipulator shown in Fig. 5723

is used in the computer simulations to simulate the real robot.724

In fact, this will be the model for both the master and the slave725

manipulators.726

Inasmuch as the same robot model is used for both manipula-727

tors, and in order not to make the notation too cumbersome, the728

subscript = m, s is omitted. Thus, the generalized coordinates 729

qT = [ q1 q2 ] are chosen to be the angles defined in Fig. 5. A 730

mathematical dynamical model of the robot can be obtained by 731

using the above assumptions. 732

Assumptions 7.1–7.4: 733

(7.1) The elements have masses m1,m2, and the center of 734

mass of each element is placed at a length of lc1, lc2 from 735

the previous revolute joint. 736

(7.2) Inertia tensors are diagonal for both elements. Further- 737

more, the inertia tensor is defined by a unique element 738

for both arms, I1 for the first element and I2 for the 739

second one. 740

(7.3) The robot base is fixed to the ground. 741

(7.4) There is no gravitational effect because the manipulator 742

is assumed to operate in a horizontal plane. 743

By using Assumption 5, the following dynamic matrices 744

[(18.1)–(18.3), shown at the bottom of the next page] are 745

obtained for the real robot where s1 = sin(q1), c1 = cos(q1), 746

and so on is the typical notation in robotics [8], [10], [20]. 747

Also, a detailed friction model suitable for industrial controller 748

design [57] is introduced in (18.3) for each revolute joint. This 749

model is just considered to simulate the real robot behavior in 750

the computer. The reduced order nominal model, denoted by 751

M0(q), and (C0(q, q)q), is obtained, making lc1 = l1, lc2 = l2, 752

and I1 = I2 = 0 in (18.1)–(18.3). 753

The above conditions allow to consider a simpler nominal 754

dynamical model for designing the control law. This reduced 755

nominal model is obtained at the expense of the presence an 756

unmodeled dynamic component in the closed loop. Also, note 757

that a simplified model for the friction terms has been stated in 758

(19), where νi, ki are the friction coefficients, i.e., 759

F0(q)T = [ ν1q1 + k1sign(q1) ν2q2 + k2sign(q2) ] . (19)

Thus, this nominal model has been obtained considering a set 760

of mechanical assumptions on the robot manipulator. Hence, 761

different assumptions may lead to different nominal models. 762

Moreover, (1) can be represented in a linear parameter form, 763

for estimation purposes, as 764

Wf · P +(JTf ext

)f

= τ f (20)

PT

= [m1 m2 ν1 ν2 k1 k2 ] (21)

Wf =1

F (D)W

=

l21Dq1

F (D)W12F (D)

q1F (D) 0 sign(q1)

F (D) 0

0 W22F (D) 0 q2

F (D) 0 sign(q2)F (D)

(22)W12 = l22(Dq1 +Dq2) + l1l2c2(2Dq1 +Dq2)

+ l21Dq1 − l1l2s2q22 − 2l1l2s2q1q2

W22 = l1l2c2Dq1 + l22(Dq1 +Dq2) + l1l2s2q21

where F (D) has degree larger than unity to make Dqi/F (D) 765

causal, whereas Wf is implemented in (10) for estimation 766

purposes. The signal v(t), associated to the initial condition 767

response of the strictly stable filter F (D), vanishes exponen- 768

tially, and it can be neglected in the estimation process at the 769

IEEE

Proo

f

12 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 36, NO. 6, DECEMBER 2006

expense of worsening of the estimated values. Notice that no770

acceleration measurement is needed in the estimation algorithm771

because the acceleration value is obtained in (22) through a772

causal filter via the operator D/F (D), which appears in the773

filtered version of (22) (Wf ). In this way, the estimation process774

can be performed just measuring joint positions and velocities.775

2) Numerical Example:776

Robotic tandem: It is composed of two robots that are777

of the same type as that described in Section VI. The para-778

meters are as follows: l1 = l2 = 1 m, lc1m = lc2m = 0.4 m,779

lc1s = lc2s = 0.5 m; m1m = 5 kg, m2m = 3.5 kg, m1s =780

10 kg, m2s = 17.5 kg; and I1m = 0.8 kg ·m2, I1s = 4 kg ·781

m2, I2m = 1.1 kg ·m2, I2s = 3.6 kg ·m2; whereas the rest of782

the parameters are [57] α01 = 35, α11 = 1.1, α21 = 0.9, β11 =783

50, β22 = 65, α01 = 38, α11 = 1, α21 = 0.95, β11 = 55, and784

β22 = 60.785

Multiestimation scheme: It is formed by four estimators786

running in parallel. These estimators are initialized by787

P(1)T

0m = [ 25 15 4 5 8 10 ]

P(2)T

0m = [ 23 11 2.5 3.5 18 16 ]

P(3)T

0m = [ 12 6.5 −0.5 2 27 25 ]

P(4)T

0m = [ 7 4 −2 1 35 32 ]

P(1)T

0s = [ 1 700 3.9 0.001 0.001 1.7 ]

P(2)T

02 = [ 3 535 2.5 0.05 0.01 1.22 ]

P(3)T

02 = [ 6 235 1.5 0.2 0.05 0.72 ]

P(4)T

02 = [ 9.9 130 0.5 0.5 0.1 0.2 ].

The conventional adaptive scheme composed of a single es-788

timation algorithm used for comparison purposes is initialized789

by P(1)

0 . The parameters of the estimation algorithm are ξ(i)k =790

0.95, ∀k ≥ 0, Γ(i)0 = 104I6×6, i = 1, . . . , 4, and Qζ = I2×2;791

whereas the Γ(i)k matrix is reset when its minimum eigenvalue792

lies down 0.001, µ(i) = 1.2, χ(i)

01 = 0.5, χ(i)0 = 0.5, χ(i)

1 =793

1.5, χ(i)2 = 1.5, and ϑ(i)

k = ϑ(i) = 1.794

Control law parameters: The control matrices are795

kp = 750I2×2, kv = 150I2×2, whereas ND = 4 samples,796

Ts = 0.001 s, and α = 2 in (15). The filter is F (s) = 106/(s+797

103)2. It has been chosen with a large bandwidth to make the798

estimation well-behaved when interacting with the obstacle,799

and some signals are fast varying.800

Fig. 6. Task space motion for both robots. Normal stiffness: Ken = 105 N/m.

Environment: The obstacle will be a vertical wall situated 801

at xobs = 1.2 m with an infinity height. Ktn = 100 N/m and 802

µs = 0.25, µd = 0.2. 803

Reference model: The reference model is Vdi(s)/ 804

Fexti(s) = 1/(s+ 5) for i = x, y in the case of free motion, 805

whereas in the constrained motion case, the reference model 806

changes to Vdi(s)/Fexti(s) = 1/(s+ 1000) in the direction in 807

which contact is detected. 808

Figs. 6–8 display the trajectory in the workspace described 809

by the master and the slave robots for different values for the 810

normal stiffness of the environment. From these figures, it can 811

be concluded that when a contact occurs, the multiestimation 812

scheme allows to improve the tracking performance of the 813

master robot tracking the slave’s robot trajectory as compared 814

with single-model-based adaptive schemes. Furthermore, the 815

proposed architecture reflects with more fidelity the sensation 816

the slave manipulator is experiencing, improving the trans- 817

parency of the teleoperation system. Simulation examples show 818

the usefulness of the scheme for environments ranging be- 819

tween a relatively low value and a hundred million. Figs. 9–11 820

display the active controller for the master robot under the 821

multiestimation scheme through time. This is the most relevant 822

controller because switching in the master robot controller is 823

responsible for the tracking improvement. Notice that the use 824

of a switching-based control law allows to improve the overall 825

closed-loop performance, due to the switching process within 826

the nominal multiestimation scheme when the slave bumps 827

into the wall. Fig. 8 shows the evolution of the estimated 828

M(q) =

[m1l

2c1 +m2

(2l1lc2c2 + l21 + l2c2

)+ I1 + I2 l2c2m2 + l1lc2m2c2 + I2

l2c2m2 + l1lc2m2c2 + I2 l2c2m2 + I2

](18.1)

(C(q, q)q)T = [−m2l1lc2s2q22 − 2m2l1lc2q1q2 m2l1lc2s2q

21 ] (18.2)

Fi(qi) =[α0,i + α1,ie

−β1,i|qi| + α2,i

(1− e−β2,i|qi|

)]sign(qi), i = 1, 2 (18.3)

IEEE

Proo

f

IBEAS AND DE LA SEN: ADAPTIVE CONTROL OF ROBOTIC MANIPULATORS WITH FORCE REFLECTION 13

Fig. 7. Task space motion for both robots. Normal stiffness: Ken = 106 N/m.

Fig. 8. Evolution of the estimates for the slave.

parameters of the slave through time. The evolution of the829

estimates is altered when the slave contacts an obstacle, making830

the values of the parameters adapt to their new configuration.831

Moreover, it can be concluded from Fig. 12, for the evolution832

of the estimated parameters of the master, that their values also833

change although the master is not interacting with anything.834

The change in the master estimated parameters appears due835

to the coupling between both robots through the force–force836

scheme according to the Fig. 1 diagram. Also, Fig. 13 displays837

the force tracking property of the system. From Fig. 13, it838

can be concluded that the multiestimation scheme with the839

switching process is able to improve the force tracking response840

when the contact occurs, contributing to the improvevent of the841

transparency of the scheme. Moreover, Figs. 14 and 15 show842

the force reference tracking achieved by the multiestimation843

scheme. The scheme is seen to be able to track the desired force844

reference despite the uncertainty in the robot models. Note that845

because the slave manipulator is heavier that the master one, its846

Fig. 9. Switching map for the master robot.

Fig. 10. Switching map for the master manipulator.

response is slower due to its larger inertia. This fact explains 847

why the slave manipulator penetrates into the wall more than 848

the master in Figs. 6, 7, and 16. 849

VII. CONCLUSION 850

In this paper, a robust adaptive multiestimation-based scheme 851

has been designed for robotic manipulators with uncertainties, 852

composing a teleoperation system with force reflection. The 853

controller of each robot is composed of a bank of estimation 854

algorithms running in parallel and a switching logic that selects 855

an element from the convex hull of the estimated parameter 856

vectors to parameterize an adaptive controller. The multiesti- 857

mation architecture is capable of improving the master–slave 858

tracking properties when the latter interacts with an elas- 859

tic obstacle in comparison with classical single-model-based 860

adaptive controllers. Furthermore, the force tracking response 861

is also improved when using the multiestimation scheme. This 862

IEEE

Proo

f

14 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 36, NO. 6, DECEMBER 2006

Fig. 11. Switching map for the master manipulator.

Fig. 12. Evolution of the estimated parameters for the master.

implies that the contact process can be transmitted to the863

operator handling the master manipulator with higher fidelity,864

improving the transparency of the teleoperation scheme. More-865

over, the stability of the overall closed-loop scheme is proved866

as well. Finally, simulation examples showing the operation,867

properties, and usefulness of the proposed scheme are showed868

and discussed for different values of the environment stiffness.869

APPENDIX870

PROOF OF THE CLOSED-LOOP STABILITY871

Proof of Theorem 1: First, the following result then used872

to prove Theorem 1(i) is established.873

Lemma A1.1 (Boundedness of e): The tracking error e =874

x − xd for the single-estimation-based scheme is bounded875

for all time, subject to the condition of Theorem 1(i). (In876

the following, the fixed subscript is omitted for the sake of877

simplicity when no confusion is expected.)878

Fig. 13. Force tracking during the interaction. Normal stiffness: Ken =105 N/m.

Fig. 14. Comparison between the reference interaction force and the oneapplied by the slave. Normal stiffness: Ken = 106 N/m.

Proof of Lemma A1.1: Consider L(e, e) = 1/2[eT kpe + 879

‖e‖2] being positive definite because kp = kTp > 0. Its time 880

derivative is from (16), i.e., 881

L = − eT kv e+ (ρ− 1)eTJM−10

×[W (q, q, q)P + Λ2 + Λ1q

]+ w(t)

≤ − eT kv e+C ′

‖e‖α + C2‖e‖

×∥∥∥W (q, q, q)P + Λ2 + Λ1q

∥∥∥+ w(t) (23)

where w(t) = F (D)v(t) → 0 exponentially. Thus, it will be 882

zeroed in the sequel without loss of generality for stability 883

analysis purposes. C ′ = C2‖JM−10 ‖ is a finite constant 884

because the Jacobian is bounded from q being bounded for 885

revolute manipulators and the fact that the estimate vector is 886

IEEE

Proo

f

IBEAS AND DE LA SEN: ADAPTIVE CONTROL OF ROBOTIC MANIPULATORS WITH FORCE REFLECTION 15

Fig. 15. Reference interaction force and the one applied by the slave.

Fig. 16. Task space motion for both robots. Normal stiffness: Ken =10, 000 N/m.

bounded from estimation algorithm property (i). Furthermore,887

there exist real constants µ1, µ2, µ3 ≥ 0 such that888

‖W (q, q, q)P + Λ2 + Λ1q‖ ≤ µ1 + µ2‖e‖ + µ3‖e‖2 from889

Assumption 2, (16), and the fact that the joint angles are890

bounded. Thus891

L ≤ −eT kv e+C ′

‖e‖α + C2

(µ1‖e‖ + µ2‖e‖2 + µ3‖e‖3

).

If α ≥ 2 in (15), then the positive term diverges at most at the892

rate of ‖e‖ as ‖e‖ diverges, whereas the negative one decreases893

accordingly to ‖e‖2. Then, L(e, e) is bounded at all time,894

and the closed-loop system exhibits global Lyapunov stability895

with ultimate boundedness because for large values of ‖e‖,896

L(e, e) < 0, and thus, L decreases, whereas for small values897

for ‖e‖, L(e, e) > 0, and L increases. Inasmuch as) L(e, e)898

is bounded for all time and for any bounded initial conditions899

e(0), e(0), it can be concluded that e and e are bounded for900

all time. Furthermore, the boundedness of L ensures that there 901

is no finite escape time in (16). Inasmuch as the above result 902

holds for both robots, one explicitly has that em, em,es, es are 903

bounded. Now, rewrite (16) as 904

e =(ρI+(1−ρ)JM−1

0 MJ−1)−1

×[−kv e−kpe+(ρ−1)JM−1

0

(W (q, q, qd)P +Λ2

)]. (24)

Note that the inverse of the matrix (ρI + (1 − ρ)JM−10 MJ−1) 905

in (24) always exists because it is the sum of two positive 906

definite matrices, the estimated matrix M0 is nonsingular for 907

all time, and the robot manipulator avoids singularities; there- 908

fore, the Jacobian is invertible from Assumption 4. Also, e is 909

bounded because the right-hand side of the above equation is 910

a continuous function, which only depends on e, e, which are 911

bounded, and the fact that e is always bounded in a rotary mo- 912

tion. Hence, one has that em, em, em,es, es, es are bounded, 913

and the proof is completed. 914

Proof of Theorem 1(i): The subsequent coupled 915

master–slave coupled equations follow from (5), i.e., 916

Mtmxdm+Btmxdm+Ktmxdm

=υm

(Mopxm+Bopxm+Kopxm−R

[Ken 00 Ket

]xs

)(25)

and the equivalent for the slave where it has been supposed that 917

xobs = 0 and xad = 0 without loss of generality for stability 918

analysis purposes. Moreover, R is the rotation matrix, which 919

transforms the (t, n) base to the reference of the position x. 920

Subtracting both equations, one gets 921

Mtmxdm +Btmxdm +Ktmxdm

=υmυs

(Mtsxds +Btsxds +Ktsxds)

= υms(Mtsxds +Btsxds +Ktsxds).

From Assumption 5, the above equation becomes 922

Mt(xdm − υmsxds) +Bt(xdm − υmsxds)

+Kt(xdm − υmsxds) = 0 (26)

with Mt = MTt > 0, Bt = BT

t ≥ 0, and Kt = KTt ≥ 0, 923

which implies that (xdm − υmsxds), (xdm − υmsxds), and 924

(xdm − υmsxds) are bounded, so that xdm = υmsxds + ξ1, 925

xdm = υmsxds + ξ2, and xdm = υmsxds + ξ3 with ξ1, ξ2, ξ3 926

bounded. Also, because e = x − xd, e = x − xd, and 927

e = x − xd, = m, s, one obtains the following equation 928

from (25): 929

(Mt − υmsMop)xds + (Bt − υmsBop)xds

+(Kt − υmsKop +R

[Ken 00 Ket

])xds

= Mop(em + ξ3) +Bop(em + ξ2)

+Kop(em + ξ1)−R

[Ken 00 Ket

]es. (27)

IEEE

Proo

f

16 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 36, NO. 6, DECEMBER 2006

Inasmuch as the right-hand side of (27) is bounded from930

Lemma A1.1, (27) leads to bounded xds, xds, xds, provided that931

either932

Mt > υmsMop, Bt ≥ υmsBop, Kt +R

[Ken 00 Ket

]≥ υmsKop

or933

Mt < υmsMop, Bt ≤ υmsBop, Kt +R

[Ken 00 Ket

]≤ υmsKop

hold. Thus, xs, xs, xs are bounded as well because ‖xs‖ =934

‖es + xds‖ ≤ ‖es‖+ ‖xds‖ <∞ and so are the rest of vari-935

ables. Now, because xdm = xds + ξ1, xdm = xds + ξ2 and936

xdm = xds + ξ3, it can be concluded that xdm, xdm, xdm are937

bounded for all time, and hence, xm, xm, xm are bounded as938

well. Then, one has that q, x, q, x, q, x are bounded because939

the Jacobian is bounded for revolute manipulators and for940

both robots. Moreover, the estimated joint torque (16) and the941

control law (14) and (15) are bounded as well. Thus, all system942

variables remain bounded for all time, and item (i) is proved. 943

Remark 6 (Extension to Other Environments): According to944

Remark 4, the above stability proof can be extended to other945

linear environments. Thus, consider environments modeled by946

fcontact = kstxs + kvelxs + kacxs

with kst, kvel, kac being positive semidefinite matrices. This947

generalization allows to consider dissipative environments. The948

stability proof is identical until the end step in which (26) would949

read as950

(Mt − υmsMop + kac)xds + (Bt − υmsBop + kvel)xds

+ (Kt − υmsKop + kst)xds

= Mop(kacem + ξ3) +Bop(kvelem + ξ2)

+Kop(em + ξ1) − kstes

which is asymptotically stable provided that the following951

conditions (analogous to those stated in Theorem 1) hold:952

(Mt − υmsMop + kac) > 0

(Bt − υmsBop + kvel) ≥ 0

(Kt − υmsKop + kst) ≥ 0.

Note that the model for the environment is intended to be953

linear because the closed-loop target response has been selected954

to be linear. Furthermore, note that using the algorithm for955

adjusting the target matrices, the stability of the closed loop956

can be guaranteed due to the increase of the target matrix value.957

Consider a nonlinear environmental model subject to958

‖fcontact‖ ≤ kst‖xs‖ + kvel‖xs‖

(which might include a dumping term proportional to velocity)959

where kst, kvel ≥ 0 are now scalars. To extend the stability960

proof to this case, the above upperbounding can be rewritten 961

for a convenient arrangement of the constants kst, kvel as 962

‖fcontact‖ ≤ k1‖xext‖+@ (28)

with k1 ≥ 0, xText = [xds xds ] being an extended state space 963

vector, and @ ≥ 0 is the remaining bounded signal, which 964

appears due to the substitution of xdm = xds + ξ1 and so on 965

when defining xext. Then, (26) can be rewritten as 966

xext =[

0 I−M−1

ts Kts M−1ts Bts

]xext +

[0Ω

]with 967

Ω=

[ 0M−1

ts Mop(em+υmsxds+ξ3)+M−1ts Bop(em+υmsxds+ξ2)

+M−1ts Kop(em+υmsxds+ξ1)+M−1

ts fcontact

]

where M−1ts always exist because Mts is a positive definite 968

matrix. The formal solution of this equation is given by 969

xext = e

[0 I

−M−1ts Kts M−1

ts Bts

]t

xext(0)

+

t∫0

e

[0 I

−M−1ts Kts M−1

ts Bts

](t−τ)

Ω dτ

which can be upperbounded as 970

‖xext‖ = Ce−λt‖xext(0)‖

+

t∫0

Ce−λ(t−u) (σ1 + σ2 ‖xext‖) du (29)

because

∥∥∥∥exp[

0 I−M−1

ts Kts M−1ts Bts

]t

∥∥∥∥ ≤ Ce−λt for some 971

constants σ1, σ2 ≥ 0, C ≥ 1, and λ ≥ 0 because the reference 972

model is strictly stable [58]. Then, applying Gronwall lemma, 973

[38] to (29), it follows that xext is bounded, and thus, xds is 974

bounded as well. 975

Proof of Theorem 1(ii) and (iii): Having proved the sta- 976

bility of the single adaptive teleoperation scheme, the stability 977

of the multiestimation-based setup is carried out following the 978

steps of [59]. 979

ACKNOWLEDGMENT 980

The authors would like to thank the reviewers for their 981

helpful comments, which have helped to improve this paper. 982

REFERENCES 983

[1] H. Kazerooni, T. I. Tsay, and K. Hollerbach, “A controller design frame- 984work for telerobotic systems,” IEEE Trans. Robot. Autom., vol. 1, no. 1, 985pp. 50–62, Mar. 1993. 986

[2] C. C. Cheah, S. Kawamura, and S. Arimoto, “Stability of hybrid position 987and force control for robotic manipulator with kinematics and dynamics 988uncertainties,” Automatica, vol. 39, no. 5, pp. 847–855, May 2003. 989

[3] P. J. Berkelman, P. Cinquin, J. Troccaz, J. M. Ayoubi, C. Létoublon, and 990F. Bochard, “A compact, compliant laparoscopic endoscope manipulator,” 991in Proc. Int. Conf. Robot. Autom., Washington, DC, 2002, pp. 1870–1875. 992

IEEE

Proo

f

IBEAS AND DE LA SEN: ADAPTIVE CONTROL OF ROBOTIC MANIPULATORS WITH FORCE REFLECTION 17

[4] W. S. Kim, B. Hannaford, and A. K. Bejczy, “Force-reflection and shared993compliant control in operating telemanipulators with time delay,” IEEE994Trans. Robot. Autom., vol. 8, no. 2, pp. 176–185, Apr. 1992.995

[5] K. Hashtrudi-Zaad and S. E. Salcudean, “Analysis of control architectures996for teleoperation systems with impedance/admittance master and slave997manipulators,” Int. J. Rob. Res., vol. 20, no. 6, pp. 419–445, 2001.998

[6] D. A. Lawrence, “Stability and transparency in bilateral teleoperation,”999IEEE Trans. Robot. Autom., vol. 9, no. 5, pp. 624–637, Oct. 1993.1000

[7] M. A. Arteaga, “Robot control and parameter estimation with only joint1001position measurements,” Automatica, vol. 39, no. 1, pp. 67–73, Jan. 2003.1002

[8] C. Canudas de Wit, B. Siciliano, and G. Bastin, Theory of Robot Control.1003New York: Springer-Verlag, 1996.1004

[9] C. Y. Lee and J. J. Lee, “Adaptive control for uncertain nonlinear systems1005based on multiple neural networks,” IEEE Trans. Syst., Man, Cybern. B,1006Cybern., vol. 34, no. 1, pp. 325–333, Feb. 2004.1007

[10] L. Sciavicco and B. Siciliano, Modelling and Control of Robot Manipula-1008tors. London, U.K.: Springer-Verlag, 2000.1009

[11] C. H. Wang, T. C. Lin, T. T. Lee, and H. L. Liu, “Adaptive hybrid1010intelligent control for uncertain nonlinear dynamical systems,” IEEE1011Trans. Syst., Man, Cybern. B, Cybern., vol. 32, no. 5, pp. 583–597,1012Oct. 2002.1013

[12] M. A. Arteaga and Y. Tang, “Adaptive control of robots with an improved1014transient performance,” IEEE Trans. Autom. Control, vol. 47, no. 7,1015pp. 1198–1202, Jul. 2002.1016

[13] A. Ibeas, M. de la Sen, and S. Alonso-Quesada, “Stable multi-estimation1017model for single-input single-output discrete adaptive control systems,”1018Int. J. Syst. Sci., vol. 35, no. 8, pp. 479–501, Jul. 2004.1019

[14] A. Ibeas and M. de la Sen, “A mutiestimation based scheme for robustly1020stable adaptive control of robotic manipulators,” in Int. Conf. Control1021Appl., Taipei, Taiwan, R.O.C., 2004, pp. 365–370.1022

[15] K. S. Narendra and J. Balakrishnan, “Adaptive control using multiple1023models,” IEEE Trans. Autom. Control, vol. 42, no. 2, pp. 171–187,1024Feb. 1997.1025

[16] ——, “Improving transient response of adaptive control systems using1026multiple models and switching,” IEEE Trans. Autom. Control, vol. 39,1027no. 9, pp. 1861–1866, Sep. 1994.1028

[17] W. Zhu and S. E. Salcudean, “Teleoperation with adaptive motion/force1029control,” in Proc. IEEE Int. Conf. Robot. Autom., 1999, pp. 231–237.1030

[18] O. Barambones and V. Etxebarria, “Robust neural control for robotic1031manipulators,” Automatica, vol. 38, no. 2, pp. 235–242, 2002.1032

[19] R. Burkan and I. Uzmay, “Variable upper bounding approach for adaptive-1033robust control in robot control,” J. Intell. Robot. Syst., vol. 37, no. 4,1034pp. 427–442, Aug. 2003.1035

[20] J. J. Craig, Adaptive Control of Mechanical Manipulators. Reading,1036MA: Addison-Wesley, 1988.1037

[21] H. G. Sage, M. F. Mathelin, and E. Ostertag, “Robust control of robot1038manipulators: A survey,” Int. J. Control, vol. 72, no. 16, pp. 1498–1522,1039Nov. 1999.1040

[22] M. Zhihong, M. O’Day, and X. Yu, “A robust adaptive terminal slid-1041ing mode control for rigid robotic manipulators,” J. Intell. Robot. Syst.,1042vol. 24, no. 1, pp. 23–41, Jan. 1999.1043

[23] Z. Sun and S. S. Ge, “Analysis and synthesis of switched linear control1044systems,” Automatica, vol. 41, no. 2, pp. 181–195, 2005.1045

[24] D. M. Wolpert and M. Kawato, “Multiple paired forward and inverse1046models for motor control,” Neural Netw., vol. 11, no. 7/8, pp. 1317–1329,1047Oct. 1998.1048

[25] D. Y. Hwang and B. Hannaford, “Teleoperation with a kinematically1049redundant slave robot,” Int. J. Rob. Res., vol. 17, no. 6, pp. 579–597,1050Jun. 1998.1051

[26] K. Ohishi and H. Nozawa, “Robust motion control with the consideration1052algorithm of joint torque saturation for a redundant manipulator,” Adv.1053Robot., vol. 16, no. 4, pp. 345–359, 2002.1054

[27] X. R. Li, Z. Zhao, and X.-B. Li, “General model-set design methods for1055multiple-model approach,” IEEE Trans. Autom. Control, vol. 50, no. 9,1056pp. 1260–1276, Sep. 2005.1057

[28] P. Dumont, B. Armstrong, and V. Hayward, “Elasto-plastic model:1058Contact compliance and stiction,” in Proc. IEEE Amer. Control Conf.,1059Chicago, IL, 2000, pp. 1072–1077.1060

[29] P. Dumont, V. Hayward, B. Armstrong, and F. Altpeter, “Single state1061elastoplastic friction models,” IEEE Trans. Autom. Control, vol. 47, no. 5,1062pp. 787–792, May 2002.1063

[30] R. H. Middleton, G. C. Goodwin, D. J. Hill, and D. Q. Mayne, “Design1064issues in adaptive control,” IEEE Trans. Autom. Control, vol. 33, no. 1,1065pp. 50–58, Jan. 1988.1066

[31] G. Feng, “Analysis of a new algorithm for continuous-time robust adaptive1067control,” IEEE Trans. Autom. Control, vol. 44, no. 9, pp. 1764–1768,1068Sep. 1999.1069

[32] M. Indri and A. Tornambè, “Control of under-actuated mechanical sys- 1070tems subject to smooth impacts,” in Proc. 43rd IEEE Conf. Decision 1071Control, Paradise Island, Bahamas, 2004, pp. 1228–1233. 1072

[33] A. Tornambè, “Modeling and control of impact in mechanical systems: 1073Theory and experimental results,” IEEE Trans. Autom. Control, vol. 44, 1074no. 2, pp. 294–309, Feb. 1999. 1075

[34] M. P. do Carmo, Differential Geometry of Curves and Surfaces. Engle- 1076wood Cliffs, NJ: Prentice-Hall, 2000. 1077

[35] K. Salisbury, D. Brock, T. Massie, N. Swarup, and C. Zilles, “Haptic 1078rendering: Programming touch interaction with virtual objects,” in Proc. 1079Symp. Interactive 3D Graph., Monterey, CA, 1995, pp. 123–130. 1080

[36] N. Hogan, “Controlling impedance at a man/machine interface,” in Proc. 1081IEEE Int. Conf. Robot. Autom., Scottsdale, AZ, 1989, pp. 1626–1631. 1082

[37] H. Kazerooni and C. L. Moore, “An approach to telerobotic mani- 1083pulations,” Trans. ASME, J. Dyn. Syst. Meas. Control, vol. 119, no. 3, 1084pp. 431–438, 1997. 1085

[38] H. K. Khalil, Nonlinear Systems, 3rd ed. Englewood Cliffs, NJ: Prentice- 1086Hall, 2002. 1087

[39] H. Lee and M. J. Chung, “Adaptive controller of a master/slave system 1088for transparent teleoperation,” J. Robot. Syst., vol. 15, no. 8, pp. 465–475, 1089Aug. 1998. 1090

[40] R. W. Daniel and P. R. McAree, “Fundamental limits of performance 1091for force reflecting teleoperation,” Int. J. Rob. Res., vol. 17, no. 8, pp. 811– 1092830, Aug. 1998. 1093

[41] A. Rubio, Ph.D. dissertation (in Spanish), 1999. San Sebastián. 1094[42] H. Flemmer, V. Eriksson, and J. Wikander, “Control design and 1095

stability analysis of a surgical teleoperator,” Mechatronics, vol. 9, no. 7, 1096pp. 843–866, Oct. 1999. 1097

[43] J. H. Ryu and D. S. Kwon, “A novel adaptive bilateral control scheme 1098using similar closed-loop dynamic characteristics of master/slave manip- 1099ulators,” J. Robot. Syst., vol. 18, no. 9, pp. 533–543, 2001. 1100

[44] K. Hashtrudi-Zaad and S. E. Salcudean, “Adaptive transparent impedance 1101reflecting teleoperation,” in Proc. IEEE Int. Conf. Robot. Autom., 1996, 1102pp. 1369–1374. 1103

[45] E. Sánchez, Ph.D. dissertation (in Spanish), 2002. San Sebastián. 1104[46] A. K. Bejczy and M. Handylykken, “Experimental results with six- 1105

degree-of-freedom force-reflecting hand controller,” in Proc. 7th Annu. 1106Conf. Manual Control, 1981, pp. 465–477. 1107

[47] G. J. Raju, G. Verghese, and T. B. Sheridan, “Design issues in 2-port 1108network models of bilateral remote manipulation,” in Proc. IEEE Int. 1109Conf. Robot. Autom., 1989, pp. 1316–1321. 1110

[48] Teleoperation and Robotics in Space, S. B. Skaar and C. F. Ruoff, Eds. 1111Washington, DC: AIAA Press, 1994. 1112

[49] G. C. Goodwin and K. S. Sin, Adaptive Filtering and Control. Engle- 1113wood Cliffs, NJ: Prentice-Hall, 1990. 1114

[50] E. Sánchez, A. Rubio, and A. Avello, “An intuitive force feedback to avoid 1115singularity proximity and workspace boundaries in bilateral controlled 1116systems based on virtual springs,” in Proc. IEEE Int. Conf. Intell. Robots 1117Syst., 2002, vol. 2, pp. 1302–1307. 1118

[51] Z. Sun, “A robust stabilizing law for switched linear systems,” Int. J. 1119Control, vol. 77, no. 4, pp. 389–398, Mar. 2004. 1120

[52] J. M. Brown and J. E. Colgate, “Physics based approach to haptic display,” 1121in Proc. ISMRC, Top. Workshop Virtual Reality, Los Alamitos, CA, 1994, 1122pp. 101–106. 1123

[53] M. S. de Queiroz, D. M. Dawson, S. P. Nagarkatti, and 1124F. Zhang, Lyapunov-Based Control of Mechanical Systems. Boston, 1125MA: Birkhäuser-Verlag, 2000. 1126

[54] P. Huynh, Y. Nakamura, T. Arai, T. Tanikawa, and N. Koyachi, “Symmet- 1127ric damping bilateral control of parallel manipulators,” in Proc. 8th Int. 1128Conf. Adv. Robot., Monterey, CA, 1997, pp. 401–406. 1129

[55] S. E. Salcudean and T. D. Vlaar, “On the emulation of stiff walls and static 1130friction with a magnetically levitated input/output device,” in Proc. ASME 1131Dyn. Syst. Control Conf., Chicago, IL, 1994, vol. 1, pp. 303–309. 1132

[56] D. W. Marhefka and D. E. Orin, “A compliant contact model with 1133nonlinear damping for simulation of robotic systems,” IEEE Trans. 1134Syst., Man, Cybern. A, Syst., Humans, vol. 29, no. 6, pp. 566–572, 1135Nov. 1999. 1136

[57] C. Canudas de Wit, P. Noel, A. Aubin, and B. Brogliato, “Adaptive friction 1137compensation in robot manipulators: Low velocities,” Int. J. Rob. Res., 1138vol. 10, no. 3, pp. 189–199, Jun. 1991. 1139

[58] R. A. Horn and C. R. Johnson, Matrix Theory. Cambridge, U.K.: 1140Cambridge Univ. Press, 1994. 1141

[59] A. Ibeas and M. de la Sen, “A robustly stable multiestimation based 1142adaptive control scheme for robotic manipulators,” Trans. ASME, J. Dyn. 1143Syst. Meas. Control. (in press). 1144

[60] R. V. Patel and F. Shadpey, Control of Redundant Robot Manipulators, 1145Theory and Experiments. New York: Springer-Verlag, 2005. 1146

IEEE

Proo

f

18 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 36, NO. 6, DECEMBER 2006

[61] J. Vertut and P. Coiffet, Robot Technology: Volume 3A: Teleoperation and1147Robotics: Evolution and Development. Englewood Cliffs, NJ: Prentice-1148Hall, 1986.1149

[62] L. J. Love and W. J. Book, “Force reflecting teleoperation with adaptive1150impedance control,” IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 34,1151no. 1, pp. 159–165, Feb. 2004.1152

[63] Y. Yokokohji and T. Yoshikawa, “Bilateral control of master/slave ma-1153nipulators for ideal kinesthetic coupling: Formulation and experiment,”1154IEEE Trans. Robot. Autom., vol. 10, no. 5, pp. 605–620, Oct. 1994.1155

[64] J. Roy and L. L. Whitcomb, “Adaptive force control of position/velocity1156controlled robots: Theory and experiment,” IEEE Trans. Robot. Autom.,1157vol. 18, no. 2, pp. 121–137, Apr. 2002.1158

Asier Ibeas was born in Bilbao, Spain, on July11597, 1977. He received the M.Sc. degree in applied1160physics and the Ph.D. degree in automatic control1161from the University of the Basque Country, Bilbao,1162Spain, in 2000 and 2006, respectively.1163

He was recently appointed Associate Professor of1164electrical engineering at the Autonomous University1165of Barcelona, Barcelona, Spain. His research inter-1166ests include time-delayed systems, robust adaptive1167control, and applications of artificial intelligence to1168control systems design.1169

Manuel de la Sen was born in Arrigorriaga, Bizkaia, 1170Basque Country, Spain. He received the M.Sc. de- 1171gree (Honors) in applied physics (electronics and 1172automation) and the Ph.D. degree (High Honors) in 1173applied physics from the University of the Basque 1174Country, Bilbao, Spain, in 1975 and 1979, respec- 1175tively, and the Docteur-d´Etat-ès-Sciences Physiques 1176(specialité Automatique et Traitement du Signal) 1177(mention très honorable) from the University of 1178Grenoble, Grenoble, France, in 1987. 1179

He held several teaching and research positions in 1180the University of the Basque Country, where he is currently a Professor of 1181systems engineering and automatic control in the Department of Electricity 1182and Electronics and the Head of the Institute of Research and Development 1183of Processes IIDP. He was also a Visiting Professor in the University of 1184Grenoble, France; the University of Newcastle, New South Wales, Australia; 1185and the Australian National University, Canberra, Australia. He was the author 1186or coauthor of a number of papers in the fields of expert and adaptive systems, 1187mathematical systems theory, discrete and time-delay systems, and ordinary 1188differential equations, which are his research interests. 1189

Dr. de la Sen was a member of the Editorial Board of the Electrosoft 1190Journal (CML Mechanical and Computational Engineering Publications) and 1191is currently a ember of the Editorial Board of the journal Nonlinear Analysis, 1192Modelling and Control. He acts and has acted as a Reviewer for several 1193international journals and conferences of control theory and engineering and 1194applied mathematics. 1195

IEEE

Proo

f

AUTHOR QUERIES

AUTHOR PLEASE ANSWER ALL QUERIES

AQ1 = The affiliation address does not match that in the author’s biography. Please check.AQ2 = Running head for approval.AQ3 = Please provide the dissertation title, publisher name and location in Ref. [41].AQ4 = Please provide the dissertation title, publisher name and location in Ref. [45].Notes: 1) The figures were renumbered. The original sequence was 1-2-3-4-5-9-12-14-7-10-13-16-8-11-

15-6. Please check if correct.2) Two different reference items were numbered as [53]. References were renumbered.3) Reference [59] was not cited in the body of the text.3) Figures 3 and 6–16 contain pixelated text and lines.3) Figures 6–16 were processed as grayscale/B&W.

END OF ALL QUERIES