learning and autonomous control

42
14 January 2011 Challenge the future Delft University of Technology Learning and Autonomous Control Jens Kober Cognitive Robotics Delft University of Technology The Netherlands [email protected] www.jenskober.de

Upload: others

Post on 06-Dec-2021

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Learning and Autonomous Control

14 January 2011

Challenge the future

DelftUniversity ofTechnology

Learning and Autonomous ControlJens Kober

Cognitive RoboticsDelft University of TechnologyThe Netherlands

[email protected] www.jenskober.de

Page 2: Learning and Autonomous Control

Cognitive Robotics Department

• Robot Dynamics (dynamic motion control, motor control)

• Intelligent Vehicles (perception and modeling, dynamics,

human factors)

• Human-Robot Interaction (physical human-robot interaction)

• Learning and Autonomous Control (intelligent control,

cognition)

Page 3: Learning and Autonomous Control

Robot Dynamics

Page 4: Learning and Autonomous Control

Intelligent

Vehicles

Page 5: Learning and Autonomous Control

Human-Robot Interaction

Page 6: Learning and Autonomous Control

Robert Babuska

professor

learning and adaptive control

Jens Kober

assistant professor

robotics, learning motor skills

Javier Alonso Mora

assistant professor

robotics, motion planning

Learning and Autonomous Control

Page 7: Learning and Autonomous Control

youtube/ABCUPM

Page 8: Learning and Autonomous Control

Growing Interest in Robotics

Page 9: Learning and Autonomous Control
Page 10: Learning and Autonomous Control
Page 11: Learning and Autonomous Control
Page 12: Learning and Autonomous Control

Challenges

Uncertainty due to environment

Robustness

ComplianceLearning from mistakes

Safety

Autonomy

Power efficiency

Situation awareness

Interaction with humans

Page 13: Learning and Autonomous Control

Future Robots

• Need to operate under unforeseen circumstances

• Interact with humans in an intelligent way

• Learn from mistakes, improve over time (performance, energy)

• Impossible to preprogram all tasks or behaviors

• Economic point of view: short design time and low costs

Learning and adaptation is essential!

Page 14: Learning and Autonomous Control

Main Research Topics

• Adaptive and learning control systems

• Reinforcement learning, optimal control

• Nonlinear adaptive control

• Deep learning for robotics

• Data-effective learning algorithms

• Sensor fusion

• Supervisory and distributed control design

• Motion planning, coordination

• Multi-agent systems

Emphasis on novel, generic, widely applicable techniques

Realistic / real-world case studies and applications

Page 15: Learning and Autonomous Control

Autonomous Multi-Robots Lab

Autonomous navigationPlanning in dynamic environments

AI + ControlConstrained optimization tools, combined with machine learning, formal methods and consensus

Multi-robot systems&

Diverse application fieldsTransportation, Manipulation, Inspection, Ground & aerial vehicles

Page 16: Learning and Autonomous Control

Cooperative mobile manipulation

GOALGOAL

Safe motionin dynamic

environments

Robot dynamics

Environment obstacles

Formation definition

Constrained optimization

Multiple robotsOptimize formation parameters

within convex region in position-time free space

orSingle robot

Non-linearmodel predictive control

J.Alonso-Mora, et al. “Multi-robot formation control and object transport in dynamic environments via constrained optimization”, IJRR, 2017

Page 17: Learning and Autonomous Control

Online planning for aerial vehicles

Safe motion

Robot dynamics

Environment obstacles

Constrained optimization

Objective

Video/Cinematography

Formation control

T. Naegeli, et al., “Trajectory Optimization for Multi-view Aerial Cinematography”, ACM Siggraph, 2017; Videography, RA-L 2017J. Alonso-Mora et al., “Distributed formation control in dynamic environments”, ICRA 2017

Page 18: Learning and Autonomous Control

Motion planning with interaction

W. Schwarting et al., “Safe Nonlinear Trajectory Generation for Parallel Autonomy With a Dynamic Vehicle Model”, T-ITS 2017L. Ferranti et al., “Coordination of Multiple Vessels Via Distributed Nonlinear Model Predictive Control”, ECC 2018B. Zhou et al., ” Joint Multi-Policy Behavior Estimation and Receding-Horizon Trajectory Planning for Automated Urban Driving”, ICRA 2018

Information from other participants

Constrained optimization

Environment models

Safe motion among robots &

humans

Vehicle model

CommunicationDistributed

asynchronous

EstimationProbabilistic models &

chance constraints

or

Page 19: Learning and Autonomous Control

Intelligent Control

Systems and control

model-based design

Artificial intelligence, machine learning

learn a task from experience, without being explicitly programmed for that task

Intelligent control

Page 20: Learning and Autonomous Control

Potential of Machine Learning

• adapt to changes in environment

• find new, better solutions

• teach by demonstration or imitation

in addition:

• modeling is tedious, time consuming, expensive

Page 21: Learning and Autonomous Control

Initial Optimism

• Herbert A. Simon

Artificial Intelligence Pioneer:

“Machines will be capable in 20 years of doing any work a man can do” (1965)

• MIT: Cog Project (1993-2003)

Learning like human children - through constant

interaction with humans and the environment

nobelprize.org

MIT Press

Page 22: Learning and Autonomous Control

Spectrum of Learning Techniques

Unsupervised learning

(without ‘teacher’)

Reinforcement learning

(learn directly to control)

Supervised learning

(with ‘teacher’)

Page 23: Learning and Autonomous Control

Construct a Robot Model from Data

actuate motors and observe the response

Two ways to model the system:

1. Physical modeling

2. System identification

Maarten Vaandrager

Page 24: Learning and Autonomous Control

Imitation Learning

Page 25: Learning and Autonomous Control

Manschitz, Kober, Gienger, & Peters, RAS 2015

Page 26: Learning and Autonomous Control

Spectrum of Learning Techniques

Unsupervised learning

(without ‘teacher’)

Reinforcement learning

(learn directly to control)

Supervised learning

(with ‘teacher’)

Page 27: Learning and Autonomous Control

Unsupervised Learning - Clustering

• Discover automatically groups and structures in data

Applications:

• Robot perception, vision

• Data-driven construction of dynamic models

Page 28: Learning and Autonomous Control

Construction of Nonlinear Models

Nonlinear and uncertain behavior

Experimental data

Page 29: Learning and Autonomous Control

Spectrum of Learning Techniques

Unsupervised learning

(without ‘teacher’)

Reinforcement learning

(learn directly to control)

Supervised learning

(with ‘teacher’)

Page 30: Learning and Autonomous Control

Reinforcement Learning

Goal:

Adapt the control strategy so that the sum of rewards over time is maximal.

Goals

Performanceevaluation

reward

state

Adaptive

controller System

action

Inspiration - animal learning (reward desired behavior)

Page 31: Learning and Autonomous Control

Reinforcement Learning for Control

• Approximation in high-dimensional continuous spaces

• Computationally effective methods

• Constrained learning and convergence

Page 32: Learning and Autonomous Control

Reward-Weighted Imitation

• Maximize reward = optimize strategy

• One possibility: reward-weighted imitation

act

ion

state

strategy

success (high reward) failure (low reward)reward

imitate all examples

act

ion

state

new strategy

imitate only good examples

Kober & Peters, NIPS 2008

Page 33: Learning and Autonomous Control

Trial 1Reward 0,1

Trial 2Reward 0,3

Trial 3Reward 0,8 Trial 4

Reward 1,0

Reward-weighted Imitation

Page 34: Learning and Autonomous Control

Ball-in-a-Cup

Page 35: Learning and Autonomous Control

Celemin, Maeda, Ruiz-del Solar, Peters, & Kober, under review

Page 36: Learning and Autonomous Control
Page 37: Learning and Autonomous Control

Learning a Difference Model

Rastogi, MSc 2017

Page 38: Learning and Autonomous Control

Rastogi, MSc 2017

Page 39: Learning and Autonomous Control

RL-based Compensation

Pane, MSc 2015

Page 40: Learning and Autonomous Control

Learning Abilities

•minimal •high

•today* •future?

•research

openclipart.org

•*or “brilliant idiots”

Page 41: Learning and Autonomous Control

Courses

• (Partially) taught by LAC

• SC42035 Integration Project S&C

• SC42090 Robot Motion Planning and Control

• SC42050 Knowledge Based Control Systems

• IN4320 Machine Learning

• ME41025 Robotics Practicals

• Other recommended courses

• IN4085 Pattern Recognition

• CS4180 Deep Learning

• IN4010 Artificial Intelligence Techniques

• SC42100 Networked and Distributed Control Systems

• ME41105 Intelligent Vehicles

Page 42: Learning and Autonomous Control

Questions?

latd.com