neural robot control

31
Project funded by the Future and Emerging Technologies arm of the IST Programme Project funded by the Future and Emerging Technologies arm of the IST Programme FET-Open scheme FET-Open scheme Neural Robot Control Neural Robot Control Cornelius Weber Hybrid Intelligent Systems University of Sunderland Talk at Nottingham Trent University, 8 th December 2004 on the occasion of returning the MI competition trophy Collaborators: Mark Elshaw, Alex Zochios, Chris Rowan and Stefan Wermter

Upload: meara

Post on 07-Jan-2016

26 views

Category:

Documents


0 download

DESCRIPTION

Neural Robot Control. Cornelius Weber Hybrid Intelligent Systems University of Sunderland Talk at Nottingham Trent University, 8 th December 2004 on the occasion of returning the MI competition trophy Collaborators: Mark Elshaw, Alex Zochios, Chris Rowan and Stefan Wermter. Contents. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Neural Robot ControlNeural Robot ControlNeural Robot ControlNeural Robot Control

Cornelius Weber

Hybrid Intelligent Systems

University of Sunderland

Talk at Nottingham Trent University, 8th December 2004

on the occasion of returning the MI competition trophy

Collaborators: Mark Elshaw, Alex Zochios, Chris Rowan and Stefan Wermter

Cornelius Weber

Hybrid Intelligent Systems

University of Sunderland

Talk at Nottingham Trent University, 8th December 2004

on the occasion of returning the MI competition trophy

Collaborators: Mark Elshaw, Alex Zochios, Chris Rowan and Stefan Wermter

Page 2: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

ContentsContentsContentsContents

• Visual cortex & reinforcement network for docking

• Cortex self-imitation network for docking

• Imitation networks for multiple actions:

1-stage/2-stage hierarchical network

• Outlook

Page 3: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

ContentsContentsContentsContents

• Visual cortex & reinforcement network for docking

• Cortex self-imitation network for docking

• Imitation networks for multiple actions:

1-stage/2-stage hierarchical network

• Outlook

Page 4: Neural Robot Control

Example Task: DockingExample Task: DockingExample Task: DockingExample Task: Docking

Page 5: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Docking ArchitectureDocking ArchitectureInformation FlowInformation Flow

Docking ArchitectureDocking ArchitectureInformation FlowInformation Flow

Page 6: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Docking ArchitectureDocking ArchitectureTraining (1/3)Training (1/3)

Docking ArchitectureDocking ArchitectureTraining (1/3)Training (1/3)

unsupervised traininggenerative modelsparse distributed coding

Page 7: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

V1 Receptive FieldsV1 Receptive Fields(training result)(training result)

V1 Receptive FieldsV1 Receptive Fields(training result)(training result)

Page 8: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

winner

Comparison ofComparison ofResponse CharacteristicsResponse Characteristics

Comparison ofComparison ofResponse CharacteristicsResponse Characteristics

linear sparse competitive

Page 9: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Attractor Network:Attractor Network:Competition via RelaxationCompetition via Relaxation

Attractor Network:Attractor Network:Competition via RelaxationCompetition via Relaxation

weight profile activation profile

activation update

y(t+1) = f (Wlat y(t))

Page 10: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Docking ArchitectureDocking ArchitectureTraining (2/3)Training (2/3)

Docking ArchitectureDocking ArchitectureTraining (2/3)Training (2/3)

supervised training,attractor networkfor pattern completion

Page 11: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Docking ArchitectureDocking ArchitectureVisual SystemVisual System

Docking ArchitectureDocking ArchitectureVisual SystemVisual System

Page 12: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Docking ArchitectureDocking ArchitectureTraining (3/3)Training (3/3)

Docking ArchitectureDocking ArchitectureTraining (3/3)Training (3/3)

reinforcement trainingactor-critic model

Page 13: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

ContentsContentsContentsContents

• Visual cortex & reinforcement network for docking

• Cortex self-imitation network for docking

• Imitation networks for multiple actions:

1-stage/2-stage hierarchical network

• Outlook

Page 14: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Mirror NeuronMirror NeuronDocking ArchitectureDocking Architecture

Information FlowInformation Flow

Mirror NeuronMirror NeuronDocking ArchitectureDocking Architecture

Information FlowInformation Flow

Page 15: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

unsupervised traininggenerative modeldistributed coding

Mirror NeuronMirror NeuronDocking ArchitectureDocking Architecture

TrainingTraining

Mirror NeuronMirror NeuronDocking ArchitectureDocking Architecture

TrainingTraining

Page 16: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

supervised training,attractor networkfor prediction

Mirror NeuronMirror NeuronDocking ArchitectureDocking Architecture

TrainingTraining

Mirror NeuronMirror NeuronDocking ArchitectureDocking Architecture

TrainingTraining

Page 17: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Mirror Neuron Self-ImitationMirror Neuron Self-ImitationDocking ArchitectureDocking Architecture

Information FlowInformation Flow

Mirror Neuron Self-ImitationMirror Neuron Self-ImitationDocking ArchitectureDocking Architecture

Information FlowInformation Flow

Page 18: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Basal Ganglia vs. Motor CortexBasal Ganglia vs. Motor CortexBasal Ganglia vs. Motor CortexBasal Ganglia vs. Motor Cortex

Basal ganglia units are active during early task acquisition but not at a later stage (rat T maze decision task).

early: late:

Basal Ganglia ≙ state space?Motor cortex might take over BG function via self-imitation.

Jog et al. (1999)Jog et al. (1999) Science, 286, 1158-61 Science, 286, 1158-61

Jog et al. (1999)Jog et al. (1999) Science, 286, 1158-61 Science, 286, 1158-61

Page 19: Neural Robot Control

Docking via Mirror NeuronsDocking via Mirror NeuronsDocking via Mirror NeuronsDocking via Mirror Neurons

Page 20: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

ContentsContentsContentsContents

• Visual cortex & reinforcement network for docking

• Cortex self-imitation network for docking

• Imitation networks for multiple actions:

1-stage/2-stage hierarchical network

• Outlook

Page 21: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Simulated Robot EnvironmentSimulated Robot EnvironmentSimulated Robot EnvironmentSimulated Robot Environment

Page 22: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Imitation Model ChoiceImitation Model ChoiceImitation Model ChoiceImitation Model Choice

Page 23: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Areas of Motor- and Language Areas of Motor- and Language RepresentationsRepresentations

Areas of Motor- and Language Areas of Motor- and Language RepresentationsRepresentations

forward back left right

‘go’ ‘pick’ ‘lift’ all

individual unit’sreceptive fieldsin hidden area

motor units

language units

Page 24: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Areas of Task-Specific ActivationsAreas of Task-Specific ActivationsAreas of Task-Specific ActivationsAreas of Task-Specific Activations

‘go’ ‘pick’ ‘lift’

‘go’ ‘pick’ ‘lift’

Recognition:

Production:

Activations agree with the Somatotopy-of-Action-Words Model.

Page 25: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Language InstructedLanguage InstructedImitative BehaviourImitative Behaviour

Language InstructedLanguage InstructedImitative BehaviourImitative Behaviour

‘go’ ‘pick’ ‘lift’

Page 26: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Imitation Model ChoiceImitation Model ChoiceImitation Model ChoiceImitation Model Choice

Page 27: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Neuron’s Receptive Fields in HM AreaNeuron’s Receptive Fields in HM AreaNeuron’s Receptive Fields in HM AreaNeuron’s Receptive Fields in HM Area

motor units

4 SOM-area units

forward backward left right

Page 28: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Conclusion for Imitation NetworkConclusion for Imitation NetworkConclusion for Imitation NetworkConclusion for Imitation Network

A neural network as a generative model for sensory stimuli

• generates interactive action sequences

• allows for context dependent interactive action sequences

Page 29: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

ContentsContentsContentsContents

• Visual cortex & reinforcement network for docking

• Cortex self-imitation network for docking

• Imitation networks for multiple actions:

1-stage/2-stage hierarchical network

• Outlook

Page 30: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Outlook (1/2): Object-Background Outlook (1/2): Object-Background Separation for Enhanced Object Separation for Enhanced Object

LearningLearning

Outlook (1/2): Object-Background Outlook (1/2): Object-Background Separation for Enhanced Object Separation for Enhanced Object

LearningLearning

Page 31: Neural Robot Control

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Project funded by the Future and Emerging Technologies arm of the IST ProgrammeProject funded by the Future and Emerging Technologies arm of the IST ProgrammeFET-Open schemeFET-Open scheme

Outlook (2/2): Docking Range Outlook (2/2): Docking Range Extension by Neural Coordinate Extension by Neural Coordinate

TransformationsTransformations

Outlook (2/2): Docking Range Outlook (2/2): Docking Range Extension by Neural Coordinate Extension by Neural Coordinate

TransformationsTransformations