improving long- term learning of model reference adaptive...

26
Improving Long-Term Learning of Model Reference Adaptive Controllers for Flight Applications: A Sparse Neural Network Approach AIAA Guidance, Navigation, and Control Conference January 2017 Scott A. Nivison Pramod P. Khargonekar Department of Electrical and Computer Engineering University of Florida Distribution A: Approved for public release; distribution is unlimited.

Upload: others

Post on 08-Apr-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Improving Long-Term Learning of Model Reference Adaptive

Controllers for Flight Applications: A Sparse Neural Network Approach

AIAA Guidance, Navigation, and Control Conference January 2017

Scott A. Nivison Pramod P. Khargonekar

Department of Electrical and Computer Engineering University of Florida

Distribution A: Approved for public release; distribution is unlimited.

Page 2: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Outline

Motivation Prior Research MRAC Formulation Unstructured Neural Network (SHL) Structured Neural Network (RBF) Sparse Neural Network Approach (SNN) Simulation Results Future Research Goals

Distribution A: Approved for public release; distribution is unlimited

Page 3: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Motivation

Highly Dynamic Flight Vehicles Trade-offs: Number of Nodes and Learning Rates

Sparse Learning Sparse Auto-encoders Sparse Activation Function: Linear Rectifier Sparse Optimization Techniques: Max-out and

Channel-out

Long-Term Learning Performance: Uncertainty Estimates

Distribution A: Approved for public release; distribution is unlimited.

Page 4: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Prior Research

Develop a MRAC architecture that improves long-term learning and tracking performance of flight vehicles with consistent uncertainties over regions of the flight envelope while utilizing small to moderate learning rates and significant processing constraints.

Research Goals

Enhancements to the MRAC architecture 𝐿1 Adaptive Control Concurrent Learning

Distribution A: Approved for public release; distribution is unlimited.

Page 5: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

MRAC Formulation

System Dynamics:

οΏ½Μ‡οΏ½ = 𝐴π‘₯ + 𝐡Λ 𝑒 + 𝑓 π‘₯ + π΅π‘Ÿπ‘Ÿπ‘Ÿπ‘¦π‘π‘π‘ 𝑦 = 𝐢π‘₯

𝐴 ∈ ℝ𝑛×𝑛,𝐡 ∈ ℝ𝑛×𝑐, π΅π‘Ÿπ‘Ÿπ‘Ÿ ∈ ℝ𝑛×𝑐, 𝐢 ∈ ℝ𝑝×𝑛 are constant known matrices

Ξ› ∈ ℝ𝑐×𝑐 constant unknown diagonal matrix 𝑓(π‘₯) ∈ ℝ𝑐 unknown continuous differentiable function

𝑦𝑐𝑐𝑐 ∈ ℝ𝑐 external bounded time-varying command

Reference Model: οΏ½Μ‡οΏ½π‘Ÿπ‘Ÿπ‘Ÿ = π΄π‘Ÿπ‘Ÿπ‘Ÿπ‘₯π‘Ÿπ‘Ÿπ‘Ÿ + π΅π‘Ÿπ‘Ÿπ‘Ÿπ‘¦π‘π‘π‘

π‘¦π‘Ÿπ‘Ÿπ‘Ÿ = πΆπ‘Ÿπ‘Ÿπ‘Ÿπ‘₯π‘Ÿπ‘Ÿπ‘Ÿ

π΄π‘Ÿπ‘Ÿπ‘Ÿ ∈ ℝ𝑛×𝑛, πΆπ‘Ÿπ‘Ÿπ‘Ÿ ∈ ℝ𝑝×𝑛 are constant known matrices 𝐴, 𝐡Λ is controllable

Distribution A: Approved for public release; distribution is unlimited.

Aref= 𝐴 βˆ’ 𝐡𝐾𝐿𝐿𝐿

Page 6: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

MRAC – Adaptive Augmentation

𝑒 = 𝑒𝐡𝐿 + 𝑒𝐴𝐴 Overall Control:

𝑒𝐡𝐿 = βˆ’πΎπΏπΏπΏπ‘₯, π‘₯ = (𝑒𝐼 , π‘₯𝑝) ∈ ℝ𝑛 𝑒�̇� = 𝑦 βˆ’ 𝑦𝑐𝑐𝑐

Distribution A: Approved for public release; distribution is unlimited.

Page 7: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

MRAC Formulation

State Tracking Error:

𝑒 = π‘₯ βˆ’ π‘₯π‘Ÿπ‘Ÿπ‘Ÿ

Adaptive Controller:

οΏ½Μ‡οΏ½ = π΄π‘Ÿπ‘Ÿπ‘Ÿπ‘’ + 𝐡Λ(𝑒𝐴𝐴 + 𝑓 π‘₯ + 𝐼 βˆ’ Ξ›βˆ’1 𝑒𝐡𝐿)

𝑒𝐴𝐴 = βˆ’πΎοΏ½π΅πΏ(βˆ’πΎπΏπΏπΏπ‘₯) βˆ’ Θ�TΞ¦(π‘₯)

Tracking Objective: limtβ†’βˆž

π‘₯ 𝑑 βˆ’ π‘₯π‘Ÿπ‘Ÿπ‘Ÿ 𝑑 ≀ πœ–

Θ� ∈ ℝ 𝑁+1 ×𝑐, Ξ¦ π‘₯ ∈ ℝ 𝑁+1 are matrices of NN weights

Distribution A: Approved for public release; distribution is unlimited.

Page 8: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Unstructured Neural Network (SHL)

State Tracking Error:

𝑓 π‘₯ = π‘Šπ‘‡πœŽ π‘‰π‘‡πœ‡ + πœ–

π‘Š = π‘Šπ‘‡ π‘π‘Š 𝑇 ∈ ℝ(𝑁+1)×𝑐

NN Approximation Theorem:

𝑉 = 𝑉𝑇 𝑏𝑉 𝑇 ∈ ℝ(𝑛+1)×𝑁

Adaptive Controller:

𝑒𝐴𝐴 = βˆ’π‘ŠοΏ½ π‘‡πœŽ π‘‰οΏ½π‘‡πœ‡ , πœ‡ ∈ ℝ𝑛+1

Adaptive Update Laws:

π‘ŠοΏ½Μ‡ = 𝑃𝑃𝑃𝑃(2Ξ“π‘Š((𝜎(π‘‰οΏ½π‘‡πœ‡)-οΏ½Μ‡οΏ½(π‘‰οΏ½π‘‡πœ‡) π‘‰οΏ½π‘‡πœ‡)𝑒𝑇𝑃𝐡) 𝑉�̇ = 𝑃𝑃𝑃𝑃(2Ξ“π‘‰πœ‡π‘’π‘‡π‘ƒπ΅π‘ŠοΏ½ 𝑇�̇�(π‘‰οΏ½π‘‡πœ‡)) Distribution A: Approved for public release; distribution is unlimited.

Page 9: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Structured Neural Network (RBF)

State Tracking Error:

Adaptive Controller:

𝑒𝐴𝐴 = βˆ’π‘ŠοΏ½ π‘‡πœ™ π‘₯

Adaptive Update Law:

π‘ŠοΏ½Μ‡ = 𝑃𝑃𝑃𝑃 Ξ“π‘Šπœ™ π‘₯ 𝑒𝑇𝑃𝐡

πœ™ π‘₯ = πœ™1 π‘₯ , … ,πœ™π‘ π‘₯ , 1 𝑇 ∈ ℝ𝑁+1 is a vector of 𝑁 RBFs π‘ŠοΏ½ ∈ ℝ(𝑁+1)×𝑐 are the outer layer weights

πœ™π‘– π‘₯ = 𝑒π‘₯βˆ’π‘₯𝑐

2

2𝜎𝐷 , βˆ€π‘– = 1, … ,𝑁 π‘₯𝑐 is the fixed center 𝜎𝐴 is the RBF width

Distribution A: Approved for public release; distribution is unlimited.

Page 10: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Sparse Neural Network (SNN)

State Tracking Error:

Adaptive Controller:

Distribution A: Approved for public release; distribution is unlimited.

Segment flight

envelope into regions and distribute a specified number of nodes to each region.

Activate only a small percentage of the total number of nodes for control at each point in the operating envelope

Page 11: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Sparse Neural Network (SNN)

State Tracking Error:

Adaptive Controller: 𝑃 = 𝑝1, … ,𝑝𝑇 𝑆 = 𝑠1, … , 𝑠𝑇 𝑇 ∈ β„• 𝑇 is the total number of segments 𝐼 = {1, … ,𝑇}

𝑠𝑖 = {π‘₯π‘œπ‘ ∈ 𝑋: 𝐷 π‘₯π‘œπ‘,𝑝𝑖 ≀ 𝐷 π‘₯π‘œπ‘,𝑝𝑗 βˆ€π‘– β‰  𝑃}

𝐷:𝑋 Γ— 𝑋 β†’ ℝ π‘₯π‘œπ‘ ∈ ℝ𝑁

SNN Definitions:

Metric Space 𝑋,𝐷 :

Distribution A: Approved for public release; distribution is unlimited.

Page 12: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Sparse Neural Network (SNN)

State Tracking Error:

Adaptive Controller: 𝑁 ∈ β„• is the total number of nodes 𝑄 ∈ β„• where 𝑄 = 𝑁

𝑇

𝑄 is the number of nodes per segment

πΈπ‘–βˆˆπΌ = {𝑒𝐿 π‘–βˆ’1 +1, … , 𝑒𝑖𝐿} where π‘Œ =βˆͺπ‘–βˆˆπΌ 𝐸𝑖

π‘Œ = 𝑒1, … , 𝑒𝑁 𝐡 = {1, … ,𝑁}

SNN Definitions:

Adaptive Controller: Nodes per Segment:

Distribution A: Approved for public release; distribution is unlimited.

Page 13: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Sparse Neural Network (SNN)

State Tracking Error:

Adaptive Controller: 𝐸𝐴𝑖 = 𝐸𝑖 βˆ€π‘– ∈ 𝐼

Pure Sparse Approach (R=Q):

Adaptive Controller: Blended Approach (R>Q):

𝑅 ∈ β„• is the number of active nodes

𝐸𝐴𝑖 βŠ† 𝐸𝑖 βˆ€π‘– ∈ 𝐼

Distribution A: Approved for public release; distribution is unlimited.

Each segment activates only nodes that were allocated to that segment

Each segment activates R nearby nodes regardless of node segment assignment

Page 14: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Sparse Neural Network (SNN)

State Tracking Error:

Adaptive Controller:

Adaptive Controller:

Adaptive Update Laws:

𝑒𝐴𝐴 = βˆ’π‘Šπ‘–οΏ½ π‘‡πœŽ 𝑉� π‘–π‘‡πœ‡

π‘Šπ‘–οΏ½Μ‡ = 𝑃𝑃𝑃𝑃(2Ξ“π‘Š((𝜎(π‘‰π‘–οΏ½π‘‡πœ‡)-οΏ½Μ‡οΏ½(𝑉𝑖�

π‘‡πœ‡) 𝑉� π‘–π‘‡πœ‡)𝑒𝑇𝑃𝐡)

𝑉𝑖�̇ = 𝑃𝑃𝑃𝑃(2Ξ“π‘‰πœ‡π‘’π‘‡π‘ƒπ΅π‘Šπ‘–οΏ½ 𝑇�̇�(𝑉𝑖�

π‘‡πœ‡))

Distribution A: Approved for public release; distribution is unlimited.

Page 15: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Sparse Neural Network (SNN)

Distribution A: Approved for public release; distribution is unlimited.

Page 16: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Simulation

Adaptive Controller: Longitudinal Short-Period Dynamics for High-Speed Flight Vehicle:

Adaptive Controller: Flight Condition:

Distribution A: Approved for public release; distribution is unlimited.

Page 17: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Results - LQR

Distribution A: Approved for public release; distribution is unlimited.

Page 18: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Results - SHL

Distribution A: Approved for public release; distribution is unlimited.

Page 19: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Results - RBF

Distribution A: Approved for public release; distribution is unlimited.

Page 20: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Results – RBF vs SHL

Distribution A: Approved for public release; distribution is unlimited.

Page 21: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Results - SNN

Adaptive Controller: Longitudinal Short-Period Dynamics for High-Speed Flight Vehicle:

Adaptive Controller:

Distribution A: Approved for public release; distribution is unlimited.

Since the disturbance, 𝑓(π‘₯) , was designed based on a single input variable, 𝛼 , only the 1-D SNN architecture with T=91 segments was employed for simulation results.

Page 22: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Results – SNN

Distribution A: Approved for public release; distribution is unlimited.

Page 23: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Results and Conclusions

Distribution A: Approved for public release; distribution is unlimited.

Page 24: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Conclusions

Traditional Neural Network schemes typically update adaptive weights based solely on the current state vector which leads to poor long-term learning Sparse Neural Network (SNN) adaptive controllers only update a small portion of neurons at each point in the flight envelope Better memory for uncertainty estimates and weights

from previously visited segments Superior tracking performance and uncertainty

estimates for tasks that have consistent uncertainties and disturbances over regions of the flight envelope

Distribution A: Approved for public release; distribution is unlimited.

Page 25: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Future Work

Develop standard analysis tools to explore trade-offs between variations of neural network adaptive controllers Explore effectiveness of high dimensional sparse

neural network (SNN) adaptive controllers against numerous uncertainties

Investigate structured sparse neural networks (SNN) for adaptive control of flight vehicles

Distribution A: Approved for public release; distribution is unlimited.

Page 26: Improving Long- Term Learning of Model Reference Adaptive ...faculty.sites.uci.edu/khargonekar/files/2017/06/...Develop a MRAC architecture that improves long-term learning and tracking

Questions?

Distribution A: Approved for public release; distribution is unlimited.