hopfield methods: application to optimization of …hopfield methods: application to optimization...

55
Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University of California, Berkeley 2019 Grid Science Winter School & Conference Santa Fe, New Mexico USA Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 1

Upload: others

Post on 13-Apr-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Hopfield Methods: Application toOptimization of Distributed Energy Resources

Scott Moura

Assistant Professor | eCAL Director

University of California, Berkeley

2019 Grid Science Winter School & ConferenceSanta Fe, New Mexico USA

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 1

Page 2: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

TianyuHU

SangjaeBAE

Laurel DUNN

SaehongPARK

DongZHANG

BertrandTRAVACCA

Dr. Hector PEREZ

ZheZHOU

ZachGIMA

Dr. HongcaiZHANG

RamonCRESPO

Mathilde BADOUAL

Dylan KATO

TengZENG

Dr. MiladMEMARZADEH

TianyuYANG

Patrick KEYANTUO

Aaron KANDEL

YiqiZHAO

SoominWOO

Pierre-François MASSIANI

Emily YOU

Victoria CHENG

Armando DOMINGOS

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 2

Page 3: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

OptimizationDynamicSystems

&Control DataScience

BatteryManagementSystems(#BATT)

Automated,Connected,&ElectricVehicles(#ACES)

DistributedEnergyResources(#DER)

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 3

Page 4: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Laurel DUNN Mathilde

BADOUAL

TengZENG

Victoria CHENG

Armando DOMINGOS

Probabilistic Model for Assessing Distribution Grid Performance under Hazard Scenarios

Virtual Inertia Frequency Regulation for Renewable Integration

Multi-Armed Bandits DER Control

Optimal Bidding Strategy on the Energy Market - A Reinforcement Learning Approach

Solving Overstay in PEV Charging Station Planning via Chance Constrained Optimization

BertrandTRAVACCA

THIS TALK

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 4

Page 5: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 5

Page 6: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Aggregate Flexible Loads into Virtual Power Plant

AirConditioning Heater Water

HeaterEnergyStorageSystem

Plug-inElectricVehicles

Chen, Hashmi, Mathias, Busic, Meyn (2018)

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 6

Page 7: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

F PEV Charge Schedule Optimization is a MIP!

Only ≈ 10 % of papers on large-scale optimization of PEVs model DISCRETE charging rates!Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 7

Page 8: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

F Problem Statement

Consider a mixed integer nonlinear program (MINLP):

minimize f(x) (1)

subject to: gj(x) ≤ 0, j = 1, · · · ,m (2)

xi ∈ {0,1}, i = 1, · · · ,p < n (3)

0 ≤ xi ≤ 1, i = p + 1, · · · ,n (4)

x ∈ Rn is the optimization variablethe first p < n variables must be binaryf(·) : Rn → R is quadratic and Lf – smoothgj(·) : Rn → R are quadratic and Lj – smooth

Challenge

Solve LARGE-SCALE MINLPs, e.g. n = 103,104,105, · · ·

P vs NP – Millenium Prize Problem

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 8

Page 9: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

F Problem Statement

Consider a mixed integer nonlinear program (MINLP):

minimize f(x) (1)

subject to: gj(x) ≤ 0, j = 1, · · · ,m (2)

xi ∈ {0,1}, i = 1, · · · ,p < n (3)

0 ≤ xi ≤ 1, i = p + 1, · · · ,n (4)

x ∈ Rn is the optimization variablethe first p < n variables must be binaryf(·) : Rn → R is quadratic and Lf – smoothgj(·) : Rn → R are quadratic and Lj – smooth

Challenge

Solve LARGE-SCALE MINLPs, e.g. n = 103,104,105, · · ·

P vs NP – Millenium Prize ProblemScott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 8

Page 10: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Existing Convex Relaxation Methods

1 Binary relaxation

2 Lagrangian relaxation

3 Semi-definite relaxation

4 McCormick relaxations[McCormick ’76][Nagarajan ’16]

5 SoA Branch-and-Bound(linear relaxation) [Belotti ’08]

6 SoA Branch-and-Cut[Achterberg ’08]

7 Quadratic Convexrelaxations [Hijazi ’17]

8 Polyhedral relaxations forMIMFs [Nagarajan ’18]

Stochastic approach to recover integer constraint:

Let xr be sol’n to binary relaxation. Feasible x can be drawnrandomly from {0,1} following Bernoulli distribution B(xr).

This can be sub-optimal.

Example

minimizex∈{0,1}

(x− 1

4

)2

=1

16(x? = 0 is opt. sol’n)

If we apply binary relaxation, we get xr = 14 and

Ex∼B(xr)

(x− 1

4

)2= 3

16 >1

16 !

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 9

Page 11: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Existing Convex Relaxation Methods

1 Binary relaxation

2 Lagrangian relaxation

3 Semi-definite relaxation

4 McCormick relaxations[McCormick ’76][Nagarajan ’16]

5 SoA Branch-and-Bound(linear relaxation) [Belotti ’08]

6 SoA Branch-and-Cut[Achterberg ’08]

7 Quadratic Convexrelaxations [Hijazi ’17]

8 Polyhedral relaxations forMIMFs [Nagarajan ’18]

Stochastic approach to recover integer constraint:

Let xr be sol’n to binary relaxation. Feasible x can be drawnrandomly from {0,1} following Bernoulli distribution B(xr).

This can be sub-optimal.

Example

minimizex∈{0,1}

(x− 1

4

)2

=1

16(x? = 0 is opt. sol’n)

If we apply binary relaxation, we get xr = 14 and

Ex∼B(xr)

(x− 1

4

)2= 3

16 >1

16 !

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 9

Page 12: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Outline

1 Hopfield Methods - What are they?

2 Theoretical Analysis

3 Dual Hopfield Method

4 Example and Application

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 10

Page 13: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

A short history of Hopfield Networks

(1982) J. J. Hopfield used neural nets tomodel collaborative computations

(1985) J. J. Hopfield showed that neuralnets can be used to solve optimizationproblems

(1990’s – 2000’s) Hopfield methodsbecame very popular for solving MIQPs inpower systems optimization

In literature, power system researchersadmit they didn’t fully understand whyHopfield methods work well.

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 11

Page 14: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

The Hopfield Method

Consider MINLP

minimize f(x) (5)

subject to: xi ∈ {0,1}, i = 1, · · · ,p < n (6)

0 ≤ xi ≤ 1, i = p + 1, · · · ,n (7)

Hopfield method follows dynamics:

d

dtxH(t) = −∇f(x(t)); xH(0) = x(0) ∈ (0,1)n (8)

x(t) = σ(xH(t)) (9)

where σ(·) : Rn → [0,1]n is an “activiation function” defined element-wise as:

σ(x) : x 7→ [σ1(x1), · · · , σn(xn)]

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 12

Page 15: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

The Hopfield Method

Consider MINLP

minimize f(x) (5)

subject to: xi ∈ {0,1}, i = 1, · · · ,p < n (6)

0 ≤ xi ≤ 1, i = p + 1, · · · ,n (7)

Hopfield method follows dynamics:

d

dtxH(t) = −∇f(x(t)); xH(0) = x(0) ∈ (0,1)n (8)

x(t) = σ(xH(t)) (9)

where σ(·) : Rn → [0,1]n is an “activiation function” defined element-wise as:

σ(x) : x 7→ [σ1(x1), · · · , σn(xn)]

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 12

Page 16: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

What is activation function σ(x)?

strictly increasing

σ(·) ∈ C1 with Lipschitz constant Lσi

Example: tanh

σi(x) = 12 tanh(βi(x− 1

2 )) + 12 ; βi > 0

“soft projection operator” from R to {0,1}

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 13

Page 17: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

F F Hopfield Method⇒ Nonlinear Gradient Flow

If σ(·) is a homeomorphism, then a nonlinear gradient flow emerges!

d

dtx(t) = −σ′(σ−1(x(t)))�∇f(x(t)) (10)

exp tanh sin pwl

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 14

Page 18: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

F F Hopfield Method⇒ Nonlinear Gradient Flow

If σ(·) is a homeomorphism, then a nonlinear gradient flow emerges!

d

dtx(t) = −σ′(σ−1(x(t)))�∇f(x(t)) (10)

exp tanh sin pwl

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 14

Page 19: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Discretize time dynamics

Forward Euler time discretization of Hopfield dynamics:

xk+1H = xkH − αk∇f(xk); x0

H = x0 ∈ (0,1)n (11)

xk = σ(xkH) (12)

For quadratic f(x) = 12x

TQx

xk+1H = xkH − αkQxk; x0

H = x0 ∈ (0,1)n (13)

xk = σ(xkH) (14)

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 15

Page 20: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Discretize time dynamics

Forward Euler time discretization of Hopfield dynamics:

xk+1H = xkH − αk∇f(xk); x0

H = x0 ∈ (0,1)n (11)

xk = σ(xkH) (12)

For quadratic f(x) = 12x

TQx

xk+1H = xkH − αkQxk; x0

H = x0 ∈ (0,1)n (13)

xk = σ(xkH) (14)

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 15

Page 21: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Graphical Interpretation of Hopfield MethodForward Simulation of Hopfield Neural Net!

Undirected weighted graph

n nodes, one for each xi

Each node has internal (xH,i ∈ R)and external (xi ∈ R) states

Weights [P0]ij are elements ofgradients of obj fcn

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 16

Page 22: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

F Hopfield vs Projected Gradient Descent

Hopfield

xk+1H = xkH − αk∇f(xk) (15)

xk = σ(xkH) (16)

Projected Gradient Descent

xk+1H = xk − αk∇f(xk) (17)

xk = Proj[0,1](xkH) (18)

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 17

Page 23: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

F Hopfield vs Projected Gradient Descent

Hopfield

xk+1H = xkH − αk∇f(xk) (15)

xk = σ(xkH) (16)

Projected Gradient Descent

xk+1 = Proj[0,1](xk − αk∇f(xk)) (17)

No dynamics!

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 17

Page 24: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Simple Comparison

minimizex1,x2 (x1 − 1.5)2 + (x2 − 0.5)2 (18)

subject to: x1 ∈ {0,1} (19)

0 ≤ x2 ≤ 1 (20)

-0.2 0 0.2 0.4 0.6 0.8 1 1.2

-0.2

0

0.2

0.4

0.6

0.8

1

1.2

2 4 6 8 10 12 14

Iterations

0

0.5

1

1.5

2

2.5

f ⋆

fpgd

fhop

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 18

Page 25: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Outline

1 Hopfield Methods - What are they?

2 Theoretical Analysis

3 Dual Hopfield Method

4 Example and Application

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 19

Page 26: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Continuous Improvement to a Fixed Point

Theorem 1: Continuous Improvement

The Hopfield method yields monotonically decreasing iterates, f(xk+1) ≤ f(xk), ∀ k if ...

activation fcn has Lipschitz continuous first derivative: σ(·) ∈ C1 (exp, tanh, sin, pwl)

step-size αk follows an appropriately decreasing schedule

Specifically, the incremental improvement is bounded by:

0 ≤ f(xk)− f(xk+1) ≤ 0.5αk · ∇f(xk)TΣk∇f(xk) where Σk = diag(σ′(xkH))

Corollary: Convergence within a set

There exists a f † such that f(xk)→ f † as k →∞, and xk converges to the (non-empty) set

X =

{x ∈ [0,1]n | xi ∈ {0,1} OR

∂xif(x) = 0, i = 1, · · · ,p

}(21)

Remarks:

Set X includes true minimizer x?, but xk → x? not guaranteed

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 20

Page 27: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Continuous Improvement to a Fixed Point

Theorem 1: Continuous Improvement

The Hopfield method yields monotonically decreasing iterates, f(xk+1) ≤ f(xk), ∀ k if ...

activation fcn has Lipschitz continuous first derivative: σ(·) ∈ C1 (exp, tanh, sin, pwl)

step-size αk follows an appropriately decreasing schedule

Specifically, the incremental improvement is bounded by:

0 ≤ f(xk)− f(xk+1) ≤ 0.5αk · ∇f(xk)TΣk∇f(xk) where Σk = diag(σ′(xkH))

Corollary: Convergence within a set

There exists a f † such that f(xk)→ f † as k →∞, and xk converges to the (non-empty) set

X =

{x ∈ [0,1]n | xi ∈ {0,1} OR

∂xif(x) = 0, i = 1, · · · ,p

}(21)

Remarks:

Set X includes true minimizer x?, but xk → x? not guaranteed

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 20

Page 28: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Convergence Rates

Theorem 2: Sub-linear convergence

If f(x) is convex and σ(·) is smooth and verifies

σ′(σ−1(x)) ≥ min {|x|, |1− x|} , x ∈ [0,1]n (22)

then,

f(xk)− f † = O(

1kr

), with 0 < r < 1

To achieve precision ε, the worst case number of iterations is 2Mn/(β2ε)

M is upper-bound on Hessian: ∇2f(x) � MIn is number of variables x ∈ Rn

β is “hardness” of activation function

Remark: Slower than gradient descent, for which convergence is guaranteed at rate O(

1k

)

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 21

Page 29: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Outline

1 Hopfield Methods - What are they?

2 Theoretical Analysis

3 Dual Hopfield Method

4 Example and Application

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 22

Page 30: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Dual Hopfield Method

So far, we have considered Hopfield methods to approximately solve

minimize f(x) (23)

subject to: 0 ≤ xi ≤ 1 i = 1, · · · ,n (24)

xi ∈ {0,1} i = 1, · · · ,p < n (25)

We now consider inequality constraints:

minimize f(x) (26)

subject to: gj(x) ≤ 0, j = 1, · · · ,m (27)

0 ≤ xi ≤ 1 i = 1, · · · ,n (28)

xi ∈ {0,1} i = 1, · · · ,p < n (29)

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 23

Page 31: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Dual Hopfield Method

So far, we have considered Hopfield methods to approximately solve

minimize f(x) (23)

subject to: 0 ≤ xi ≤ 1 i = 1, · · · ,n (24)

xi ∈ {0,1} i = 1, · · · ,p < n (25)

We now consider inequality constraints:

minimize f(x) (26)

subject to: gj(x) ≤ 0, j = 1, · · · ,m (27)

0 ≤ xi ≤ 1 i = 1, · · · ,n (28)

xi ∈ {0,1} i = 1, · · · ,p < n (29)

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 23

Page 32: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Dual Hopfield MethodApply Lagrangian relaxation

Idea: Instead of considering the “full” Lagrangian relaxation, consider

L(x, µ) = f(x) +m∑j=1

µjgj(x) (30)

Then the dual function is

D(µ) = minx

L(x, µ) = f(x) +m∑j=1

µjgj(x) (31)

subject to: 0 ≤ xi ≤ 1 i = 1, · · · ,n (32)

xi ∈ {0,1} i = 1, · · · ,p < n (33)

which is amenable to Hopfield method, given µ.

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 24

Page 33: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Dual Hopfield MethodApply Lagrangian relaxation

Idea: Instead of considering the “full” Lagrangian relaxation, consider

L(x, µ) = f(x) +m∑j=1

µjgj(x) (30)

Then the dual function is

D(µ) = minx

L(x, µ) = f(x) +m∑j=1

µjgj(x) (31)

subject to: 0 ≤ xi ≤ 1 i = 1, · · · ,n (32)

xi ∈ {0,1} i = 1, · · · ,p < n (33)

which is amenable to Hopfield method, given µ.

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 24

Page 34: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Dual Ascent via Hopfield

Then solve the Dual Problem:

maxµ≥0

D(µ) (34)

D(µ) = minx

L(x, µ) = minx

f(x) +m∑j=1

µjgj(x) (35)

Run Hopfield method to approximately solve D(µ) = minx L(x, µ).

Suppose x?(µ) = arg minx L(x, µ).

The subgradient of D(µ) along dimension j: gj(x?(µ)) ∈ ∂jD(µ)

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 25

Page 35: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Dual Ascent via Hopfield

Then solve the Dual Problem:

maxµ≥0

D(µ) (34)

D(µ) = minx

L(x, µ) = minx

f(x) +m∑j=1

µjgj(x) (35)

Run Hopfield method to approximately solve D(µ) = minx L(x, µ).

Suppose x?(µ) = arg minx L(x, µ).

The subgradient of D(µ) along dimension j: gj(x?(µ)) ∈ ∂jD(µ)

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 25

Page 36: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Dual Hopfield MethodThe Algorithm

Algorithm 1 Dual (sub)-gradient Ascent via Hopfield Method

Initialize λ0 ≥ 0; Choose β > 0for k = 0,1, · · · , kmax

... (1) use Hopfield method to approximately compute dual function

... for ` = 0, · · · , `max

... ... x`+1H = x`H − α`∇xL(x`, µk)

... ... x` = σ(x`+1H )

... ... xkhop ← x`

... until stopping criterion is met

... (2) update dual variable µ via (sub)-gradient ascent

... µk+1 = µk + βk∑m

j=1 gj(xkhop(µk))

end for

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 26

Page 37: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Outline

1 Hopfield Methods - What are they?

2 Theoretical Analysis

3 Dual Hopfield Method

4 Example and Application

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 27

Page 38: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Examples: Random MIQPs

Consider solving MIQP w.r.t. x ∈ Rn

minimize1

2xTQx + RTx (36)

subject to: Ax ≤ b (37)

Aeqx = beq (38)

lb ≤ x ≤ ub (39)

xi ∈ {0,1}, i = 1, · · · ,p (40)

Randomly generated parameters Q,R,A,b,Aeq,beq, lb,ub for each n

Number of constraints also randomized

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 28

Page 39: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Comparative Analysis

All problems solved on Matlab:

CPLEX MIQP: using function cplexmiqpdeveloped by IBM

Binary Relaxation via CPLEX QP : usingfunction cplexqp

Semi-definite relaxation (SDR):corresponding SDP solved using CVX

Hopfield: Dual Ascent Hopfield Methoduses dual variables from cplexqp

For each method, we compute:

computer running time [sec]

constraint violations (CV):binary CV: 1

p

∑pi=1 d(xi, {0,1})

inequality CV: 1m

∑mj=1 |[Ax− b]j|

equality CV: 1`

∑`k=1 |[Aeqx− beq]k|

objective function value

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 29

Page 40: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Comparative AnalysisComputer running time

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 30

Page 41: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 31

Page 42: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Application: Optimal Economic Dispatch of DERs

Consider n generators with cost: fi(xi) = cix2i + bixi + ai, with ai,bi, ci ≥ 0.

The first p generators can only make binary decisions. That is:

∀ i ∈ {1,p} we have xi ∈ {Pi,min, Pi,max}∀ i ∈ {p + 1,n} we have xi ∈ [Pi,min, Pi,max]

Problem StatementFind the optimal dispatch for generators to minimize cost and meet demand:

minimizen∑i

fi(xi)

subject to:n∑i

xi = D

(constraints above)

Simulation parameters: n = 1000 generators. Other parameters randomly generated.We perform 5000 Monte-Carlo simulations.

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 32

Page 43: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Monte Caroline Simulation Results

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 33

Page 44: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

SUMMARYHopfield Methods for large-scale MINLPs – An old heuristic with new analysis!

EXTENSIONSAlternative descent direction

Nesterov acceleration

Chance constraints

Distributed algorithms via dual decomposition

· · ·

ON-GOING / FUTUREApplication to Large-Scale PEV Charge Scheduling

More comprehensive comparative analysis

Open source codes! hmip

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 34

Page 45: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

VISIT US!Energy, Controls, and Applications Lab (eCAL)

ecal.berkeley.edu

[email protected]

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 35

Page 46: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

APPENDIX SLIDES

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 36

Page 47: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 37

Page 48: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Existing MethodsConvex Relaxation #1: Binary Relaxation

Stochastic approach to recover integer constraint:

Let xr be solution to binary relaxation. Feasible x can be drawn randomly from {0,1} followingBernoulli distribution B(xr).

This can be sub-optimal.

Example

minimizex∈{0,1}

(x− 1

4

)2

=1

16(x? = 0 is the optimal solution)

If we apply binary relaxation, we get xr = 14 and Ex∼B(xr)

(x− 1

4

)2= 3

16 >1

16 !

Other ideas:

Branch & Bound, Branch & Cut

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 38

Page 49: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Existing MethodsConvex Relaxation #1: Binary Relaxation

Stochastic approach to recover integer constraint:

Let xr be solution to binary relaxation. Feasible x can be drawn randomly from {0,1} followingBernoulli distribution B(xr).

This can be sub-optimal.

Example

minimizex∈{0,1}

(x− 1

4

)2

=1

16(x? = 0 is the optimal solution)

If we apply binary relaxation, we get xr = 14 and Ex∼B(xr)

(x− 1

4

)2= 3

16 >1

16 !

Other ideas:

Branch & Bound, Branch & Cut

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 38

Page 50: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Existing MethodsConvex Relaxation #2: Lagrangian Relaxation

Notice that xi ∈ {0,1} is equivalent to satisfying xi(1− xi) = 0

minimize f(x) (41)

subject to: gj(x) ≤ 0, j = 1, · · · ,m (42)

0 ≤ x ≤ 1 (43)

xi(1− xi) = 0, i = 1, · · · ,p < n (44)

Form the Lagrangian:

L(x, µ, µ, µ, λ) = f(x) +m∑j=1

[µjgj(x) + µ

jxi + µj(1− xi)

]+

p∑i=1

λixi(1− xi) (45)

Define the (concave) dual function of Λ = [µ, µ, µ, λ]

D(Λ) = minx∈Rn

L(x, µ, µ, µ, λ) (46)

Weak duality approach: Solve convex program maxΛ D(Λ)

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 39

Page 51: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Existing MethodsConvex Relaxation #2: Lagrangian Relaxation

Notice that xi ∈ {0,1} is equivalent to satisfying xi(1− xi) = 0

minimize f(x) (41)

subject to: gj(x) ≤ 0, j = 1, · · · ,m (42)

0 ≤ x ≤ 1 (43)

xi(1− xi) = 0, i = 1, · · · ,p < n (44)

Form the Lagrangian:

L(x, µ, µ, µ, λ) = f(x) +m∑j=1

[µjgj(x) + µ

jxi + µj(1− xi)

]+

p∑i=1

λixi(1− xi) (45)

Define the (concave) dual function of Λ = [µ, µ, µ, λ]

D(Λ) = minx∈Rn

L(x, µ, µ, µ, λ) (46)

Weak duality approach: Solve convex program maxΛ D(Λ)

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 39

Page 52: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Existing MethodsConvex Relaxation #2: Lagrangian Relaxation

Notice that xi ∈ {0,1} is equivalent to satisfying xi(1− xi) = 0

minimize f(x) (41)

subject to: gj(x) ≤ 0, j = 1, · · · ,m (42)

0 ≤ x ≤ 1 (43)

xi(1− xi) = 0, i = 1, · · · ,p < n (44)

Form the Lagrangian:

L(x, µ, µ, µ, λ) = f(x) +m∑j=1

[µjgj(x) + µ

jxi + µj(1− xi)

]+

p∑i=1

λixi(1− xi) (45)

Define the (concave) dual function of Λ = [µ, µ, µ, λ]

D(Λ) = minx∈Rn

L(x, µ, µ, µ, λ) (46)

Weak duality approach: Solve convex program maxΛ D(Λ)Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 39

Page 53: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Existing MethodsConvex Relaxation #3: Semi-definite Relaxation

Introduce new variable X = xxT . This is called “lifting”. Can re-write MIQCQP

minimize1

2Tr(QX) + RTx + S (47)

subject to:1

2Tr(QjX) + RT

j x + Sj ≤ 0, j = 1, · · · ,m (48)

0 ≤ x ≤ 1 (49)

Xii = xi, i = 1, · · · ,p < n (50)

X = xxT (51)

If Q,Qi are positive semi-definite, then only X = xxT makes this non-convex. Relax into convexinequality X � xxT . Using Schur complement:

X � xxT ⇔[X xx 1

]� 0 (52)

This can be cast as a semi-definite program (SDP).

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 40

Page 54: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Existing MethodsConvex Relaxation #3: Semi-definite Relaxation

Introduce new variable X = xxT . This is called “lifting”. Can re-write MIQCQP

minimize1

2Tr(QX) + RTx + S (47)

subject to:1

2Tr(QjX) + RT

j x + Sj ≤ 0, j = 1, · · · ,m (48)

0 ≤ x ≤ 1 (49)

Xii = xi, i = 1, · · · ,p < n (50)

X = xxT (51)

If Q,Qi are positive semi-definite, then only X = xxT makes this non-convex.

Relax into convexinequality X � xxT . Using Schur complement:

X � xxT ⇔[X xx 1

]� 0 (52)

This can be cast as a semi-definite program (SDP).

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 40

Page 55: Hopfield Methods: Application to Optimization of …Hopfield Methods: Application to Optimization of Distributed Energy Resources Scott Moura Assistant Professor | eCAL Director University

Existing MethodsConvex Relaxation #3: Semi-definite Relaxation

Introduce new variable X = xxT . This is called “lifting”. Can re-write MIQCQP

minimize1

2Tr(QX) + RTx + S (47)

subject to:1

2Tr(QjX) + RT

j x + Sj ≤ 0, j = 1, · · · ,m (48)

0 ≤ x ≤ 1 (49)

Xii = xi, i = 1, · · · ,p < n (50)

X = xxT (51)

If Q,Qi are positive semi-definite, then only X = xxT makes this non-convex. Relax into convexinequality X � xxT . Using Schur complement:

X � xxT ⇔[X xx 1

]� 0 (52)

This can be cast as a semi-definite program (SDP).

Scott Moura | UC Berkeley Hopfield Methods: App to DERs January 11, 2019 | Slide 40