low 201405 opftutorial2parts tcns

Upload: rasimm1146

Post on 22-Feb-2018

219 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/24/2019 Low 201405 OPFTutorial2parts TCNS

    1/25

    IEEE TRANS. ON CONTROL OF NETWORK SYSTEMS, 1(1):1527, MARCH 2014 1

    Convex Relaxation of Optimal Power Flow

    Part I: Formulations and Equivalence

    Steven H. Low EAS, Caltech [email protected]

    AbstractThis tutorial summarizes recent advances in theconvex relaxation of the optimal power flow (OPF) problem,focusing on structural properties rather than algorithms. PartI presents two power flow models, formulates OPF and theirrelaxations in each model, and proves equivalence relations amongthem. Part II presents sufficient conditions under which the convexrelaxations are exact.

    I. INTRODUCTION

    For our purposes an optimal power flow (OPF) problem

    is a mathematical program that seeks to minimize a certain

    function, such as total power loss, generation cost or user

    disutility, subject to the Kirchhoffs laws as well as capacity,

    stability and security constraints. OPF is fundamental in power

    system operations as it underlies many applications such as

    economic dispatch, unit commitment, state estimation, stability

    and reliability assessment, volt/var control, demand response,

    etc. There has been a great deal of research on OPF since

    Carpentiers first formulation in 1962 [2]. An early solution

    appears in [3] and extensive surveys can be found in e.g. [4],

    [5], [6], [7], [8], [9], [10], [11], [12], [13], [14], [15].

    Power flow equations are quadratic and hence OPF can be

    formulated as a quadratically constrained quadratic program

    (QCQP). It is generally nonconvex and hence NP-hard. Alarge number of optimization algorithms and relaxations have

    been proposed. A popular approximation is a linear program,

    called DC OPF, obtained through the linearization of the power

    flow equations e.g. [16], [17], [18], [19], [20]. See also [21]

    for a more accurate linear approximation. To the best of our

    knowledge solving OPF through semidefinite relaxation is first

    proposed in [22] as a second-order cone program (SOCP) for

    radial (tree) networks and in [23] as a semidefinite program

    (SDP) for general networks in a bus injection model. It is first

    proposed in [51], [57] as an SOCP for radial networks in the

    branch flow model of [45]. See Remark 6 below for more

    details. While these convex relaxations have been illustrated

    numerically in [22] and [23], whether or when they will turnout to be exact is first studied in [24]. Exploiting graph sparsity

    to simplify the SDP relaxation of OPF is first proposed in [25],

    [26] and analyzed in [27], [28].

    Convex relaxation of quadratic programs has been applied

    to many engineering problems; see e.g. [29]. There is a rich

    theory and extensive empirical experiences. Compared with

    other approaches, solving OPF through convex relaxation offers

    several advantages. First, while DC OPF is useful in a wide

    variety of applications, it is not applicable in other applications;

    A preliminary and abridged version has appeared in [1].

    see Remark 10. Second a solution of DC OPF may not be

    feasible (may not satisfy the nonlinear power flow equations).

    In this case an operator may tighten some constraints in DC

    OPF and solve again. This may not only reduce efficiency but

    also relies on heuristics that are hard to scale to larger systems

    or faster control in the future. Third, when they converge,

    most nonlinear algorithms compute a local optimal usually

    without assurance on the quality of the solution. In contrast

    a convex relaxation provides for the first time the ability to

    check if a solution is globally optimal. If it is not, the solution

    provides a lower bound on the minimum cost and hence abound on how far any feasible solution is from optimality.

    Unlike approximations, if a relaxed problem is infeasible, it

    is a certificate that the original OPF is infeasible.

    This two-part tutorial explains the main theoretical results

    on semidefinite relaxations of OPF developed in the last few

    years. Part I presents two power flow models that are useful in

    different situations, formulates OPF and its convex relaxations

    in each model, and clarifies their relationship. Part II [30]

    presents sufficient conditions that guarantee the relaxations are

    exact, i.e. when one can recover a globally optimal solution of

    OPF from an optimal solution of its relaxations. We focus on

    basic results using the simplest OPF formulation and does not

    cover many relevant works in the literature, such as stochasticOPF e.g. [31], [32], [33], distributed OPF e.g. [34], [35], [36],

    [37], [38], [39], new applications e.g. [40], [41], or what to do

    when relaxation fails e.g. [42], [43], [44], to name just a few.

    Outline of paper

    Many mathematical models have been used to model power

    networks. In Part I of this two-part paper we present two such

    models, we call the bus injection model (BIM) and the branch

    flow model (BFM). Each model consists of a set of power flow

    equations. Each models a power network in that the solutions of

    each set of equations, called the power flow solutions, describe

    the steady state of the network. We prove that these two modelsare equivalent in the sense that there is a bijection between their

    solution sets (Section II). We formulate OPF within each model

    where the power flow solutions define the feasible set of OPF

    (Section III). Even though BIM and BFM are equivalent some

    results are much easier to formulate or prove in one model than

    the other; see Remark 2 in Section II.

    The complexity of OPF formulated here lies in the noncon-

    vexity of power flow equations that gives rise to a nonconvex

    feasible set of OPF. We develop various characterizations of

    the feasible set and design convex supersets based on these

    characterizations. Different designs lead to different convex

  • 7/24/2019 Low 201405 OPFTutorial2parts TCNS

    2/25

    IEEE TRANS. ON CONTROL OF NETWORK SYSTEMS, 1(1):1527, MARCH 2014 2

    relaxations and we prove their relationship (Sections IV and V).

    When a relaxation is exact an optimal solution of the original

    nonconvex OPF can be recovered from any optimal solution of

    the relaxation. In Part II [30] we present sufficient conditions

    that guarantee the exactness of convex relaxations.

    Branch flow models are originally proposed for networks

    with a tree topology, called radial networks, e.g. [45], [46],

    [47], [48], [49], [50], [51], [52], [53]. They take a recursivestructure that simples the computation of power flow solutions,

    e.g. [54], [55], [47]. The model of [45], [46] also has a

    linearization that offers several advantages over DC OPF in

    BIM; see Remark 10. The linear approximation provides simple

    bounds on the branch powers and voltage magnitudes in the

    nonlinear BFM (Section VI). These bounds are used in [56] to

    prove a sufficient condition for exact relaxation.

    Finally we make algorithmic recommendations in Section

    VII based on the key results presented here. Most proofs can

    be found in the original papers (or the arXiv version of this

    tutorial) and are omitted.

    Notations

    Let C denote the set of complex numbers and R the set of

    real numbers. For a C, Rea and Ima denote the real and

    imaginary parts of a respectively. For any set A Cn, convA

    denotes the convex hull ofA. Fora R,[a]+ := max{a, 0}. Fora, bC, a b means Rea Reb and Ima Imb. We abuse

    notation to use the same symbol a to denote either a complex

    number Re a +i Im a or a 2-dimensional real vector a=(Re a,Im a) depending on the context.

    In general scalar or vector variables are in small letters, e.g.

    u,

    w,

    x,

    y,

    z. Most power system quantities however are in capitalletters, e.g. Sjk, Pjk, Qjk,Ij , Vj. A variable without a subscript

    denotes a vector with appropriate components, e.g. s := (sj ,j=0, . . . , n), S:= (Sjk, (j, k) E). For vectors x,y, x y denotescomponentwise inequality.

    Matrices are usually in capital letters. The transpose of

    a matrix A is denoted by AT and its Hermitian (complex

    conjugate) transpose byAH. A matrix A is Hermitian ifA =AH.A is positive semidefinite (or psd), denoted by A 0, if A is

    Hermitian and xHAx 0 for all x Cn; in particular if A 0

    then by definition A= AH. For matrices A,B, A B meansAB is psd. Let Sn be the set of all nn Hermitian matrices

    and Sn+ the set ofnn psd matrices.

    A graphG = (N,E)consists of a set Nof nodes and a set ENNof edges. IfG is undirected then(j, k)Eif and only if(k,j) E. IfG is directed then (j, k) Eonly if(k,j) 6E; inthis case we will use(j, k)and j kinterchangeably to denotean edge pointing from j to k. We sometimes use G= (N, E)to denote a directed graph. By j k we mean an edge (j, k)if G is undirected and either j k or k j if G is directed.

    Sometimes we write j G or (j, k) G to mean j N or(j, k) E respectively. A cycle c:= (j1, . . . , jK) is an orderedset of nodes jkNso that (jk,jk+1) E for k=1, . . . , Kwiththe understanding that jK+1:= j1. In that case we refer to a linkor a node in the cycle by (jk,jk+1) c or jk c respectively.

    I I . POWER FLOW MODELS

    In this section we describe two mathematical models of

    power networks and prove their equivalence. By a mathemat-

    ical model we mean a set of variables and a set of equations

    relating these variables. These equations are motivated by the

    physical system, but mathematically, they are the starting point

    from which all claims are derived.

    A. Bus injection model

    Consider a power network modeled by a connected undi-

    rected graphG(N+,E)whereN+ := {0}N,N:= {1, 2, . . . , n},and EN+N+. Each node in N+ represents a bus and each

    edge in E represents a transmission or distribution line. We

    use bus and node interchangeably and line and edge

    interchangeably. For each edge (i,j) E let yi j C be itsadmittance. A bus j N+ can have a generator, a load, both or

    neither. Let Vj be the complex voltage at bus j N+ and |Vj|

    denote its magnitude. Bus 0 is the slack bus. Its voltage is fixed

    and we assume without loss of generality that V0= 10 per

    unit (pu). Let sj be the net complex power injection (generation

    minus load) at bus j N+.

    The bus injection model (BIM) is defined by the following

    power flow equations that describe the Kirchhoffs laws:

    sj = k:jk

    yHjk Vj(VHj V

    Hk ), j N

    + (1)

    Let the set of power flow solutions V for each s be:

    V(s) := {VCn+1 | V satisfies (1)}

    For convenience we include V0 in the vector variable V :=(Vj ,j N

    +) with the understanding that V0:=10 is fixed.

    Remark 1: Bus types. Each bus j is characterized by two

    complex variablesVj

    andsj

    , or equivalently, four real variables.

    The buses are usually classified into three types, depending

    on which two of the four real variables are specified. For the

    slack bus 0, V0 is given and s0 is variable. For a generator bus

    (also called PV-bus), Re(sj) = pj and |Vj| are specified andIm(sj) =qj and Vj are variable. For a load bus (also calledPQ-bus), sj is specified and Vj is variable. The power flow

    or load flow problem is: given two of the four real variables

    specified for each bus, solve the n + 1 complex equations in (1)for the remaining 2(n + 1) real variables. For instance whenall n buses j 6= 0 are all load buses, the power flow problemsolves (1) for the n complex voltages Vj ,j 6=0, and the powerinjection s0 at the slack bus 0. This can model a distribution

    system with a substation at bus 0 and n constant-power loads atthe other buses. Foroptimal power flowproblems p j and |Vj|ongenerator buses orsj on load buses can be variables as well. For

    instance economic dispatch optimizes real power generations p jat generator buses; demand response optimizes demands sj at

    load buses; and volt/var control optimizes reactive powers qj at

    capacitor banks, tap changers, or inverters. These remarks also

    apply to the branch flow model presented next.

    B. Branch flow model

    In the branch flow model we adopt a connected directed

    graph G= (N+, E) where each node in N+ := {0, 1, . . . , n}

  • 7/24/2019 Low 201405 OPFTutorial2parts TCNS

    3/25

    IEEE TRANS. ON CONTROL OF NETWORK SYSTEMS, 1(1):1527, MARCH 2014 3

    represents a bus and each edge in E N+ N+ represents

    a transmission or distribution line. Fix an arbitrary orientation

    for G and let m:=| E| be the number of directed edges inG. Denote an edge by (j,k) or j k if it points from nodej to node k. For each edge (j,k) E let zjk :=1/yjk be thecomplex impedance on the line; let Ijk be the complex current

    and Sjk= Pjk+ iQjk be the sending-end complex power from

    buses j to k. For each bus j N+

    letVj be the complex voltageat bus j. Assume without loss of generality that V0=10

    pu.

    Let sj be the net complex power injection at bus j.

    The branch flow model (BFM) in [57] is defined by the

    following set of power flow equations:

    k:jk

    Sjk = i:ij

    Si jzi j|Ii j|

    2

    + sj, j N+ (2a)

    Ijk = yjk(VjVk), j k E (2b)

    Sjk = VjIHjk, j k E (2c)

    where (2b) is the Ohms law, (2c) defines branch power, and

    (2a) imposes power balance at each bus. The quantity zi j|Ii j|2

    represents line loss so that Si j zi j|Ii j|2 is the receiving-end

    complex power at bus j from bus i.

    Let the set of solutions x:= (S,I,V) of BFM for each s be:

    X(s) := { x C2m+n+1 | x satisfies (2)}

    For convenience we include V0 in the vector variable V :=(Vj,j N

    +) with the understanding that V0:=10 is fixed.

    C. Equivalence

    Even though the bus injection model (1) and the branch flow

    model (2) are defined by different sets of equations in terms

    of their own variables, both are models of the Kirchhoffs laws

    and therefore must be related. We now clarify the precise sense

    in which these two mathematical models are equivalent. We saytwo sets A and B are equivalent, denoted by AB, if there is

    a bijection between them [58].

    Theorem 1: V(s) X(s) for any power injections s.

    Remark 2: Two models.Given the bijection between the so-

    lution sets V(s)and X(s)any result in one model is in principlederivable in the other. Some results however are much easier to

    state or derive in one model than the other. For instance BIM

    allows a much cleaner formulation of the semidefinite program

    (SDP) relaxation. BFM for radial networks has a convenient

    recursive structure that allows a more efficient computation of

    power flows and leads to a useful linear approximation of BFM;

    see Section VI. The sufficient condition for exact relaxation in[56] provides intricate insights on power flows that are hard

    to formulate or prove in BIM. Finally, since BFM directly

    models branch flows Sjkand currents Ijk, it is easier to use for

    some applications. We will therefore freely use either model

    depending on which is more convenient for the problem at

    hand.

    III. OPTIMAL POWER FLOW

    A. Bus injection model

    As mentioned in Remark 1 an optimal power flow problem

    optimizes both variables V and s over the solution set of the

    BIM (1). In addition all voltage magnitudes must satisfy:

    vj |Vj|2 vj, j N

    + (3)

    where vj and vj are given lower and upper bounds on voltage

    magnitudes. Throughout this paper we assume vj>0 to avoidtriviality. The power injections are also constrained:

    sj

    sj sj, j N+ (4)

    where sj and sj are given bounds on the injections at buses j.

    Remark 3: OPF constraints.If there is no bound on the load

    or on the generation at bus j then sj= i or sj= + irespectively. On the other hand (4) also allows the case where

    sj is fixed (e.g. a constant-power load), by setting sj= sj to thespecified value. For the slack bus 0, unless otherwise specified,

    we always assume v0=v0=1 and s0= i, s0= + i.Therefore we sometimes replace jN+ in (3) and (4) by j N.

    We can eliminate the variables sj from the OPF formulation

    by combining (1) and (4) into

    sj k:(j,k)E

    yHjkVj(VHj V

    Hk ) sj, j N

    + (5)

    Then OPF in the bus injection model can be defined just in

    terms of the complex voltage vector V. Define

    V :={VCn+1 | V satisfies (3), (5)} (6)

    V is the feasible set of optimal power flow problems in BIM.

    Let the cost function be C(V). Typical costs include the costof generating real power at each generator bus or line loss over

    the network. All these costs can be expressed as functions of

    V. Then the problem of interest is:

    OPF:

    minV

    C(V) subject to VV (7)

    Since (5) is quadratic, V is generally a nonconvex set. OPF is

    thus a nonconvex problem and NP-hard to solve in general.

    B. Branch flow model

    Denote the variables in the branch flow model (2) by x:=(S,I,V,s) C2(m+n+1). We can also eliminate the variables sjas for the bus injection model by combining (2a) and (4) but it

    will prove convenient to retain s:= (sj,j N+) as part of the

    variables. Define the feasible set in the branch flow model:

    X :={ x C2(m+n+1) | x satisfies (2), (3), (4)} (8)

    Let the cost function in the branch flow model be C( x). Thenthe optimal power flow problem in the branch flow model is:

    OPF:

    minx

    C( x) subject to x X (9)

    Since (2) is quadratic, X is generally a nonconvex set. OPF is

    thus a nonconvex problem and NP-hard to solve in general.

    Remark 4: OPF equivalence. By Theorem 1 there is a

    bijection between V and X. Throughout this paper we assume

    that the cost functions in BIM and BFM are equivalent under

    this bijection and we abuse notation to denote them by the

  • 7/24/2019 Low 201405 OPFTutorial2parts TCNS

    4/25

    IEEE TRANS. ON CONTROL OF NETWORK SYSTEMS, 1(1):1527, MARCH 2014 4

    same symbol C(). Then OPF (7) in BIM and (9) in BFM areequivalent.

    Remark 5: OPF variants. OPF as defined in (7) and (9) is

    a simplified version that ignores other important constraints

    such as line limits, security constraints, stability constraints,

    and chance constraints; see extensive surveys in [4], [5], [6],

    [7], [8], [9], [10], [11], [12], [13], [14], [15], [59], [60] and

    a recent discussion in [61] on real-life OPF problems. Someof these can be incorporated without any change to the results

    in this paper (e.g. see [57], [62] for models that include shunt

    elements and line limits). Indeed a shunt element yj at bus j

    can be easily included in BIM by modifying (1) into:

    sj = k:jk

    yHjk Vj(VHj V

    Hk ) +y

    Hj|Vj|

    2

    or included in BFM by modifying (2a) into:

    k:jk

    Sjk+yHj |Vj|

    2 = i:ij

    Si j zi j|Ii j|

    2

    + sj

    C. OPF as QCQP

    Before we describe convex relaxations of OPF we first show

    that, whenC(V) := VHCVis quadratic inVfor some Hermitianmatrix C, OPF is indeed a quadratically constrained quadratic

    program (QCQP) by converting it into the standard form. We

    will use the derivation in [62] for OPF (7) in BIM. OPF (9) in

    BFM can similarly be converted into a standard form QCQP.

    Define the (n + 1) (n + 1) admittance matrix Y by

    Yi j =

    k:ki

    yik, if i= j

    yi j, if i 6= j and i j

    0 otherwise

    Y is symmetric but not necessarily Hermitian. Let Ij be the net

    injection current from bus j to the rest of the network. Then

    the current vector Iand the voltage vector Vare related by the

    Ohms law I= YV. BIM (1) is equivalent to:

    sj = VjIHj = (e

    HjV)(I

    Hej)

    where ej is the (n + 1)-dimensional vector with 1 in the jthentry and 0 elsewhere. Hence, since I= YV, we have

    sj = tr

    eHjV VHYHej

    = tr

    YHeje

    Hj

    VVH

    = VHYHj V

    where Yj:= ej eHjYis an(n+1)(n+1)matrix with its jth row

    equal to the jth row of the admittance matrix Y and all other

    rows equal to the zero vector. Yj is in general not Hermitian

    so that VHYHj V is in general a complex number. Its real and

    imaginary parts can be expressed in terms of the Hermitian and

    skew Hermitian components ofYHj defined as:

    j:=1

    2

    YHj +Yj

    and j:=

    1

    2i

    YHj Yj

    Then

    Resj= VHjV and Imsj= V

    HjV

    Let their upper and lower bounds be denoted by

    pj

    :=Resj and pj:=Resj

    qj

    :=Resj and qj:=Resj

    Let Jj:=ejeHj denote the Hermitian matrix with a single 1 in

    the(j,j)th entry and 0 everywhere else. Then OPF (7) can bewritten as a standard form QCQP:

    minVCn+1

    VHCV (10a)

    s.t. VHjV pj, VH(j)V pj (10b)

    VHjV qj, VH(j)V qj (10c)

    VHJjV vj, VH(Jj)V vj (10d)

    where j N+ in (10).

    IV. FEASIBLE SETS AND RELAXATIONS: BIM

    In this and the next section we derive semidefinite relaxations

    of OPF and clarify their relations. The cost function Cof OPF

    is usually assumed to be convex in its variables. The difficultyof OPF formulated here thus arises from the nonconvex feasible

    sets V for BIM and X for BFM. The basic approach to deriving

    convex relaxations of OPF is to design convex supersets of

    (equivalent sets of) V or X and minimize the same cost function

    over these supersets. Different choices of convex supersets lead

    to different relaxations, but they all provide a lower bound to

    OPF. If every optimal solution of a convex relaxation happens

    to lie in V or X then it is also feasible and hence optimal for

    the original OPF. In this case we say the realxation is exact.

    In this section we present three characterizations of the

    feasible set V in BIM. These characterizations naturally suggest

    convex supersets and semidefinite relaxations of OPF, and we

    prove equivalence relations among them. In the next sectionwe treat BFM. In Part II of the paper we discuss sufficient

    conditions that guaranteed exact relaxations.

    A. Preliminaries

    Since OPF is a nonconvex QCQP there is a standard

    semidefinite relaxation through the equivalence relation: for any

    Hermitian matrix M, VHMV= tr MVVH = tr MW for a psdrank-1 matrix W. Applying this transformation to the QCQP

    formulation (10) leads to an equivalent problem of the form:

    minWSn+1

    trCW

    s.t. tr ClW b

    l, W 0, rankW= 1

    for appropriate Hermitian matrices Cl and real numbers bl .

    This problem is equivalent to (10) because given a psd rank-

    1 solution W, a unique solution V of (10) can be recovered

    through rank-1 factorization W =V VH. Unlike (10) whichis quadratic in V this problem is convex in W except the

    nonconvex rank-1 constraint. Removing the rank-1 constraint

    yields the standard SDP relaxation.

    We now generalize this intuition to characterize the feasible

    set V in (6) in terms of partial matrices. These characterizations

    lead naturally to SDP, chordal, and second-order cone program

    (SOCP) relaxations of OPF in BIM, as shown in [58], [28].

  • 7/24/2019 Low 201405 OPFTutorial2parts TCNS

    5/25

    IEEE TRANS. ON CONTROL OF NETWORK SYSTEMS, 1(1):1527, MARCH 2014 5

    We start with some basic definitions on partial matrices and

    their completions; see e.g. [63], [64], [65] for more details. Fix

    any connected undirected graph Fwith n vertices and m edges

    connecting distinct vertices.1 A partial matrix WF is a set of

    2m + n complex numbers defined on F:

    WF := {[WF]j j, [WF]jk, [WF]k j |nodes j and edges (j,k) of F}

    WFcan be interpreted as a matrix with entries partially specifiedby these complex numbers. IfFis a complete graph (in which

    there is an edge between every pair of vertices) then WF is a

    fully specified nn matrix. Acompletion W ofWF is any fully

    specified nn matrix that agrees with WFon graph F, i.e.,

    [W]j j = [WF]j j, [W]jk = [WF]jk for j,(j,k) F

    Given an nn matrixWwe useWFto denote thesubmatrix of

    W on F,i.e., the partial matrix consisting of the entries ofWde-

    fined on graphF. Ifq is a clique (a fully connected subgraph) of

    F then letWF(q) denote the fully-specified principal submatrixof WF defined on q. We extend the definitions of Hermitian,

    psd, and rank-1 for matrices to partial matrices, as follows.

    A partial matrix WF is Hermitian, denoted by WF=WHF , if[WF]jk= [WF]

    Hk j for all (j,k) F; it is psd, denoted byWF 0,

    ifWFis Hermitian and the principal submatrices WF(q)are psdfor all cliquesq ofF; it isrank-1, denoted by rankWF= 1, if theprincipal submatrices WF(q) are rank-1 for all cliques q ofF.We sayWF is 22psd (rank-1), denoted byWF(j,k) 0 (rankWF(j,k) = 1) if, for all edges (j,k) F, the 2 2 principalsubmatrices

    [WF](j,k) :=

    [WF]j j [WF]jk[WF]k j [WF]kk

    are psd (rank-1).F is a chordal graphif eitherFhas no cycle or

    all its minimal cycles (ones without chords) are of length three.

    A chordal extension c(F) ofF is a chordal graph that containsF, i.e.,c(F)has the same vertex set asFbut an edge set that isa superset ofFs edge set. In that case we call the partial matrix

    Wc(F) a chordal extensionof the partial matrixWF. Every graph

    Fhas a chordal extension, generally nonunique. In particular a

    complete supergraph of Fis a trivial chordal extension ofF.

    For our purposes chordal graphs are important because of

    the result [63, Theorem 7] that every psd partial matrix has a

    psd completion if and only if the underlying graph is chordal.

    When a positive definite completion exists, there is a unique

    positive definite completion, in the class of all positive definite

    completions, whose determinant is maximal. Theorem 2 below

    extends this to rank-1 partial matrices.

    B. Feasible sets

    We can now characterize the feasible set V of OPF defined

    in (6). Recall the undirected connected graph G = (N+,E) thatmodels a power network. Given a voltage vector V V define

    a partial matrix WG:= WG(V): for j N+ and (j,k) E,

    [WG]j j := |Vj|2 (11a)

    [WG]jk := VjVHk =: [WG]

    Hk j (11b)

    1In this subsection we abuse notation and use n,m to denote generalintegers unrelated to the number of buses or lines in a power network.

    Then the constraints (5) and (3) imply that the partial matrix

    WG satisfies 2

    sj k:(j,k)E

    yHjk

    [WG]j j [WG]jk sj, j N

    + (12a)

    vj [WG]j j vj, j N+ (12b)

    Following Section III-C these constraints can also be written

    in a (partial) matrix form as:

    pj tr jWG pj

    qj tr jWG qj

    vj tr JjWG vj

    The converse is not always true: given a partial matrix WGthat satisfies (12) it is not always possible to recover a voltage

    vector V in V. Indeed this is possible if and only if WG has

    a completion W that is psd rank-1, because in that case W

    satisfies (12) since yjk=0 if(j,k) 6Eand it can be uniquelyfactored as W= VVH with V V. We hence seek conditionsadditional to (12) on the partial matrix WG that guarantee that

    it has a psd rank-1 completion W from which V V can berecovered. Our first key result provides such a characterization.

    We say that a partial matrix WG satisfies the cycle condition

    if for every cycle c in G

    (j,k)c

    Wjk = 0 mod 2 (13)

    When Wjkrepresent voltage phase differences across each line

    then the cycle condition imposes that they sum to zero (mod 2)

    around any cycle. The next theorem, proved in [58, Theorem

    3] and [28], implies that WG has a psd rank-1 completion W if

    and only ifWG is 22 psd rank-1 on G and satisfies the cycle

    condition (13), if and only if it has a chordal extension Wc(G)

    that is psd rank-1. 3Consider the following conditions on (n + 1) (n + 1) ma-

    trices W and partial matrices Wc(G) andWG:

    W 0, rankW= 1 (14)

    Wc(G) 0, rankWc(G)=1 (15)

    WG(j,k) 0, rankWG(j,k) =1, (j,k) E, (16)

    Theorem 2: Fix a graph G on n + 1 nodes and any chordalextension c(G) of G. Assuming Wj j >0,

    Wc(G)

    j j> 0 and

    [WG]j j > 0, j N+, we have:

    (1) Given an (n + 1) (n + 1) matrix W that satisfies (14),its submatrix Wc(G) satisfies (15).

    (2) Given a partial matrix Wc(G) that satisfies (15), its sub-matrix WG satisfies (16) and the cycle condition (13).

    (3) Given a partial matrix WG that satisfies (16) and the

    cycle condition (13), there is a completionW ofWG that

    satisfies (14).

    2The constraint (12a) can also be written compactly in terms of theadmittance matrix Yas in [66]:

    s diag

    WYH s

    3The theorem also holds with psd replaced by negative semidefinite.

  • 7/24/2019 Low 201405 OPFTutorial2parts TCNS

    6/25

    IEEE TRANS. ON CONTROL OF NETWORK SYSTEMS, 1(1):1527, MARCH 2014 6

    Informally Theorem 2 says that (14) is equivalent to (15) is

    equivalent to (16)+(13). It characterizes a property of the fullmatrix W (rankW= 1) in terms of its submatrices Wc(G) andWG. This is important because the submatrices are typically

    much smaller than W for large sparse networks and much

    easier to compute. The theorem thus allows us to solve simpler

    problems in terms of partial matrices as we now explain.

    Define the set of Hermitian matrices:

    W := {W Sn+1 | W satisfies (12),(14)} (17)

    Fix any chordal extension c(G) of G and define the set ofHermitian partial matrices Wc(G):

    Wc(G) := {Wc(G) | Wc(G) satisfies (12),(15)} (18)

    Finally define the set of Hermitian partial matrices WG:

    WG := {WG | WG satisfies (12), (13),(16)} (19)

    Note that the definition of psd for partial matrices implies that

    Wc(G) and WG are Hermitian. The assumption vj >0,j N+

    implies that all matrices or partial matrices have strictly positive

    diagonal entries.

    Theorem 2 implies that given a partial matrix Wc(G) Wc(G)or a partial matrix WG WG there is a psd rank-1 completion

    WWfrom which a solution VVof OPF can be recovered.

    In fact we know more: given any Hermitian partial matrix WG(not necessarily in WG), the set of all completions ofWG that

    satisfies the condition in Theorem 2(3) consists of a single

    psd rank-1 matrix and infinitely many indefinite non-rank-1

    matrices; see [58, Theorems 5 and 8] and discussions therein.

    Hence the psd rank-1 completion W of a WG WG is unique.

    Corollary 3: Given a partial matrix Wc(G) Wc(G) or WG

    WG there is a unique psd rank-1 completion WW.

    Recall that two sets A and B are equivalent (A B) if thereis a bijection between them. Even though W,Wc(G),WG are

    different kinds of spaces Theorem 2 and Corollary 3 imply

    that they are all equivalent to the feasible set of OPF.

    Theorem 4: VWWc(G) WG.

    Theorem 4 suggests three equivalent problems to OPF. We

    assume the cost function C(V) in OPF depends on V onlythrough the partial matrix WGdefined in (11). For example if the

    cost is total real line loss in the network then C(V) =j Re sj=

    jk:(j,k)ERe

    [WG]j j [WG]jk

    yHjk. If the cost is a weighted

    sum of real generation power then C(V) = j

    cj Re sj+p

    dj

    where pdj

    are the given real power demands at buses j; again

    C(V) is a function of the partial matrix WG. Then Theorem 4implies that OPF (7) is equivalent to

    minW

    C(WG) s.t. W W (20)

    where W is any one of the sets W,Wc(G),WG. Specifically,

    given an optimal solution Wopt in W, it can be uniquely

    decomposed into Wopt =Vopt(Vopt)H. Then Vopt is in V andan optimal solution of OPF (7). Alternatively given an optimal

    solutionWopt

    F in Wc(G) or WG, Corollary 3 guarantees thatWopt

    F

    has a unique psd rank-1 completion Wopt in W from which an

    optimalVoptV can be recovered. In fact given a partial matrix

    WG WG (or Wc(G) Wc(G)) there is a more direct construction

    of a feasible solution VV of OPF than through its completion;

    see Section IV-D.

    C. Semidefinite relaxations

    Hence solving OPF (7) is equivalent to solving (20) over

    any ofW,Wc(G)

    ,WG

    for an appropriate matrix variable. The

    difficulty with solving (20) is that the feasible sets W, Wc(G),

    and WG are still nonconvex due to the rank-1 constraints and

    the cycle condition (13). Their removal leads to SDP, chordal,

    and SOCP relaxations of OPF respectively.

    Relax W, Wc(G) and WG to the following convex supersets:

    W+ := {W Sn+1 |WG satisfies (12),W 0}

    W+c(G) := {Wc(G) |WG satisfies (12),Wc(G) 0}

    W+G := {WG |WG satisfies (12),WG(j,k) 0, (j,k) E}

    Define the problems:

    OPF-sdp:

    minW

    C(WG) s.t. WW+ (21)

    OPF-ch:

    minWc(G)

    C(WG) s.t. Wc(G) W+c(G) (22)

    OPF-socp:

    minWG

    C(WG) s.t. WG W+G (23)

    The condition WG(j,k) 0 in the definition ofW+G is equivalent

    to [WG]jk= [WG]Hk j and (recall the assumption vj > 0,j N

    +)

    [WG]j j>0, [WG]kk> 0, [WG]j j[WG]kk [WG]jk2

    This is a second-order cone and hence OPF-socp is indeed an

    SOCP in the rotated form.

    Remark 6: Literature.SOCP relaxation for OPF seems to be

    first proposed in [22] for the bus injection model (1), and in

    [51], [57] for the branch flow model (2) as explained in the next

    section. By defining a new set of variables vj :=|Vj|2, Rjk:=

    |Vj||Vk|cos(jk), andIjk:= |Vj||Vk|sin(jk)where j:=Vj , [22] rewrites the bus injection model (1) in the complex

    domain as a set of linear equations in these new variables in

    the real domain and the following quadratic equations:

    vjvk = R2jk+I

    2jk

    Relaxing these equalities to vjvk R2jk+I2jk enlarges the so-lution set to a second-order cone that is equivalent to W+G in

    this paper. SDP relaxation is first proposed in [23] for the bus

    injection model and analyzed in [24]. Chordal relaxation for

    OPF is first proposed in [25], [26] and analyzed in [27], [28].

    D. Solution recovery

    When the convex relaxations OPF-sdp, OPF-ch, OPF-socp

    are exact, i.e., if their optimal solutions Wsdp, Wchch, Wsocp

    G

    happen to lie in W, Wc(G), WG respectively, then an optimal

    solutionVopt of the original OPF can be recovered from these

  • 7/24/2019 Low 201405 OPFTutorial2parts TCNS

    7/25

    IEEE TRANS. ON CONTROL OF NETWORK SYSTEMS, 1(1):1527, MARCH 2014 7

    solutions. Indeed the recovery method works not just for an

    optimal solution, but any feasible solution that lies in W, Wc(G)or WG. Moreover, given a W W or a Wc(G) Wc(G), the

    construction of V depends on W or Wc(G) only through their

    submatrix WG. We hence describe the method for recovering

    the uniqueVfrom aWG, which may be a partial matrix in WGor the submatrix of a (partial) matrix in W or Wc(G).

    LetTbe an arbitrary spanning tree ofG rooted at bus 0. LetPj denote the unique path from node 0 to node j in T. Recall

    thatV0=10 without loss of generality. For i=1, . . . , n, let

    |Vi| :=q

    [WG]ii

    Vi := (j,k)Pi

    [WG]jk

    Then it can be checked that V is in (6) and feasible for OPF.

    E. Tightness of relaxations

    Since WW+, Wc(G) W+c(G)

    , WG W+G , the relaxations

    OPF-sdp, OPF-ch, OPF-socp all provide lower bounds on OPF

    (7) in light of Theorem 4. How do these relaxations comparein terms of computational efficiency and tightness?

    OPF-socp is the simplest computationally. OPF-ch usually

    requires more computation than OPF-socp but much less than

    OPF-sdp for large sparse networks (even though OPF-ch can

    be as complex as OPF-sdp in the worse case [64], [65]). The

    relative tightness of the relaxations depends on the network

    topology. For a general mesh network OPF-sdp is as tight a

    relaxation as OPF-ch and they are strictly tighter than OPF-

    socp. For a tree (radial) network the hierarchy collapses and

    all three are equally tight. We now make this precise.

    Consider their feasible sets W+, W+c(G)

    and W+G . We say that

    a set A is an effective subsetof a set B, denoted by A vB, if,

    given a (partial) matrix a A, there is a (partial) matrix b B

    that has the same cost C(a) = C(b). We say A is similar to B,denoted by A 'B, ifA vBand B vA. Note that A Bimplies

    A'Bbut the converse may not be true. The feasible set of OPF

    (7) is an effective subset of the feasible sets of the relaxations;

    moreover these relaxations have similar feasible sets when the

    network is radial. This is a slightly different formulation of the

    same results in [58], [28].

    Theorem 5: V vW+ 'W+c(G) vW

    +G . If G is a tree then

    VvW+ 'W+c(G) 'W

    +G .

    Let Copt,Csdp, Cch, Csocp be the optimal values of OPF (7),

    OPF-sdp (21), OPF-ch (22), OPF-socp (23) respectively. The-

    orem 4 and Theorem 5 directly implyCorollary 6: Copt Csdp =Cch Csocp. If G is a tree then

    Copt Csdp = Cch = Csocp.Remark 7: Tightness.Theorem 5 and Corollary 6 imply that

    for radial networks one should always solve OPF-socp since

    it is the tightest and the simplest relaxation of the three.

    For mesh networks there is a tradeoff between OPF-socp

    and OPF-ch/OPF-sdp: the latter is tighter but requires heavier

    computation. Between OPF-ch and OPF-sdp, OPF-ch is usually

    preferable as they are equally tight but OPF-ch is usually much

    faster to solve for large sparse networks. See [25], [26], [28],

    [27], [67] for numerical studies that compare these relaxations.

    F. Chordal relaxation

    Theorem 2 through Corollary 6 apply to any chordal exten-

    sion c(G) ofG. The choice ofc(G)does not affect the optimalvalue of the chordal relaxation but determines its complexity.

    Unfortunately the optimal choice that minimizes the complexity

    of OPF-ch is NP-hard to compute.

    This difficulty is due to two conflicting factors in choosing

    a c(G). Recall that the constraint Wc(G) 0 in the definitionof W+

    c(G) consists of multiple constraints that the principal

    submatricesWc(G)(q) 0, one for each (maximal) clique q ofc(G). When two cliques q and q0 share a node their submatricesWc(G)(q) and Wc(G)(q

    0) share entries that must be decoupledby introducing auxiliary variables and equality constraints on

    these variables. The choice ofc(G)determines the number andsizes of these submatrices Wc(G)(q) as well as the numbers ofauxiliary variables and additional decoupling constraints. On

    the one hand ifc(G)contains few cliques q then the submatricesWc(G)(q) tend to be large and expensive to compute (e.g. ifc(G) is the complete graph then there is a single clique, butWc(G)= Wand OPF-ch is identical to OPF-sdp). On the otherhand if c(G) contains many small cliques q then there tendsto be more overlap and chordal relaxation tends to require

    more decoupling constraints. Hence choosing a good chordal

    extension c(G) ofG is important but nontrivial. See [64], [65]and references therein for methods to compute efficient chordal

    relaxations of general QCQP. For OPF [27] proposes effective

    techniques to reduce the number of cliques in its chordal

    relaxation. To further reduce the problem size [67] proposes

    to carefully drop some of the decoupling constraints, though

    the resulting relaxation can be weaker.

    V. FEASIBLE SETS AND RELAXATIONS: BFM

    We now present an SOCP relaxation of OPF in BFM pro-posed in [51], [57] in two steps. We first relax the phase angles

    ofV andI in (2) and then we relax a set of quadratic equalities

    to inequalities. This derivation pinpoints the difference between

    radial and mesh topologies. It motivates a recursive version of

    BFM for radial networks (Section VI) and the use of phase

    shifters for convexification of mesh networks (Part II [30]).

    A. Feasible sets

    Consider the following set of equations in the variables x :=(S,`, v, s) in R3(m+n+1):4

    k:jk

    Sjk = i:ij

    (Si jzi j`i j) + sj , j N+ (24a)

    vj vk = 2 RezHjkSjk

    |zjk|

    2jk, j k E

    (24b)

    vj jk = |Sjk|2

    , j k E (24c)

    4The use of complex variables is only a shorthand and should be interpretedas operations in real variables. For instance (24a) is a shorthand for

    k:jk

    Pjk = i:ij

    Pi j ri j`i j

    +pj

    k:jk

    Qjk = i:ij

    Qi j xi j`i j

    + qj

  • 7/24/2019 Low 201405 OPFTutorial2parts TCNS

    8/25

    IEEE TRANS. ON CONTROL OF NETWORK SYSTEMS, 1(1):1527, MARCH 2014 8

    and define the solution set as:

    Xnc := {x R3(m+n+1) | x satisfies (3), (4), (24)}Note that the vector v includes v0 ands includes s0. The model

    (24) is first proposed in [45], [46]. 5 It can be derived as a

    relaxation of BFM (2) as follows. Taking the squared magnitude

    of (2c) and replacing |Vj|2 and|Ijk|

    2 by v j and jk respectively

    yield (24c). To obtain (24b), use (2b)(2c) to write Vk= VjzjkSjkV

    1j and take the squared magnitude on both sides to

    eliminate the phase angles ofV and I. These operations define

    a mappingh :C2(m+n+1)R3(m+n+1) by: for any x = (S,I, V, s),h( x):= (S, `, v, s) with jk=|Ijk|

    2 and vj=|Vj|2.

    Throughout this paper we assume the cost function C( x) inOPF (9) depends on x only through x:=h( x). For examplefor total real line lossC( x) =(j,k) ERezjk jk. If the cost is aweighted sum of real generation power then C( x) =j(cjpj+pdj ) where pj are the real parts ofsj and p

    dj are the given real

    power demands at buses j; again C( x) depends only on x.Then the model (24) is a relaxation of BFM (2) in the sense

    that the feasible set X of OPF in (9) is an effective subset

    ofXnc, X v Xnc, since h(X) Xnc. We now characterize thesubset ofXnc that is equivalent to X.

    Given an x:= (S, `, v, s) R3(m+n+1) define(x) Rm byjk(x) :=

    vjzHjkSjk

    , j k E (25)

    Even though x does not include phase angles ofV, x implies

    a phase difference across each line j k Egiven by jk(x).The subset ofXnc that is equivalent to X are those x for which

    there exists such that jk= jk(x). To state this preciselylet B be the mn(transposed) reduced incidence matrix ofG:

    Bl j =

    1 if edge l E leaves node j

    1 if edge l

    E enters node j

    0 otherwise

    where j N. Consider the set ofx Xnc such that that solves B = (x) mod 2 (26)

    i.e.,(x) is in the range space ofB (mod 2). A solution(x),if exists, is unique in (,]n. Define the set

    X := {x R3(m+n+1) | x satisfies (3), (4), (24), (26)}The following result characterizes the feasible set X of OPF in

    BFM and follows from [57, Theorems 2, 4].

    Theorem 7: XXXnc.The bijection between X and X is given by h defined above

    restricted to X. Its inverse h1(S, `, v, s) = (S,I, V, s) is definedon X in terms of(x) by:

    Vj :=

    vj eij(x)

    , j N (27a)Ijk :=

    pjk e

    i(j(x)Sjk), j k E (27b)

    The condition (26) is equivalent to the cycle condition (13)

    in the bus injection model. To see this fix any spanning tree

    5The original model, called the DistFlow equations, in [45], [46] is forradial (distribution) networks, but its extension here to mesh networks is trivial.

    T= (N,ET) of the (directed) graph G. We can assume withoutloss of generality (possibly after re-labeling the links) that ETconsists of links l=1, . . . , n. Then B can be partitioned into

    B =

    BTB

    where thennsubmatrixBTcorresponds to links in Tand the(mn)n submatrix B corresponds to links in T:=G\T.Similarly partition (x) into

    (x) =

    T(x)(x)

    The next result, proved in [57, Theorems 2 and 4], provides a

    more explicit characterization of (26) in terms of(x). Whenit holds this characterization has the same interpretation of the

    cycle condition in (13): the voltage angle differences implied

    byx sum to zero (mod 2) around any cycle. Formally let bethe extension offrom directed to undirected links: for each

    j k E let jk(x):=jk(x) and k j(x):= jk(x). We sayc := (j1, . . . , jK) is an undirectedcycle if, for each k= 1, . . . , K,

    either jk jk+1 E or jk+1 jk E with the interpretationthat jK+1:= j1; (jk,jk+1) c denotes one of these links.

    Theorem 8: An x Xnc satisfies (26) if and only if aroundeach undirected cycle c we have

    (j,k)c

    jk(x) = 0 mod 2 (28)

    In that case (x) = PB1T T(x)

    is the unique solution of

    (26) in (,]n, where P() projects to (,]n.Theorem 8 determines when the voltage magnitudes v of a

    given x can be assigned phase angles (x) so that the resultingx:=h1(x) is a power flow solution in X.

    B. SOCP relaxation

    The set Xnc that contains the (equivalent) feasible set X of

    OPF is still nonconvex because of the quadratic equalities in

    (24c). Relax them to inequalities:

    vj jk |Sjk|2, (j, k) E (29)and define the set:

    X+ :={x R3(m+n+1) | x satisfies (3), (4), (24a), (24b), (29)}

    Clearly X h X Xnc X+; see Figure 1. Moreover X+ is a

    second-order cone in the rotated form.

    The three sets X, Xnc, X+ define the following problems:OPF:

    minx

    C(x) subject to x X (30)OPF-nc:

    minx

    C(x) subject to x Xnc (31)OPF-socp:

    minx

    C(x) subject to x X+ (32)The next theorem follows from the results in [57] and implies

  • 7/24/2019 Low 201405 OPFTutorial2parts TCNS

    9/25

    IEEE TRANS. ON CONTROL OF NETWORK SYSTEMS, 1(1):1527, MARCH 2014 9

    h

    h!1

    C2(m+n+1)

    R3(m+n+1)

    Xnc

    !X X

    X+

    Fig. 1: Feasible sets X of OPF (9) in BFM, its equivalent set

    X (defined by h) and its relaxations Xnc and X+. IfG is a tree

    then X =Xnc.

    that OPF (9) is equivalent to minimization over X and OPF-

    socp is its SOCP relaxation. Moreover for radial networks

    voltage and current angles can be ignored and OPF (9) is

    equivalent to OPF-nc.

    Theorem 9: XXXnc X+. IfGis a tree then XX=

    Xnc X+

    .Let Copt be the optimal cost of OPF (9) in the branch

    flow model. Let Copf, Cnc, Csocp be the optimal costs of OPF

    (30), OPF-nc (31), OPF-socp (32) respectively defined above.

    Theorem 9 implies

    Corollary 10: Copt = CopfCnc Csocp. IfG is a tree thenCopt = Copf = Cnc Csocp.

    Remark 8: SOCP relaxation. Suppose one solves OPF-socp

    and obtains an optimal solution xsocp := (S,`, v, s) X+. Forradial networks if xsocp attains equality in (29) then xsocp

    Xnc and Theorem 9 implies that an optimal solution xopt :=

    (S,I,V, s) X of OPF (9) can be recovered from xsocp. Indeedxopt =h1(xsocp) where h1 is defined in (27). Alternatively

    one can use the angle recovery algorithms in [57, Part I] torecover xopt. For mesh networks xsocp needs to both attain

    equality in (29) and satisfy the cycle condition (26) in order

    for an optimal solution xopt to be recoverable. Our experience

    with various practical test networks suggests that xsocp usually

    attains equality in (29) but, for mesh networks, rarely satisfes

    (26) [51], [57], [56], [28]. Hence OPF-socp is effective for

    radial networks but not for mesh networks (in both BIM and

    BFM).

    C. Equivalence

    Theorem 9 establishes a bijection between X and the feasible

    setX

    of OPF (9) in BFM. Theorem 4 establishes a bijectionbetween WG and the feasible set V of OPF (7) in BIM.

    Theorem 1 hence implies that X X V WG. Moreover

    their SOCP relaxations are equivalent in these two models [58],

    [28]. Define the set of partial matrices defined on G that are

    22 psd rank-1 but do not satisfy the cycle condition (13):

    Wnc := {Wnc | WG satisfies (12),WG(j,k) 0,

    rankWG(j, k) =1 for all (j, k) E}

    ClearlyWG Wnc W+G in general and WG=Wnc W

    +G for

    radial networks.

    Theorem 11: XWG, Xnc Wnc and X+ W

    +G .

    The bijection between X+ and W+G is defined as follows. Let

    WGC2m+n+1 denote the set of Hermitian partial matrices (in-

    cluding[WG]00=v0 which is given). Let x:= (S,`,v, s) denotevectors in R3(m+n+1). Define the linear mapping g : WG X

    +

    by x=g(WG) where

    Sjk := yHjk

    [WG]j j [WG]jk

    , j k

    `jk := |yjk|

    2 [WG]j j+ [WG]kk [WG]jk [WG]k j

    , j k

    vj := [WG]j j, j N+

    sj := k:jk

    yHjk

    [WG]j j [WG]jk

    , j N+

    Its inverse g1 :X+ WG isWG=g1(x) where[WG]j j:=vj

    for j N+ and [WG]jk:=vjzHjkSjk=:[WG]

    Hk j for j k. The

    mapping g and its inverse g1 restricted to WG (Wnc) and X(Xnc) define the bijection between them.

    VI. BFM FOR RADIAL NETWORKS

    Theorem 9 implies that for radial networks the model (24) is

    exact. This is because the reduced incident matrix B in (26) is

    nn and invertible, so the cycle condition is always satisfied[57, Theorem 4]. Hence a solution in Xnc can be mapped to a

    branch flow solution in X by the mapping h1 defined in (27).

    For radial networks this model has two advantages: (i) it has

    a recursive structure that simplifies computation, and (ii) it has

    a linear approximation that provides simple bounds on branch

    powers Sjk and voltage magnitudes vj , as we now show.

    A. Recursive equations and graph orientation

    The model (24) holds for any graph orientation of G. It has

    a recursive structure when G is a tree. In that case different

    orientations have different boundary conditions that initialize

    the recursion and may be convenient for different applications.Without loss of generality we take bus 0 as the root of the

    tree. We discuss two different orientations: one where every

    link points away from bus 0 and the other where every link

    points towards bus 0. 6

    Case I: Links point away from bus 0. Model (24) reduces to:

    k:jk

    Sjk = Si jzi j`i j+ sj, j N+ (33a)

    vjvk = 2 RezHjkSjk

    |zjk|

    2jk, j k E (33b)

    vj jk = |Sjk|2, j k E (33c)

    where bus i in (33a) denotes the unique parent of node j (on

    the unique path from node 0 to node j), with the understandingthat if j=0 then Si0:=0 and `i0:=0. Similarly when j is aleaf node7 all Sjk= 0 in (33a). The model (33) is called theDistFlowequations and first proposed in [45], [46].

    Its recursive structure is exploited in [47] to analyze the

    power flow solutions given an (sj,j N), as we now explainusing the special case of a linear network with n + 1 busesthat represents a main feeder. To simplify notation denote

    6An alternative model is to use an undirected graph and, for each link(j,k),the variables (Sjk, jk) and (Sk j,`k j) are defined for both directions, with theadditional equations Sjk+ Sk j = zjk jk and `k j= `jk.

    7A node j is a leafnode if there exists no i such that i j E.

  • 7/24/2019 Low 201405 OPFTutorial2parts TCNS

    10/25

    IEEE TRANS. ON CONTROL OF NETWORK SYSTEMS, 1(1):1527, MARCH 2014 10

    Sj(j+1), j(j+1)

    and zj(j+1) by (Sj , j) and zj respectively.

    Then the DistFlow equations (33) reduce to (v0 is given):

    Sj+1 = Sjzj j+ sj+1, j=0, . . . , n1 (34a)

    vj+1 = vj2Re(zHj Sj) |zj|

    2j , j=0, . . . , n1 (34b)

    vj j = |Sj|2

    , j=0, . . . , n1 (34c)

    S0 = s0, Sn = 0 (34d)

    Let xj:= (Sj , j , vj), j N+. If s0 were known then one can

    start with(v0, s0) and use the recursion (34a)(34c) to computexj in terms ofs0= S0, i.e., (34) can be collapsed into functionsof the scalar variable s0 (recall that (sj ,j N) are given):

    xj = fj(s0), j N+ (35)

    Use the boundary condition (34d) Sn = fn(s0) = 0 to solvefor the scalar variable s0. The other variables xj can then

    be computed from (35). This method can be extended to

    a general radial network with laterals [47]. See also [68],

    [69] for techniques for solving the nonlinear equations (35),

    and [54], [55] for a different recursive approach called the

    forward/backward sweep for radial networks.Case II: Links point towards bus 0. Model (24) reduces to:

    Sji = k:kj

    Sk j zk jk j

    + sj , j N

    + (36a)

    vk vj = 2 RezHk j

    Sk j |zk j|

    2 k j , k j E (36b)

    vkk j = |Sk j|2

    , k j E (36c)

    where i in (36a) denotes the node on the unique path between

    node 0 and node j. The boundary condition is defined by Sji= 0in (36a) when j=0 and Sk j= 0, `k j= 0 in (36a) when j is aleaf node. An advantage of this orientation is illustrated in the

    next subsection in proving a simple bound on vj.

    B. Linear approximation and bounds

    By setting jk= 0 in (33) we obtain a linear approximationof the the branch flow model, with the graph orientation where

    all links point away from bus 0:

    k:jk

    Slinjk = Slini j + sj , j N

    + (37a)

    vlinj vlink = 2 Re

    zHjkS

    linjk

    , j kE (37b)

    where bus i in (37a) denotes the unique parent of bus j. The

    boundary condition is:Slini0 := 0 in (37a) when j = 0, andSlinjk= 0

    in (37a) when j is a leaf node. This is called the simplifiedDistFlow equationsin [46], [70]. It is a good approximation of

    (33) because the loss zjk jkis typically much smaller than the

    branch power flow Sjk.

    The next result provides simple bounds on (S, v) in terms oftheir linear approximations(Slin, vlin). Denote by Tj the subtreerooted at bus j, including j. We write k Tj to mean node

    kofTj and (k, l) Tj to mean edge(k, l) ofTj. Denote byPk the set of links on the unique path from bus 0 to bus k.

    Lemma 12: Fix any v0 and s R2(n+1). Let (S, `, v) and

    (Slin, vlin) be solutions of (33) and (37) respectively with thegiven v0 and s. Then

    (1) For i j E

    Slini j = kTj

    sk

    Si j = kTj

    sk +

    zi j`i j+

    (k,l)Tj

    zkl `kl

    (2) For i j E, Si j Slini j with equality if only if `i j andall `kl in Tj are zero.

    (3) For j N+

    vlinj = v0 (i,k)Pj

    2 Re

    zHikSlinik

    vj = v0 (i,k)Pj

    2 Re

    zHikSik

    |zik|

    2`ik

    (4) For j N+, vj vlinj .

    Lemma 12 says that the power flow Si j on line (i,j) equalsthe total load kTjsk in the subtree rooted at node j

    plus the total line loss in supplying these loads. The linear

    approximation Slini j neglects the line loss and underestimates

    the required power to supply these loads.

    Lemma 12(1)(3) can be easily proved by recursing on

    (33a)(33b) and (37). Since Si j Slini j but|zik|

    2`ik 0, a direct

    proof of Lemma 12(4) is not obvious. Instead, one can make

    use of Lemma 13 below and define a bijection between the

    solutions (S, `, v) of (33) and the solutions (S, ,v) of (36) inwhich v= v. It can be checked that the solutions of (37) andthose of (38) are related by Slin = Slin and vlin = vlin. ThenLemma 13(4) implies Lemma 12(4).

    A linear approximation of (36) is (setting k j= 0):

    Slinji = k:kj

    Slink j + sj , j N+ (38a)

    vlink vlinj = 2 Re

    zHk j

    Slink j

    , k j E (38b)

    Lemma 13: Fix any v0 and s R2(n+1). Let (S, ,v) and

    (Slin,vlin) be solutions of (36) and (38) respectively with thegiven v0 and s. Then

    (1) For all j i E

    Slinji = kTj

    sk

    Sji = kTj

    sk (k,l)Tj

    zklkl

    (2) For all j i E, Sji Slinji with equality if and only ifall `kl in Tj are zero.

    (3) For j N+

    vlinj = v0 + (i,k)Pj

    2 Re

    zHikSlinik

    vj = v0 + (i,k)Pj

    2 Re

    zHik

    Sik |zik|

    2 ik

    (4) For j N+, vj vlinj .

    Lemma 13 says that the branch power Sji (towards bus 0)

    equals the total injection kTjsk in the subtree rooted at bus

  • 7/24/2019 Low 201405 OPFTutorial2parts TCNS

    11/25

    IEEE TRANS. ON CONTROL OF NETWORK SYSTEMS, 1(1):1527, MARCH 2014 11

    j minus the line loss in that subtree. The linear approximation

    Slinji neglects the loss and hence overestimates the branch power

    flow. It can be proved by recursing on (36a)(36b) and (38).

    Remark 9: Bounds for SOCP relaxation.Lemmas 12 and 13

    do not depend on the quadratic equalities (33c) and (36c) as

    long as jk 0. In particular the lemmas hold if the equalities

    have been relaxed to inequalities vj jk |Sjk|2. These bounds

    are used in [56] to prove a sufficient condition for exact SOCPrelaxation for radial networks.

    Remark 10: Linear approximations.For radial networks the

    linear approximations (37) and (38) of BFM have two ad-

    vantages over the (linear) DC approximation of BIM. First

    they have a simple recursive structure that leads to simple

    bounds on power flow quantities. Second DC approximation

    assumes a lossless network (rjk= 0), fixes voltage magnitudes,

    and ignores reactive power, whereas (37) and (38) do not.

    This is important for distribution systems where loss is much

    higher than in transmission systems, voltages can fluctuate

    significantly and reactive powers are used to regulate them.

    On the other hand (37) and (38) are applicable only for radial

    networks whereas DC approximation applies to mesh networksas well. See also [21] for a more accurate linearization of BIM

    that addresses the shortcomings of DC OPF.

    VII. CONCLUSION

    We have presented a bus injection model and a branch flow

    model, formulated several relaxations of OPF, and proved their

    relations. These results suggest a new approach to solving OPF

    summarized in Figure 2. For radial networks we recommend

    !"#$%&'(

    !"# %&*+,&-

    ./'&0/1

    &1'2'*/

    '&-34,&-

    516-7$8

    !"#$'9 !"#$%3(

    5

    WG

    socp

    5: ;/%9

  • 7/24/2019 Low 201405 OPFTutorial2parts TCNS

    12/25

    IEEE TRANS. ON CONTROL OF NETWORK SYSTEMS, 1(1):1527, MARCH 2014 12

    [28] Subhonmesh Bose, Steven H. Low, Mani Chandy, Thanchanok Teer-aratkul, and Babak Hassibi. Equivalent relaxations of optimal powerflow. Submitted for publication, 2013.

    [29] Z. Luo, W. Ma, A.M.C. So, Y Ye, and S. Zhang. Semidefinite relaxationof quadratic optimization problems. Signal Processing Magazine, IEEE,27(3):20 34, May 2010.

    [30] S. H. Low. Convex relaxation of optimal power flow, II: exactness. IEEETrans. on Control of Network Systems, 2014. to appear.

    [31] P. P. Varaiya, F. F. Wu, and J. W. Bialek. Smart operation of smart grid:

    Risk-limiting dispatch. Proceedings of the IEEE, 99(1):4057, January2011.

    [32] M. Vrakopoulou, M. Katsampani, K. Margellos, J. Lygeros, and G. An-dersson. Probabilistic security-constrained ac optimal power flow. InPowerTech (POWERTECH), 2013 IEEE Grenoble, pages 16, June 2013.

    [33] Russell Bent, Daniel Bienstock, and Michael Chertkov. Synchronization-aware and algorithm-efficient chance constrained optimal power flow. In

    IREP Symposium Bulk Power System Dynamics and Control (IREP),Rethymnon, Greece, August 2013.

    [34] B. H. Kim and R. Baldick. Coarse-grained distributed optimal powerflow. IEEE Trans. Power Syst., 12(2):932939, May 1997.

    [35] R. Baldick, B. H. Kim, C. Chase, and Y. Luo. A fast distributedimplementation of optimal power flow. IEEE Trans. Power Syst.,14(3):858864, August 1999.

    [36] G. Hug-Glanzmann and G. Andersson. Decentralized optimal power flowcontrol for overlapping areas in power systems. IEEE Trans. Power Syst.,24(1):327?336, February 2009.

    [37] F. J. Nogales, F. J. Prieto, and A. J. Conejo. A decomposition method-ology applied to the multi-area optimal power flow problem. Ann. Oper.

    Res., (120):99116, 2003.

    [38] Albert Lam, Baosen Zhang, and David N Tse. Distributed algorithms foroptimal power flow problem. In IEEE CDC, pages 430437, 2012.

    [39] M. Kraning, E. Chu, J. Lavaei, and S. Boyd. Dynamic network energymanagement via proximal message passing. Foundations and Trends inOptimization, 1(2):70122, 2013.

    [40] K. Turitsyn, P. Sulc, S. Backhaus, and M. Chertkov. Options for controlof reactive power by distributed photovoltaic generators. Proc. of the

    IEEE, 99(6):1063 1073, June 2011.

    [41] Albert Y.S. Lam, Baosen Zhang, Alejandro Domnguez-Garca, andDavid Tse. Optimal distributed voltage regulation in power distributionnetworks. arXiv, April 2012.

    [42] Dzung T. Phan. Lagrangian duality and branch-and-bound algorithms foroptimal power flow. Operations Research, 60(2):275285, March/April2012.

    [43] A. Gopalakrishnan, A.U. Raghunathan, D. Nikovski, and L.T. Biegler.Global optimization of optimal power flow using a branch amp; boundalgorithm. In Proc. of the 50th Annual Allerton Conference on Commu-nication, Control, and Computing (Allerton), pages 609616, Oct 2012.

    [44] C. Josz, J. Maeght, P. Panciatici, and J. Ch. Gilbert. Application of themoment-SOS approach to global optimization of the OPF problem. arXiv,November 2013.

    [45] M. E. Baran and F. F Wu. Optimal Capacitor Placement on radialdistribution systems. IEEE Trans. Power Delivery, 4(1):725734, 1989.

    [46] M. E Baran and F. F Wu. Optimal Sizing of Capacitors Placed on ARadial Distribution System. IEEE Trans. Power Delivery, 4(1):735743,1989.

    [47] H-D. Chiang and M. E. Baran. On the existence and uniqueness of loadflow solution for radial distribution power networks. IEEE Trans. Circuitsand Systems, 37(3):410416, March 1990.

    [48] Hsiao-Dong Chiang. A decoupled load flow method for distribution powernetworks: algorithms, analysis and convergence study. International

    Journal Electrical Power Energy Systems, 13(3):130138, June 1991.

    [49] R. Cespedes. New method for the analysis of distribution networks. IEEETrans. Power Del., 5(1):391396, January 1990.

    [50] A. G. Exposito and E. R. Ramos. Reliable load flow technique forradial distribution networks. IEEE Trans. Power Syst., 14(13):10631069,August 1999.

    [51] Masoud Farivar, Christopher R. Clarke, Steven H. Low, and K. ManiChandy. Inverter VAR control for distribution systems with renewables.In Proceedings of IEEE SmartGridComm Conference, October 2011.

    [52] Joshua A. Taylor and Franz S. Hover. Convex models of distributionsystem reconfiguration. IEEE Trans. Power Systems, 2012.

    [53] J. Lavaei, D. Tse, and B. Zhang. Geometry of power flows andoptimization in distribution networks. Power Systems, IEEE Transactionson, PP(99):112, 2013.

    [54] W. H. Kersting.Distribution systems modeling and analysis. CRC, 2002.

    [55] D. Shirmohammadi, H. W. Hong, A. Semlyen, and G. X. Luo. Acompensation-based power flow method for weakly meshed distributionand transmission networks. IEEE Transactions on Power Systems,3(2):753762, May 1988.

    [56] Lingwen Gan, Na Li, Ufuk Topcu, and Steven H. Low. Exact convexrelaxation of optimal power flow in radial networks. IEEE Trans.

    Automatic Control, 2014.[57] Masoud Farivar and Steven H. Low. Branch flow model: relaxations and

    convexification (parts I, II). IEEE Trans. on Power Systems, 28(3):25542572, August 2013.

    [58] Subhonmesh Bose, Steven H. Low, and Mani Chandy. Equivalence ofbranch flow and bus injection models. In 50th Annual Allerton Conferenceon Communication, Control, and Computing, October 2012.

    [59] Deqiang Gan, Robert J. Thomas, and Ray D. Zimmerman. Stability-constrained optimal power flow. IEEE Trans. Power Systems, 15(2):535540, May 2000.

    [60] F. Capitanescua, J.L. Martinez Ramosb, P. Panciaticic, D. Kirschend,A. Marano Marcolinie, L. Platbroodf, and L. Wehenkelg. State-of-the-art,challenges, and future trends in security constrained optimal power flow.

    Electric Power Systems Research, 81(8):17311741, August 2011.[61] Brian Stott and Ongun Alsac. Optimal power flow: Basic requirements

    for real-life problems and their solutions. White paper, July 2012.[62] S. Bose, D. Gayme, K. M. Chandy, and S. H. Low. Quadratically

    constrained quadratic programs on acyclic graphs with application topower flow. arXiv:1203.5599v1, March 2012.

    [63] R. Grone, C. R. Johnson, E. M. Sa, and H. Wolkowicz. Positivedefinite completions of partial Hermitian matrices. Linear Algebra andits Applications, 58:109124, 1984.

    [64] Mituhiro Fukuda, Masakazu Kojima, Kazuo Murota, and KazuhideNakata. Exploiting sparsity in semidefinite programming via matrixcompletion I: General framework. SIAM Journal on Optimization,11:647674, 2001.

    [65] K. Nakata, K. Fujisawa, M. Fukuda, M. Kojima, and K. Murota.Exploiting sparsity in semidefinite programming via matrix completionII: Implementation and numerical results. Mathematical Programming,95(2):303327, 2003.

    [66] Baosen Zhang and David Tse. Geometry of the injection region of powernetworks. IEEE Trans. Power Systems, 28(2):788797, 2013.

    [67] Martin S. Andersen, Anders Hansson, and Lieven Vandenberghe.Reduced-complexity semidefinite relaxations of optimal power flow prob-lems. arXiv:1308.6718v1, August 2013.

    [68] R.D. Zimmerman and Hsiao-Dong Chiang. Fast decoupled powerflow for unbalanced radial distribution systems. Power Systems, IEEETransactions on, 10(4):20452052, 1995.

    [69] M.S. Srinivas. Distribution load flows: a brief review. In PowerEngineering Society Winter Meeting, 2000. IEEE, volume 2, pages 942945 vol.2, 2000.

    [70] M.E. Baran and F.F. Wu. Network reconfiguration in distribution systemsfor loss reduction and load balancing. IEEE Trans. on Power Delivery,4(2):14011407, Apr 1989.

    Steven H. Low (F08) is a Professor of the Comput-ing+Mathematical Sciences and Electrical Engineer-ing at Caltech, and a Changjiang Chair Professor ofZhejiang University. Before that, he was with AT&TBell Labs, Murray Hill, NJ, and the University of

    Melbourne, Australia. He was a co-recipient of IEEEbest paper awards, the R&D 100 Award, and anOkawa Foundation Research Grant. He is on the Tech-nical Advisory Board of Southern California Edisonand was a member of the Networking and InformationTechnology Technical Advisory Group for the US

    Presidents Council of Advisors on Science and Technology (PCAST). He isa Senior Editor of the IEEE Journal on Selected Areas in Communications,the IEEE Trans. Control of Network Systems, and the IEEE Trans. NetworkScience & Engineering, and is on the editorial board of NOW Foundations andTrends in Networking, and in Electric Energy Systems. He is an IEEE Fellowand received his BS from Cornell and PhD from Berkeley both in EE. Hisresearch interests are in power systems and communication networks.

  • 7/24/2019 Low 201405 OPFTutorial2parts TCNS

    13/25

    IEEE TRANS. ON CONTROL OF NETWORK SYSTEMS, JUNE 2014 1

    Convex Relaxation of Optimal Power Flow

    Part II: Exactness

    Steven H. Low EAS, Caltech [email protected]

    AbstractThis tutorial summarizes recent advances in theconvex relaxation of the optimal power flow (OPF) problem,focusing on structural properties rather than algorithms. PartI presents two power flow models, formulates OPF and theirrelaxations in each model, and proves equivalence relationsamong them. Part II presents sufficient conditions under whichthe convex relaxations are exact.

    I. INTRODUCTION

    The optimal power flow (OPF) problem is fundamentalin power systems as it underlies many applications such

    as economic dispatch, unit commitment, state estimation,

    stability and reliability assessment, volt/var control, demand

    response, etc. OPF seeks to optimize a certain objective

    function, such as power loss, generation cost and/or user

    utilities, subject to Kirchhoffs laws as well as capacity,

    stability and security constraints on the voltages and power

    flows. There has been a great deal of research on OPF since

    Carpentiers first formulation in 1962 [1]. Recent surveys can

    be found in, e.g., [2], [3], [4], [5], [6], [7], [8], [9], [10], [11],

    [12], [13].

    OPF is generally nonconvex and NP-hard, and a large

    number of optimization algorithms and relaxations have been

    proposed. To the best of our knowledge solving OPF through

    semidefinite relaxation is first proposed in [14] as a second-

    order cone program (SOCP) for radial (tree) networks and in

    [15] as a semidefinite program (SDP) for general networks

    in a bus injection model. It is first proposed in [16], [17]

    as an SOCP for radial networks in the branch flow model

    of [18], [19]. While these convex relaxations have been

    illustrated numerically in [14] and [15], whether or when they

    will turn out to be exact is first studied in [20]. Exploiting

    graph sparsity to simplify the SDP relaxation of OPF is first

    proposed in [21], [22] and analyzed in [23], [24].

    Solving OPF through convex relaxation offers severaladvantages, as discussed in Part I of this tutorial [25, Section

    I]. In particular it provides the ability to check if a solution

    is globally optimal. If it is not, the solution provides a

    lower bound on the minimum cost and hence a bound on

    Acknowledgment: We thank the support of NSF through NetSECNS 0911041, ARPA-E through GENI DE-AR0000226, Southern CaliforniaEdison, the National Science Council of Taiwan through NSC 103-3113-P-008-001, the Los Alamos National Lab (DoE), and Caltechs ResnickInstitute. A preliminary and abridged version has appeared in Proceedingsof the IREP Symposium - Bulk Power System Dynamics and Control - IX,Rethymnon, Greece, August 25-30, 2013.

    how far any feasible solution is from optimality. Unlike

    approximations, if a relaxed problem is infeasible, it is a

    certificate that the original OPF is infeasible.

    This tutorial presents main results on convex relaxations

    of OPF developed in the last few years. In Part I [25], we

    present the bus injection model (BIM) and the branch flow

    model (BFM), formulate OPF within each model, and prove

    their equivalence. The complexity of OPF formulated here

    lies in the quadratic nature of power flows, i.e., the nonconvex

    quadratic constraints on the feasible set of OPF. We character-

    ize these feasible sets and design convex supersets that lead

    to three different convex relaxations based on semidefinite

    programming (SDP), chordal extension, and second-order

    cone programming (SOCP). When a convex relaxation is

    exact, an optimal solution of the original nonconvex OPF can

    be recovered from every optimal solution of the relaxation.

    In Part II we summarize main sufficient conditions that

    guarantee the exactness of these relaxations.

    Network topology turns out to play a critical role in

    determining whether a relaxation is exact. In Section II we

    review the definitions of OPF and their convex relaxations

    developed in [25]. We also define the notion of exactnessadopted in this paper. In Section III we present three types

    of sufficient conditions for these relaxations to be exact for

    radial networks. These conditions are generally not necessary

    and they have implications on allowable power injections,

    voltage magnitudes, or voltage angles:

    A Power injections: These conditions require that not

    both constraints on real and reactive power injections

    be binding at both ends of a line.

    B Voltages magnitudes:These conditions require that the

    upper bounds on voltage magnitudes not be binding.

    They can be enforced through affine constraints on

    power injections.

    C Voltage angles: These conditions require that the volt-

    age angles across each line be sufficiently close. This

    is needed also for stability reasons.

    These conditions and their references are summarized in

    Tables I and II. Some of these sufficient conditions are proved

    using BIM and others using BFM. Since these two models are

    equivalent (in the sense that there is a linear bijection between

    their solution sets [24], [25]), these sufficient conditions

    apply to both models. The proofs of these conditions typically

    do not require that the cost function be convex (they focus

  • 7/24/2019 Low 201405 OPFTutorial2parts TCNS

    14/25

    IEEE TRANS. ON CONTROL OF NETWORK SYSTEMS, JUNE 2014 2

    type condition model reference remark

    A power injections BIM, BFM [26], [27], [28], [29], [30]

    [31], [16], [17]

    B voltage magnitudes BFM [32], [33], [34], [35] allows general injection region

    C voltage angles BIM [36], [37] makes use of branch power flows

    TABLE I: Sufficient conditions for radial (tree) networks.

    network condition reference remark

    with phase shifters type A, B, C [17, Part II], [38] equivalent to radial networks

    direct current type A [17, Part I], [20], [39] assumes nonnegative voltages

    type B [40], [41] assumes nonnegative voltages

    TABLE II: Sufficient conditions for mesh networks

    on the feasible sets and usually only need the cost function

    to be monotonic). Convexity is required however for efficient

    computation. Moreover it is proved in [35] using BFM that

    when the cost function is convex then exactness of the SOCP

    relaxation implies uniqueness of the optimal solution for

    radial networks. Hence the equivalence of BIM and BFM

    implies that any of the three types of sufficient conditions

    guarantees that, for a radial network with a convex cost

    function, there is a unique optimal solution and it can be

    computed by solving an SOCP. Since the SDP and chordal

    relaxations are equivalent to the SOCP relaxation for radial

    networks [24], [25], these results apply to all three types

    of relaxations. Empirical evidences suggest some of these

    conditions are likely satisfied in practice. This is important

    as most power distribution systems are radial.

    These conditions are insufficient for general mesh net-

    works because they cannot guarantee that an optimal solution

    of a relaxation satisfies the cycle condition discussed in [25].

    In Section IV we show that these conditions are however

    sufficient for mesh networks that have tunable phase shifters

    at strategic locations. The phase shifters effectively make a

    mesh network behave like a radial network as far as convex

    relaxation is concerned. The result can help determine if a

    network with a given set of phase shifters can be convexi-

    fied and, if not, where additional phase shifters are needed

    for convexification. These conditions are also sufficient for

    direct current (dc) mesh networks where all variables arein the real rather than complex domain. Counterexamples

    are known where SDP relaxation is not exact, especially for

    AC mesh networks without tunable phase shifters [42], [43].

    We discuss three recent approaches for global optimization

    of OPF when the semidefinite relaxations discussed in this

    tutorial fail.

    We conclude in Section V. All proofs are omitted and can

    be found in the original papers or the arXiv version of this

    tutorial.

    II. OPF AND ITS RELAXATIONS

    We use the notations and definitions from Part I of thispaper. In this section we summarize the OPF problems and

    their relaxations developed there; see [25] for details.

    We adopt in this paper a strong sense of exactness where

    we require the optimal solution set of the OPF problem

    and that of its relaxation be equivalent. This implies that

    an optimal solution of the nonconvex OPF problem can

    be recovered from every optimal solution of its relaxation.

    This is important because it ensures any algorithm that

    solves an exact relaxation always produces a globally optimal

    solution to the OPF problem. Indeed interior point methods

    for solving SDPs tend to produce a solution matrix with a

    maximum rank [44], so can miss a rank-1 solution if the

    relaxation has non-rank-1 solutions as well. It can be difficultto recover an optimal solution of OPF from such a non-

    rank-1 solution, and our definition of exactness avoids this

    complication. See Section II-C for detailed justifications.

    A. Bus injection model

    The BIM adopts an undirected graph G 1 and can be

    formulated in terms of just the complex voltage vector

    V Cn+1. The feasible set is described by the following

    constraints:

    sj k:(j,k)E

    yHjkVj(VHj V

    Hk ) sj, j N

    + (1a)

    vj |Vj|2

    vj,

    j N+

    (1b)

    where sj, sj,vj,vj, possibly i, are given bounds onpower injections and voltage magnitudes. Note that the vector

    VincludesV0 which is assumed given (v0= v0 and V0= 0)

    unless otherwise specified. The problem of interest is:

    OPF:

    minVCn+1

    C(V) subject to V satisfies (1) (2)

    1We will use bus and node interchangeably and line and linkinterchangeably.

  • 7/24/2019 Low 201405 OPFTutorial2parts TCNS

    15/25

    IEEE TRANS. ON CONTROL OF NETWORK SYSTEMS, JUNE 2014 3

    For relaxations consider the partial matrix WG defined on

    the network graph G that satisfies

    sj k:(j,k)E

    yHjk

    [WG]j j [WG]jk sj, j N

    + (3a)

    vj [WG]j j vj,

    j N

    +

    (3b)We say that WG satisfies the cycle condition if for every

    cycle c in G

    (j,k)c

    [WG]jk = 0 mod 2 (4)

    We assume the cost functionCdepends onVonly through

    VVH and use the same symbol Cto denote the cost in terms

    of a full or partial matrix. Moreover we assume Cdepends

    on the matrix only through the submatrix WG defined on

    the network graph G. See [25, Section IV] for more details

    including the definitions of Wc(G) 0 and WG(j, k) 0.Define the convex relaxations:

    OPF-sdp:

    minWSn+1

    C(WG) subject to WG satisfies (3),W 0 (5)

    OPF-ch:

    minWc(G)

    C(WG) subject to WG satisfies (3),Wc(G) 0 (6)

    OPF-socp:

    minWG

    C(WG) subject to WG satisfies (3),

    WG(j, k) 0, (j, k) E (7)

    For BIM, we say that OPF-sdp (5) is exactif every optimal

    solutionWsdp of OPF-sdp is psd rank-1; OPF-ch (6) is exact

    if every optimal solutionWchc(G) of OPF-ch is psd rank-1 (i.e.,

    the principal submatrices Wchc(G)(q) of W

    chc(G) are psd rank-1

    for all maximal cliques q of the chordal extension c(G) ofgraph G); OPF-socp (7) is exact if every optimal solution

    Wsocp

    G of OPF-socp is 22 psd rank-1 and satisfies the cycle

    condition (4). To recover an optimal solution Vopt of OPF

    (2) from Wsdp orWchc(G) or W

    socpG , see [25, Section IV-D].

    B. Branch flow model

    The BFM adopts a directed graph G and is defined by the

    following set of equations:

    k:jk

    Sjk = i:ij

    Si jzi j|Ii j|

    2

    + sj, j N+ (8a)

    Ijk = yjk(VjVk), j k E (8b)

    Sjk = VjIHjk, j k

    E (8c)

    Denote the variables in BFM (8) by x := (S,I,V, s) C2(m+n+1). Note that the vectors V and s include V0 (given)

    and s0 respectively. Recall from [25] the variables x :=(S,`, v,s) R3(m+n+1) that is related to x by the mappingx= h( x) with jk:=|Ijk|

    2 and vj :=|Vj|2. The operational

    constraints are:

    vj vj vj, j N+ (9a)

    sj sj sj, j N+ (9b)

    We assume the cost function depends on x only through

    x=h( x). Then the problem in BFM is:OPF:

    minx

    C(x) subject to x satisfies (8), (9) (10)

    For SOCP relaxation consider:

    k:jk

    Sjk = i:ij

    (Si j zi j`i j) + sj, j N+ (11a)

    vj vk = 2 RezHjkSjk

    |zjk|

    2jk, j k E (11b)

    vj jk |Sjk|2, j k E (11c)

    We say that x satisfies the cycle condition if

    Rn such that B= (x) mod 2 (12)

    where B is the m n reduced incidence matrix and, given

    x:= (S,`, v, s), jk(x):=(vjzHjkSjk) can be interpreted as

    the voltage angle difference across line j k implied by x

    (See [25, Section V]). The SOCP relaxation in BFM is

    OPF-socp:

    minx

    C(x) subject to x satisfies (11), (9) (13)

    For BFM, OPF-socp (13) in BFM is exactif every optimal

    solution xsocp attains equality in (11c) and satisfies the cycle

    condition (12). See [25, Section V-A] for how to recover an

    optimal solution xopt of OPF (10) from any optimal solution

    xsocp of its SOCP relaxation.

    C. Exactness

    The definition of exactness adopted in this paper is more

    stringent than needed. Consider SOCP relaxation in BIM

    as an illustration (the same applies to the other relaxations

    in BIM and BFM). For any sets A and B, we say that

    A is equivalent to B, denoted by A B, if there is a

    bijection between these two sets. Let M(A) denote the setof minimizers when a certain function is minimized over A.

    Let V and W+G denote the feasible sets of OPF (2) and

    OPF-socp (7) respectively:

    V := {V Cn+1 | V satisfies (1)}

    W+G := {WG | WG satisfies (3),WG(j, k) 0, (j, k) E}

    Consider the following subset ofW+G :

    WG := {WG | WG satisfies (3), (4),WG(j,k) 0,

    rankWG(j, k) =1,(j,k) E}

    Our definition of exact SOCP relaxation is that M(W+G ) WG. In particular, all optimal solutions of OPF-socp must

    be 22 psd rank-1 and satisfy the cycle condition (4). Since

  • 7/24/2019 Low 201405 OPFTutorial2parts TCNS

    16/25

    IEEE TRANS. ON CONTROL OF NETWORK SYSTEMS, JUNE 2014 4

    WG V(see [25]), exactness requires that the set of optimal

    solutions of OPF-socp (7) be equivalent to that of OPF (2),

    i.e., M(W+G ) = M(WG)M(V).IfM(W+G ) )M(WG) M(V) then OPF-socp (7) is not

    exact according to our definition. Even in this case, however,

    every sufficient condition in this paper guarantees that anoptimal solution of OPF can be easily recovered from an

    optimal solution of the relaxation that is outside WG. The dif-

    ference between M(W+G ) =M(WG) and M(W+G ))M(WG)

    is often minor, depending on the objective function; see

    Remarks 1 and 2 and comments after Theorems 5 and 8

    in Section III. Hence we adopt the more stringent definition

    of exactness for simplicity.

    III. RADIAL NETWORKS

    In this section we summarize the three types of sufficient

    conditions listed in Table I for semidefinite relaxations of

    OPF to be exact for radial (tree) networks. These results are

    important as most distribution systems are radial.

    For radial networks, if SOCP relaxation is exact then SDP

    and chordal relaxations are also exact (see [25, Theorems

    5, 9]). We hence focus in this section on the exactness of

    OPF-socp in both BIM and BFM. Since the cycle conditions

    (4) and (12) are vacuous for radial networks, OPF-socp (7)

    is exact if all of its optimal solutions are 2 2 rank-1 and

    OPF-socp (13) is exact if all of its optimal solutions attain

    equalities in (11c). We will freely use either BIM or BFM

    in discussing these results. To avoid triviality we make the

    following assumption throughout the paper:

    The voltage lower bounds satisfy vj > 0, j N+. The

    original problems OPF (2) and (10) are feasible.

    A. Linear separability

    We will first present a general result on the exactness of

    the SOCP relaxation of general QCQP and then apply it to

    OPF. This result is first formulated and proved using a duality

    argument in [27], generalizing the result of [26]. It is proved

    using a simpler argument in [31].

    Fix an undirected graph G = (N+,E) where N+ :={0, 1, . . . , n} and EN+N+. Fix Hermitian matrices Cl Sn+1, l=0, . . . ,L, defined on G, i.e., [Cl ]jk=0 if(j, k) 6E.Consider QCQP:

    minxCn+1

    xHC0x (14a)

    subject to xHClx bl , l=1, . . . ,L (14b)

    where C0,Cl C(n+1)(n+1), bl R, l = 1, . . . ,L, and its

    SOCP relaxation where the optimization variable ranges over

    Hermitian partial matrices WG:

    minWG

    trC0WG (15a)

    subject to trClWG bl , l=1, . . . ,L (15b)

    WG(j, k) 0, (j, k) E (15c)

    The following result is proved in [27], [31]. It can be

    regarded as an extension of [45] on the SOCP relaxation

    of QCQP from the real domain to the complex domain.

    Consider: 2

    A1: The cost matrixC0 is positive definite.

    A2: For each link (j, k) E there exists an jk such that [Cl ]jk [i j,i j+] for all l=0, . . . ,L.

    LetCopt and Csocp denote the optimal values of QCQP (14)

    and SOCP (15) respectively.

    Theorem 1: Suppose G is a tree and A2 holds. Then

    Copt = Csocp and an optimal solution of QCQP (14) can berecovered from every optimal solution of SOCP (15).

    Remark 1: The proof of Theorem 1 prescribes a simple

    procedure to recover an optimal solution of QCQP (14)

    from any optimal solution of its SOCP relaxation (15). The

    construction does not need the optimal solution of SOCP

    (15) to be 22 rank-1. Hence the SOCP relaxation may not

    be exact according to our definition of exactness, i.e., someoptimal solutions of (15) may be 2 2 psd but not 2 2

    rank-1. If the objective function is strictly convex however

    then the optimal solution sets of QCQP (14) and SOCP (15)

    are indeed equivalent.

    Corollary 2: Suppose G is a tree and A1A2 hold. Then

    SOCP (15) is exact.

    We now apply Theorem 1 to our OPF problem. Recall that

    OPF (2) in BIM can be written as a standard form QCQP

    [27]:

    minxCn

    VHC0V

    s.t. VHjV pj, VH(j)V pj (16a)

    VHjV qj, VH(j)V qj (16b)

    VHJjV vj, VH(Jj)V vj

    for some Hermitian matrices C0,j ,j,Jj where j N+.

    A2 depends only on the off-diagonal entries ofC0, j, j(Jj are diagonal matrices). It implies a simple pattern on the

    power injection constraints (16a)(16b). Let yjk= gjk ibjkwith gjk> 0, bjk> 0. Then we have (from [27]):

    [k]i j =

    12

    Yi j = 12

    (gi j ibi j) if k=i12

    YHi j = 12

    (gi j+ ibi j) if k= j

    0 if k6 {i,j}

    [k]i j =

    12i

    Yi j = 12

    (bi j+ igi j) ifk=i12i

    YHi j = 12

    (bi j igi j) ifk= j

    0 ifk6 {i,j}

    Hence for each line (j, k) Ethe relevant angles for A2 are

    2All angles should be interpreted as mod 2, i.e., projected onto(,].

  • 7/24/2019 Low 201405 OPFTutorial2parts TCNS

    17/25

    IEEE TRANS. ON CONTROL OF NETWORK SYSTEMS, JUNE 2014 5

    those of[C0]jk and

    [j]jk = 1

    2(gjk ibjk)

    [k]jk = 1

    2 (gjk+ ibjk)

    [j]jk = 1

    2(bjk+ igjk)

    [k]jk = 1

    2(bjk igjk)

    as well as