
Linear and quadratic reformulations of nonlinear optimization problems in binary variables

Elisabeth Rodríguez-Heck, RWTH Aachen University, Chair of Operations Research

Results of a PhD thesis advised by Yves Crama, University of Liège

G-Scop, Grenoble, February 7, 2019

Introduction

Pseudo-Boolean optimization

General problem: pseudo-Boolean optimization. Given a pseudo-Boolean function f : {0,1}^n → R,

  min_{x ∈ {0,1}^n} f(x).

Theorem (Hammer, Rosenberg, & Rudeanu, 1963). Every pseudo-Boolean function f : {0,1}^n → R admits a unique multilinear expression.

- Given f, finding its unique multilinear representation can be costly! (Size of the input: O(2^n).)


Multilinear 0–1 optimization

Assumption: f is given as a multilinear polynomial, with set of monomials S ⊆ 2^[n] and a_S ≠ 0 for S ∈ S:

  min  Σ_{S∈S} a_S Π_{i∈S} x_i
  s.t. x_i ∈ {0,1}, for i = 1, ..., n

Example:

  f(x1, x2, x3) = 9 x1 x2 x3 + 8 x1 x2 − 6 x2 x3 − 2 x1
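To make the problem concrete, here is a small illustrative Python sketch (added here, not part of the original slides) that minimizes the example polynomial by enumerating all 2^n binary assignments; the term encoding and variable indexing are choices made for this example.

```python
from itertools import product

# Multilinear polynomial as (coefficient, set of variable indices):
# f(x1, x2, x3) = 9 x1 x2 x3 + 8 x1 x2 - 6 x2 x3 - 2 x1
terms = [(9, {1, 2, 3}), (8, {1, 2}), (-6, {2, 3}), (-2, {1})]
n = 3

def f(x):
    # A monomial contributes its coefficient only if all of its variables equal 1.
    return sum(a for a, S in terms if all(x[i] == 1 for i in S))

assignments = [dict(zip(range(1, n + 1), bits)) for bits in product((0, 1), repeat=n)]
best = min(assignments, key=f)
print("minimizer:", best, "value:", f(best))
```

For n of realistic size this enumeration is of course hopeless, which is exactly why the linear and quadratic reformulations discussed next are of interest.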

Application in computer vision: image restoration

Input: a blurred image. Output: the restored image. (Figure: example image from the Corel database.)

Applications

- Location problems
- Joint supply chain design and inventory management
- Statistical mechanics
- Quantum computing
- ...

Linear and quadratic reformulations

The multilinear 0–1 optimization problem

  min_{x_i} Σ_{S∈S} a_S Π_{i∈S} x_i

can be reformulated as:

- a linear problem min_{x_i, y_j} l(x, y), with additional y variables and additional constraints, or
- a quadratic problem min_{x_i, y_j} q(x, y), with additional y variables and without constraints (by choice).

Overview

Introduction: Multilinear 0–1 optimization
Part I: Linear reformulations
Part II: Quadratic reformulations
Part III: Computational experiments
Conclusions and Perspectives


Introduction to linearizations

The standard linearization (SL)

Nonlinear problem:

  min Σ_{S∈S} a_S Π_{i∈S} x_i + Σ_{i=1}^n c_i x_i

Linearized problem:

  min Σ_{S∈S} a_S y_S + Σ_{i=1}^n c_i x_i

Standard Linearization (Fortet, 1959; Glover & Woolsey, 1973): y_S = Π_{i∈S} x_i.

When x_i ∈ {0,1}, this is equivalent to:

  y_S ≤ x_i                        ∀i ∈ S, ∀S ∈ S   (1)
  y_S ≥ Σ_{i∈S} x_i − (|S| − 1)    ∀S ∈ S           (2)
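As an added illustration (not on the slide), the following brute-force sketch checks that, for binary x and binary y_S, inequalities (1)–(2) are satisfiable exactly when y_S equals the product of the x_i; the sample monomial S is an arbitrary choice.

```python
from itertools import product

# Illustrative check: for binary x and binary y_S, the SL inequalities (1)-(2)
# are satisfiable exactly when y_S equals prod_{i in S} x_i.
def sl_feasible(x, y_S, S):
    # (1): y_S <= x_i for every i in S;  (2): y_S >= sum_{i in S} x_i - (|S| - 1)
    return all(y_S <= x[i] for i in S) and y_S >= sum(x[i] for i in S) - (len(S) - 1)

S = [1, 2, 3]                      # sample monomial on three variables (arbitrary choice)
for bits in product((0, 1), repeat=len(S)):
    x = dict(zip(S, bits))
    prod_S = 1
    for i in S:
        prod_S *= x[i]
    feasible = [y for y in (0, 1) if sl_feasible(x, y, S)]
    assert feasible == [prod_S], (x, feasible)
print("For every binary x, (1)-(2) force y_S to equal the product of the x_i.")
```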


SL drawback: the continuous relaxation given by the SL is very weak!

(Illustration: image from the cover of A. Schrijver's book, Theory of Linear and Integer Programming.)

When do the SL inequalities provide a perfect formulation?

Hypergraph associated with a polynomial

A multilinear polynomial can be associated with a hypergraph H = (V, S), where V = {1, ..., n}:

  min Σ_{S∈S} a_S Π_{i∈S} x_i + Σ_{i∈V} c_i x_i

Example:

  f(x1, x2, x3) = 9 x1 x2 x3 − 8 x1 x2 + 6 x2 x3 − 2 x1

(Figure: hypergraph on vertices x1, x2, x3 with hyperedges {x1, x2, x3}, {x1, x2} and {x2, x3}.)

Characterization for the unsigned case

Theorem 1 (Buchheim, Crama, & Rodríguez-Heck, 2019). Given a hypergraph H, the following statements are equivalent:
(a) The SL inequalities define an integer polytope.
(b) The matrix of coefficients of the SL inequalities is balanced.
(c) The hypergraph H is Berge-acyclic.

- Obtained independently and simultaneously by (Del Pia & Khajavirad, 2018).
- Theorem 1 is a corollary of a more general result considering the signs of the coefficients.
- The balancedness condition can be checked in polynomial time.


A class of valid inequalities for multilinear 0–1 optimization problems

The 2-link inequalities

Definition (Crama & Rodríguez-Heck, 2017). For S, T ∈ S and y_S, y_T such that y_S = Π_{i∈S} x_i and y_T = Π_{i∈T} x_i, the 2-link associated with (S, T) is the linear inequality

  y_S ≤ y_T − Σ_{i∈T\S} x_i + |T\S|.

Interpretation:

- y_S = 1 ⇒ x_i = 1 for all i ∈ S.
- y_T = 0 and y_S = 1 ⇒ there exists j ∈ T\S with x_j = 0.
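The validity of the 2-link can be checked mechanically; the sketch below (an added illustration, with arbitrarily chosen overlapping monomials S and T) enumerates all binary x, sets y_S and y_T to the exact products, and asserts the inequality.

```python
from itertools import product

S, T = {1, 2, 3}, {2, 3, 4}          # sample overlapping monomials (illustrative choice)
variables = sorted(S | T)

for bits in product((0, 1), repeat=len(variables)):
    x = dict(zip(variables, bits))
    y_S = min(x[i] for i in S)       # product of binary values equals their minimum
    y_T = min(x[i] for i in T)
    # 2-link associated with (S, T): y_S <= y_T - sum_{i in T\S} x_i + |T\S|
    assert y_S <= y_T - sum(x[i] for i in T - S) + len(T - S)
print("2-link inequality holds for every binary assignment.")
```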

A complete description for the case of two monomials

Theorem 2 (Crama & Rodríguez-Heck, 2017). For the case of two nonlinear monomials, P*_SL = P^{2links}_SL, i.e., the standard linearization and the 2-links provide a complete description of P*_SL.


Proof idea: write y_{S∩T} = Π_{i∈S∩T} x_i, y_S = y_{S∩T} Π_{i∈S\T} x_i, y_T = y_{S∩T} Π_{i∈T\S} x_i.

- Consider the extended formulation (with variables in [0, 1]):

    y_{S∩T} ≤ x_i,                             ∀i ∈ S∩T,   (3)
    y_{S∩T} ≥ Σ_{i∈S∩T} x_i − (|S∩T| − 1),                 (4)
    y_S ≤ y_{S∩T},                                          (5)
    y_S ≤ x_i,                                 ∀i ∈ S\T,   (6)
    y_S ≥ Σ_{i∈S\T} x_i + y_{S∩T} − |S\T|,                 (7)
    y_T ≤ y_{S∩T},                                          (8)
    y_T ≤ x_i,                                 ∀i ∈ T\S,   (9)
    y_T ≥ Σ_{i∈T\S} x_i + y_{S∩T} − |T\S|.                 (10)

- Notice that the two polytopes P_0 and P_1, obtained by fixing variable y_{S∩T} to 0 and 1 respectively, are integral.
- Compute conv(P_0 ∪ P_1) using (Balas, 1974) and see that it is P^{2links}_SL.

Overview

Introduction: Multilinear 0–1 optimization
Part I: Linear reformulations
Part II: Quadratic reformulations
Part III: Computational experiments
Conclusions and Perspectives

Introduction to quadratizations

Quadratization: definition and desirable properties

Definition (Anthony, Boros, Crama, & Gruber, 2017). Given a pseudo-Boolean function f(x) with x ∈ {0,1}^n, a quadratization g(x, y) is a function satisfying:
- g is quadratic;
- g(x, y) depends on the original variables x and on m auxiliary variables y;
- f(x) = min{ g(x, y) : y ∈ {0,1}^m } for all x ∈ {0,1}^n.

Which quadratizations are "good"?
- Small number of auxiliary variables (compact).
- Small number of positive quadratic terms (x_i x_j, x_i y_j, ...) (empirical distance from submodularity).
- Set of quadratic terms with specific underlying graphs.
- ...


Persistencies

Weak Persistency Theorem (Hammer, Hansen, & Simeone, 1984). Let (QP) be a quadratic optimization problem on x ∈ {0,1}^n, and let (x̄, ȳ) be an optimal solution of the continuous standard linearization of (QP):

  min  c_0 + Σ_{j=1}^n c_j x_j + Σ_{1≤i<j≤n} c_{ij} y_{ij}
  s.t. y_{ij} ≥ x_i + x_j − 1    i, j = 1, ..., n, i < j
       y_{ij} ≤ x_i              i, j = 1, ..., n, i < j
       y_{ij} ≤ x_j              i, j = 1, ..., n, i < j
       0 ≤ y_{ij} ≤ 1            i, j = 1, ..., n, i < j
       0 ≤ x_i ≤ 1               i = 1, ..., n

such that x̄_j = 1 for j ∈ O and x̄_j = 0 for j ∈ Z. Then, for any minimizing vector x* of (QP), setting x*_j = 1 for j ∈ O and x*_j = 0 for j ∈ Z will also yield a minimum of f.

(See also the survey (Boros & Hammer, 2002).)

Persistencies

- The Weak Persistency Theorem is not the strongest form of persistency.
- There are ways to compute, in polynomial time, a maximal set of variables to fix, based on a network flow algorithm (Boros, Hammer, Sun, & Tavares, 2008).
- In computer vision, image restoration and related problems with up to millions of variables are solved efficiently thanks to the use of persistencies.


Termwise quadratizations

Main idea: quadratize monomial by monomial, using disjoint sets of auxiliary variables.

  f(x) = −35 x1 x2 x3 x4 x5 + 50 x1 x2 x3 x4 − 10 x1 x2 x4 x5 + 5 x2 x3 x4 + 5 x4 x5 − 20 x1

Negative monomial (Kolmogorov & Zabih, 2004; Freedman & Drineas, 2005):

  −Π_{i=1}^n x_i = min_{y∈{0,1}} −y (Σ_{i=1}^n x_i − (n − 1))

- One auxiliary variable is sufficient.
- No positive quadratic terms.

Check that, for every x ∈ {0,1}^n, min_y g(x, y) = −Π_{i=1}^n x_i; two cases:

1. If x_i = 1 for all i, then min_y −y has minimum value −1, reached for y = 1.
2. If there exists i such that x_i = 0, then min_y −Cy with C ≤ 0 has minimum value 0, reached for y = 0.
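The two cases above can also be verified exhaustively for small n; the following sketch (added for illustration, with n = 5 as an arbitrary choice) checks the Freedman–Drineas identity over all binary x.

```python
from itertools import product

# Brute-force check of the negative-monomial identity:
#   -prod_i x_i == min over y in {0,1} of  -y * (sum_i x_i - (n - 1))
n = 5
for x in product((0, 1), repeat=n):
    lhs = -1 if all(x) else 0
    rhs = min(-y * (sum(x) - (n - 1)) for y in (0, 1))
    assert lhs == rhs, (x, lhs, rhs)
print("Negative-monomial quadratization verified for n =", n)
```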


Positive monomial (Ishikawa, 2011):

  Π_{i=1}^n x_i = min_{y∈{0,1}^k} Σ_{i=1}^k y_i (c_{i,n}(−|x| + 2i) − 1) + |x|(|x| − 1)/2,

where the c_{i,n} are constants defined in (Ishikawa, 2011).

- Number of auxiliary variables: k = ⌊(n − 1)/2⌋.
- (n choose 2) positive quadratic terms.

Upper bound for the positive monomial: ⌈log(n)⌉ − 1

Theorem 3 (simplified) (Boros, Crama, & Rodríguez-Heck, 2018). Assume that n = 2^ℓ and let |x| = Σ_{i=1}^n x_i be the Hamming weight of x ∈ {0,1}^n. Then

  g(x, y) = (1/2) (|x| − Σ_{i=1}^{ℓ−1} 2^i y_i) (|x| − Σ_{i=1}^{ℓ−1} 2^i y_i − 1)

is a quadratization of the positive monomial P_n(x) = Π_{i=1}^n x_i using ⌈log(n)⌉ − 1 auxiliary variables.

Proof idea: check that, for every x ∈ {0,1}^n, min_y g(x, y) = Π_{i=1}^n x_i.

- The quadratization depends on |x|, which takes values between 0 and n.
- Case 1 (|x| ≤ n − 1): integers between 0 and n − 1 can be represented as a sum of log(n) powers of 2.
  - Use the y variables to express which powers of 2 are in the sum.
  - For |x| ≤ n − 1, one factor reaches the minimum value of zero for odd |x|, and the other factor for even |x|.
- Case 2 (|x| = n): similarly, one shows min_y g(x, y) = 1.
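A brute-force sanity check of Theorem 3 for n = 8 (so ℓ = 3 and ℓ − 1 = 2 auxiliary variables); this is an added illustration, not part of the original slides.

```python
from itertools import product

# Check g(x, y) = 1/2 (|x| - s)(|x| - s - 1), with s = sum_{i=1}^{l-1} 2^i y_i,
# whose minimum over y should be 1 if all x_i = 1 and 0 otherwise.
l = 3
n = 2 ** l
shifts = [sum(2 ** (i + 1) * y[i] for i in range(l - 1))
          for y in product((0, 1), repeat=l - 1)]           # all achievable values of s
for x in product((0, 1), repeat=n):
    w = sum(x)                                               # Hamming weight |x|
    best = min((w - s) * (w - s - 1) // 2 for s in shifts)   # product of consecutive ints is even
    assert best == (1 if w == n else 0), (x, best)
print("Theorem 3 quadratization verified for n =", n)
```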


Lower bound for the positive monomial

Theorem 4 (Boros, Crama, & Rodríguez-Heck, 2018). If g(x, y) is a quadratization of the positive monomial P_n(x) = Π_{i=1}^n x_i using m auxiliary variables, then

  m ≥ ⌈log(n)⌉ − 1.

Proof idea:

- Consider r(x) = Π_{y∈{0,1}^m} g(x, y).
- deg(r) ≤ 2 · 2^m.
- deg(r) ≥ n, because r(x) = α P_n(x) with α > 0 (by uniqueness of the multilinear representation). More precisely:
  - If |x| < n, there exists y ∈ {0,1}^m such that g(x, y) = 0.
  - If |x| = n, then g(x, y) ≥ 1 for all y ∈ {0,1}^m.


Results for more general functions

  Function            | Lower bound                                   | Upper bound
  --------------------+-----------------------------------------------+----------------------------------
  Zero until k        | Ω(2^{n/2}) for some function (1);             | O(2^{n/2}) (1)
                      | ⌈log(k)⌉ − 1 for all functions                |
  Symmetric           | Ω(√n) for some function (2)                   | O(√n) = 2⌈√(n+1)⌉
  Exact k-out-of-n    | max(⌈log(k)⌉, ⌈log(n−k)⌉) − 1                 | max(⌈log(k)⌉, ⌈log(n−k)⌉)
  At least k-out-of-n | ⌈log(k)⌉ − 1                                  | max(⌈log(k)⌉, ⌈log(n−k)⌉)
  Positive monomial   | ⌈log(n)⌉ − 1                                  | ⌈log(n)⌉ − 1
  Parity              | ⌈log(n)⌉ − 1                                  | ⌈log(n)⌉ − 1

  (1) See (Anthony et al., 2017).
  (2) See (Anthony, Boros, Crama, & Gruber, 2016).

(Figure: inclusion diagram of the function classes — zero until k, symmetric, exact k-out-of-n, at least k-out-of-n, parity, positive monomial.)

Pairwise covers

(Anthony, Boros, Crama, & Gruber, 2017)

Substituting common sets of variables:

  f(x) = −35 x1 x2 x3 x4 x5 + 50 x1 x2 x3 x4 − 10 x1 x2 x4 x5 + 5 x2 x3 x4 + 5 x4 x5 − 20 x1

could be replaced by

  f(x) = −35 y12 y345 + 50 y12 y34 − 10 y12 y45 + 5 x2 y34 + 5 x4 x5 − 20 x1 + P(x, y)

where P(x, y) imposes y12 = x1 x2, y345 = y34 x5, ...
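The slide does not spell out P(x, y). One classical way to impose y12 = x1 x2 in a minimization (an assumption used here purely for illustration, not necessarily the construction of the thesis) is Rosenberg's penalty M (x1 x2 − 2 x1 y12 − 2 x2 y12 + 3 y12) with M > 0 large, which vanishes exactly when y12 = x1 x2 and is at least 1 otherwise. A brute-force check of that property:

```python
from itertools import product

# Rosenberg-style penalty (illustrative): p = x1*x2 - 2*x1*y - 2*x2*y + 3*y.
# It equals 0 exactly when y = x1*x2 and is >= 1 otherwise, so adding M*p for a
# large enough M > 0 forces y to behave like the product in a minimization.
def penalty(x1, x2, y):
    return x1 * x2 - 2 * x1 * y - 2 * x2 * y + 3 * y

for x1, x2, y in product((0, 1), repeat=3):
    p = penalty(x1, x2, y)
    assert (p == 0) == (y == x1 * x2) and p >= 0, (x1, x2, y, p)
print("Penalty vanishes exactly when y = x1*x2.")
```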

Heuristics for small pairwise covers

Three heuristics:
- PC1: separate the first two variables from the rest.
- PC2: most "popular" intersections first.
- PC3: most "popular" pairs of variables first.

Main idea: identify subterms that appear as subsets of one or more monomials in the input monomial set S (a rough sketch follows).
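A rough sketch of what a "most popular pairs first" heuristic in the spirit of PC3 could look like (the function name, data layout and greedy rule are assumptions for illustration, not the thesis code): repeatedly substitute the pair of variables occurring in the most high-degree monomials by a fresh auxiliary variable, until every monomial is at most quadratic.

```python
from itertools import combinations
from collections import Counter

def pc3_pairwise_cover(monomials):
    """Greedy 'most popular pairs first' reduction (illustrative sketch)."""
    monomials = [frozenset(m) for m in monomials]
    substitutions = {}                     # new variable -> the pair it replaces
    counter = 0
    while True:
        high = [m for m in monomials if len(m) > 2]
        if not high:
            return monomials, substitutions
        # Count how often each pair of variables appears inside high-degree monomials.
        pair_counts = Counter(p for m in high for p in combinations(sorted(m, key=str), 2))
        pair = frozenset(max(pair_counts, key=pair_counts.get))
        counter += 1
        new_var = f"y{counter}"
        substitutions[new_var] = pair
        # Replace the pair by the fresh variable in every monomial of degree > 2 containing it.
        monomials = [(m - pair) | {new_var} if pair <= m and len(m) > 2 else m
                     for m in monomials]

terms = [{1, 2, 3, 4, 5}, {1, 2, 3, 4}, {1, 2, 4, 5}, {2, 3, 4}, {4, 5}, {1}]
reduced, subs = pc3_pairwise_cover(terms)
print("reduced monomials:", reduced)
print("substitutions:", subs)
```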

Overview

Introduction: Multilinear 0–1 optimization
Part I: Linear reformulations
Part II: Quadratic reformulations
Part III: Computational experiments
Conclusions and Perspectives

Comparing linear and quadratic reformulations

Methods

Linearizations: SL, SL-2L (SL with 2-link inequalities).
Quadratizations:
- Pairwise covers: PC1, PC2, PC3.
- Termwise: Ishikawa, logn-1.

Objectives and scope

Objectives:
1. Compare resolution times of linear and quadratic reformulations when relying on a commercial solver.
2. Are the results consistent for different types of instances?

The objective is not to compete with the existing literature.

Dependence on the underlying solver: CPLEX 12.7.

Drawbacks:
- The linear solver is possibly more powerful than the quadratic solver.
- "Black box": little control over the actual resolution.
- Cannot exploit interesting properties like persistencies.

Advantages:
- Widely used commercial solver.
- Avoids comparing commercial against "home-made" software.


Application in computer vision: image restoration

Input: a blurred image. Output: the restored image. (Figure: example image from the Corel database.)

Instances: Vision

Image restoration. Example of a perturbed input image (left) and the corresponding base image (right), as 6×6 binary grids:

  0 0 0 0 0 0        0 0 0 0 0 0
  0 0 1 1 0 1        0 0 1 1 0 0
  0 1 1 1 1 0        0 1 1 1 1 0
  0 1 1 0 1 0        0 1 1 1 1 0
  0 0 1 1 0 0        0 0 1 1 0 0
  1 0 0 0 0 0        0 0 0 0 0 0

Base images:
- top left rectangle (tl)
- centre rectangle (cr)
- cross (cx)

Perturbations:
- none (n)
- low (l)
- high (h)

Up to n = 900 variables and m = 6788 terms.

Vision: all methods, 15×15 (n = 225, m = 1598)

(Plot: resolution times in seconds, up to about 400 s, for SL, SL-2L, PC1, PC2, PC3, Ishikawa and logn-1 on the 15×15 vision instances tl/cr/cx with various perturbation levels.)

Vision: best methods, 15×15 (n = 225, m = 1598)

(Plot: resolution times in seconds, up to about 140 s, for the best methods SL, SL-2L, PC1, PC2 and PC3 on the same instances.)

Vision: quadratization properties

                                       Pairwise covers   Termwise
  Number of y variables                fewer             more
  Number of positive quadratic terms   fewer             more
  Monomial interactions                yes               no

Dynamical glass transition phenomenon

More on YouTube: supercooled water, Lenape High School science skills class.
Scientific articles: (Bernasconi, 1987; Liers, Marinari, Pagacz, Ricci-Tersenghi, & Schmitz, 2010).
Instances downloaded from http://polip.zib.de/autocorrelated_sequences/.


Autocorrelated sequences

At least one method finished within one hour for only 8 of the 40 instances (!).

  Id     n   m        Id     n   m
  20 5   20  207      45 5   45  507
  20 10  20  833      45 11  45  2813
  25 6   25  407      45 23  45  10776
  25 13  25  1782     45 34  45  18348
  25 19  25  3040     45 45  45  21993
  25 25  25  3677     50 6   50  882
  30 4   30  223      50 13  50  4457
  30 8   30  926      50 25  50  14412
  30 15  30  2944     50 38  50  25446
  30 23  30  5376     50 50  50  30271
  30 30  30  6412     55 6   55  977
  35 4   35  263      55 14  55  5790
  35 9   35  1381     55 28  55  19897
  35 18  35  5002     55 41  55  33318
  35 26  35  8347     55 55  55  40402
  40 5   40  447      60 8   60  2036
  40 10  40  2053     60 15  60  7294
  40 20  40  7243     60 30  60  25230
  40 30  40  12690    60 45  60  43689
  40 40  40  15384    60 60  60  52575

Autocorrelated sequences: preliminary results

Resolution times (seconds), linearization-based methods:

  Id     n   m     SL-2L     SL        SL-2L-Only   SL-NoCuts
  20 5   20  207   6.94      11.48     1.06         6.13
  20 10  20  833   112.27    91.75     47           65.89
  25 6   25  407   65.36     137.38    44.52        321.77
  30 4   30  223   4.24      15.7      13.47        349.84
  35 4   35  263   11.92     36.86     69.03        3104.45
  25 13  25  1782  1645.2    2567.17   685.74       2408.69
  30 8   30  926   2743.51   > 3600    > 3600       > 3600
  40 5   40  447   1321.61   > 3600    > 3600       > 3600

Resolution times (seconds), quadratization-based methods:

  Id     n   m     PC1       PC2       PC3       Ishikawa   logn-1
  20 5   20  207   10.58     5.05      4.27      37.47      35.34
  20 10  20  833   90.28     159.47    137.69    417.72     365.47
  25 6   25  407   106.67    80.17     121.03    629.66     466.92
  30 4   30  223   13.52     7.17      7.03      29.67      36.08
  35 4   35  263   24.13     13.25     11.2      49.77      54.14
  25 13  25  1782  2311.09   > 3600    > 3600    > 3600     > 3600
  30 8   30  926   > 3600    > 3600    > 3600    > 3600     > 3600
  40 5   40  447   > 3600    914.27    2053.97   > 3600     > 3600

Autocorrelated sequences: quadratization properties

                                       Pairwise covers   Termwise
  Number of y variables                fewer             more
  Number of positive quadratic terms   fewer             more
  Monomial interactions                yes               no

Instances: Random high degree

What happens with random polynomials?

Generated as in (Buchheim & Rinaldi, 2007):
- The degree of each monomial is chosen randomly (with higher probability for lower degrees).
- The variables in each monomial, as well as the coefficients, are chosen randomly.
- Degrees vary between 7 and 17.

Random high degree: all methods

(Plot: resolution times in seconds, up to about 600 s, for SL, PC1, PC2, PC3, Ishikawa and logn-1 on the random high-degree instances, with names ranging from 200-250-9-1 to 200-400-12-5.)

Random high degree: best methods

(Plot: resolution times in seconds, up to about 25 s, for the best methods SL, PC1, PC2 and PC3 on the same instances.)

Random high degree: quadratization properties

                                       Pairwise covers   Termwise
  Number of y variables                more              fewer
  Number of positive quadratic terms   fewer             more
  Monomial interactions                yes               no

Summary of results

- Linearizations are in general solved faster than quadratizations, which is expected for CPLEX.
- For vision instances the behavior is different and rather surprising:
  - Pairwise covers are faster than SL with default CPLEX branch & cut.
  - SL with 2-links is faster than pairwise covers.
- For all instances, termwise quadratizations are slower than pairwise covers.
- Minimizing the number of auxiliary variables is not the only criterion to consider.


Overview

Introduction: Multilinear 0–1 optimization
Part I: Linear reformulations
Part II: Quadratic reformulations
Part III: Computational experiments
Conclusions and Perspectives

Conclusions

Summary:
- We considered linear and quadratic reformulations.
- We derived several theoretical results (strong linear formulations, study of properties of quadratizations, small numbers of auxiliary variables, ...).
- We compared the reformulations computationally.
- Some properties that define good reformulations:
  - Exploiting structural properties (worked best for the vision and physics problems).
  - For quadratizations, understanding the influence of the number of positive quadratic terms.


Open questions

Theoretical:
- Which is stronger: the SL of the original polynomial, or the SL of a quadratization?
- More generally, understand better which properties define a "good" quadratization, in theory and in practice.
- ...

Experimental:
- Experiments are being re-run using persistencies.
- Repeat the experiments with other solvers (convexification, SDP, ...).
- ...


Open question: relaxing the multilinear assumption

Joint supply chain design & inventory management problem (Shen, Coullard, & Daskin, 2003; You & Grossmann, 2008).

A pseudo-Boolean formulation (with constraints):

  min  Σ_{j∈J} [ f_j x_j + Σ_{i∈I} d_{ij} y_{ij} + K_j √(Σ_{i∈I} μ_i y_{ij}) + q √(Σ_{i∈I} σ_i² y_{ij}) ]
  s.t. Σ_{j∈J} y_{ij} = 1,               ∀i ∈ I
       y_{ij} ≤ x_j,                     ∀i ∈ I, ∀j ∈ J
       x_j ∈ {0,1}, y_{ij} ∈ {0,1},      ∀i ∈ I, ∀j ∈ J


Thank you for your attention!
rodriguez-heck@or.rwth-aachen.de

Bibliography

Anthony, M., Boros, E., Crama, Y., & Gruber, A. (2016). Quadratization of symmetric pseudo-Boolean functions. Discrete Applied Mathematics, 203, 1–12.

Anthony, M., Boros, E., Crama, Y., & Gruber, A. (2017). Quadratic reformulations of nonlinear binary optimization problems. Mathematical Programming, 162(1–2), 115–144.

Balas, E. (1974). Disjunctive programming: properties of the convex hull of feasible points. GSIA Management Science Research Report MSRR 348, Carnegie Mellon University. (Published as an invited paper in Discrete Applied Mathematics, 89 (1998), 3–44.)

Bernasconi, J. (1987). Low autocorrelation binary sequences: statistical mechanics and configuration space analysis. Journal de Physique France, 48(4), 559–567.

Boros, E., Crama, Y., & Rodríguez-Heck, E. (2018). Compact quadratizations for pseudo-Boolean functions. (Submitted.)

Boros, E., & Hammer, P. L. (2002). Pseudo-Boolean optimization. Discrete Applied Mathematics, 123(1), 155–225.

Boros, E., Hammer, P. L., Sun, X., & Tavares, G. (2008). A max-flow approach to improved lower bounds for quadratic unconstrained binary optimization (QUBO). Discrete Optimization, 5(2), 501–529. (In Memory of George B. Dantzig.)

Buchheim, C., Crama, Y., & Rodríguez-Heck, E. (2019). Berge-acyclic multilinear 0-1 optimization problems. European Journal of Operational Research, 273, 102–107.

Buchheim, C., & Rinaldi, G. (2007). Efficient reduction of polynomial zero-one optimization to the quadratic case. SIAM Journal on Optimization, 18(4), 1398–1413.

Conforti, M., & Cornuéjols, G. (1995). Balanced 0, ±1 matrices, bicoloring and total dual integrality. Mathematical Programming, 71(3), 249–258.

Conforti, M., Cornuéjols, G., & Vušković, K. (2006). Balanced matrices. Discrete Mathematics, 306(19–20), 2411–2437.

Crama, Y. (1993). Concave extensions for nonlinear 0–1 maximization problems. Mathematical Programming, 61(1), 53–60.

Crama, Y., & Rodríguez-Heck, E. (2017). A class of valid inequalities for multilinear 0-1 optimization problems. Discrete Optimization, 25, 28–47.

Del Pia, A., & Khajavirad, A. (2016). On balanced matrices and polynomial solvability of multilinear programs. (Personal communication.)

Del Pia, A., & Khajavirad, A. (2018). The multilinear polytope for acyclic hypergraphs. SIAM Journal on Optimization, 28(2), 1049–1076.

Fortet, R. (1959). L'algèbre de Boole et ses applications en recherche opérationnelle [Boolean algebra and its applications in operations research]. Cahiers du Centre d'Études de Recherche Opérationnelle, 4, 5–36.

Freedman, D., & Drineas, P. (2005). Energy minimization via graph cuts: settling what is possible. In IEEE Conference on Computer Vision and Pattern Recognition (Vol. 2, pp. 939–946).

Glover, F., & Woolsey, E. (1973). Further reduction of zero-one polynomial programming problems to zero-one linear programming problems. Operations Research, 21(1), 156–161.

Hammer, P. L., Hansen, P., & Simeone, B. (1984). Roof duality, complementation and persistency in quadratic 0–1 optimization. Mathematical Programming, 28(2), 121–155.

Hammer, P. L., Rosenberg, I., & Rudeanu, S. (1963). On the determination of the minima of pseudo-Boolean functions. Studii și Cercetări Matematice, 14, 359–364. (In Romanian.)

Hansen, P., & Simeone, B. (1986). Unimodular functions. Discrete Applied Mathematics, 14(3), 269–281.

Ishikawa, H. (2011). Transformation of general binary MRF minimization to the first-order case. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33(6), 1234–1249.

Kolmogorov, V., & Zabih, R. (2004). What energy functions can be minimized via graph cuts? IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(2), 147–159.

Liers, F., Marinari, E., Pagacz, U., Ricci-Tersenghi, F., & Schmitz, V. (2010). A non-disordered glassy model with a tunable interaction range. Journal of Statistical Mechanics: Theory and Experiment, (5), L05003.

Padberg, M. (1989). The boolean quadric polytope: some characteristics, facets and relatives. Mathematical Programming, 45(1–3), 139–172.

Shen, Z.-J. M., Coullard, C., & Daskin, M. S. (2003). A joint location-inventory model. Transportation Science, 37(1), 40–55.

You, F., & Grossmann, I. E. (2008). Mixed-integer nonlinear programming models and algorithms for large-scale supply chain design with stochastic inventory management. Industrial & Engineering Chemistry Research, 47(20), 7802–7817.
