+ All Categories
Home > Documents > Thiago Serra Egon Balas & Francois¸ Margot · Thiago Serra [email protected]@thserraAdvisors: Egon...

Thiago Serra Egon Balas & Francois¸ Margot · Thiago Serra [email protected]@thserraAdvisors: Egon...

Date post: 04-Jul-2020
Category:
Upload: others
View: 0 times
Download: 0 times
Share this document with a friend
1
Reformulating the disjunctive cut generating linear program Thiago Serra [email protected] @thserra Advisors: Egon Balas & Fran¸ cois Margot 1.| Introduction Lift-and-project resembles a Swiss army knife to define and generate valid cuts. But why using just one blade to cut in a single way? Perhaps we could avoid getting increasingly more parallel and shallow cuts by diversifying in the CGLP. 2.| Background Consider a binary linear program min{c T x : Ax b , x ∈{0, 1} n } with linear relaxation P := {x : Ax b , 0 x 1} := {x : ˜ Ax ˜ b } having a fractional basic solution ¯ x = arg min{c T x : x P }, 0 < ¯ x j < 1. Feasible solutions are covered by D j :={x : ˜ Ax ˜ b , - x j 0} ∪{x : ˜ Ax ˜ b , x j 1}, which convexifies as P j := conv (D j ). From Farkas lemma, any valid α T x β for P j is found in the (α, β )-projection of α -u T ˜ A +u 0 e j = 0 α -v T ˜ A -v 0 e j = 0 β -u T ˜ b 0 β -v T ˜ b -v 0 0 u , v , u 0 , v 0 0 (C) j For P j , Balas et al. [1993] show that all non-dominated inequalities are found in α -u T ˜ A +u 0 e j = 0 α -v T ˜ A -v 0 e j = 0 β -u T ˜ b = 0 β -v T ˜ b -v 0 = 0 u , v , u 0 , v 0 0 ( C) j and it has as dual the lift-and-project description of x P j as x = x 0 + x 1 for ˜ Ax 0 - ˜ bx 0 0 0 -x 0 j 0 ˜ Ax 1 - ˜ bx 1 0 0 x 1 j -x 1 0 0 x 0 0 + x 1 0 = 1 ( D) j with the nonnegativity of x 0 0 , x 1 0 0 relaxed [Ceria and Soares, 1997]. We look for α T x β separating ¯ x with a Cut Generating Linear Program (CGLP) minimizing α T x - β and bounding ( C) j : min α T ¯ x - β s.t. ( C) j u T e + v T e + u 0 + v 0 = 1 (CGLP) j However, the cut may be strictly dominated [Fischetti et al., 2011]: For a split on j = 1, (CGLP) j yields: (c1) if k 8; (c2) is k = 8; (c3) if k 8 3.| Reverse polar CGLP For some p P j , we propose the following formulation: min α T p - β s.t. ( C) j β - α T ¯ x = 1 (RP-CGLP) p j Using (C) j instead, (RP-CGLP) p j is an extended formulation of the reverse polar ( P j - ¯ x ) - := y : y T (x - ¯ x ) 1 x P j . Following the reverse polar notation, we denote cuts in the form ¯ y T (x - ¯ x ) 1. Cuts from (RP-CGLP) p j define supporting hyperplanes of P j x p Among parallel inequalities, getting closer to p reduces ky k since distance to ¯ x × cut norm = 1, thus reducing y T (p - ¯ x ). No cutting plane is strictly dominated How facets of P j are combined in cutting planes from (RP-CGLP) p j ? Let the facet-defining inequalities of P j define F = (γ i ) T (x - ¯ x ) δ i i F with F finite for A, b rational. W.l.o.g., let us partition F = F + F 0 F - , where: δ i = 1, i F + 0, i F 0 -1, i F - A cut ¯ y T (x - ¯ x ) 1 from (RP-CGLP) p j has a nonnegative combination { ¯ λ i } i F : X i F ¯ λ i γ i =y X i F + ¯ λ i - X i F - ¯ λ i =1 For a cut ¯ y T (x - ¯ x ) 1 from (RP-CGLP) p j , either ¯ λ i = 0 or (γ i ) T (p - ¯ x ) - δ i = 0 i F 0 F - If neither holds, we could find a face-defining cut with strictly better objective value by reducing such ¯ λ i and accordingly other multipliers ¯ λ j , j F + . If p int (P j ), a cut from (RP-CGLP) p j is a combination of facet-defining inequalities separating ¯ x If p int (P j ), then ¯ λ i = 0 i F 0 F - . For a cut ¯ y T (x - ¯ x ) 1 from (RP-CGLP) p j , it holds that ¯ y T (p - ¯ x )=(γ i ) T (p - ¯ x ) i F + : ¯ λ i > 0 If facet i has larger slack than facet j in a proper combination, there is a face-defining cut with strictly better value where ¯ λ j ¯ λ j + ¯ λ i and ¯ λ i = 0. A cut ¯ y T (x - ¯ x ) 1 from (RP-CGLP) p j is a combination of inequalities in F that first intersect the ray from ¯ x to p The objective becomes min α T (p - ¯ x ) - 1 by replacing β . Points defining the same ray with ¯ x yield the same cuts. For those closest to ¯ x , (γ i ) T (p - ¯ x ) - δ i = 0 i F + . 4.| Equivalent formulations A similar reformulation is proposed by Balas and Perregaard [2002]: min α T ¯ x - β s.t. ( C) j α T (p - ¯ x )= 1 (BP-CGLP) p j They prove and for (BP-CGLP) p j Inspired by Cadoux and Lemar´ echal [2013], we denote as the polar CGLP the following bounded CGLP formulation: min α T ¯ x - β s.t. ( C) j α T p - β = 1 (P-CGLP) p j Property follows similarly. With objective restated as min α T ( ¯ x - p ) - 1, getting closer to p increases kαk. Buchheim et al. [2008] enumerates extreme points to define a Generalized Intersection Cut (GIC) equivalent [Balas and Kis, 2016] of formulation (P-CGLP) p j , but replacing P j with P ∩{0, 1} n . (RP-CGLP) p j , (BP-CGLP) p j , and (P-CGLP) p j yield the same cuts Let us denote cuts separating ¯ x as μ T x ν , kν k = 1. For each CGLP, there is a scale factor θ> 0 :(α, β )= θ (μ, ν ). (RP-CGLP) p j : θ = 1 ν -μ T ¯ x min α T p - β = min θ (μ T p - ν )= min μ T p -ν ν -μ T ¯ x (P-CGLP) p j : θ = 1 μ T p -ν min α T ¯ x - β = min θ (μ T ¯ x - ν )= max ν -μ T ¯ x μ T p -ν (BP-CGLP) p j : θ = 1 (μ) T (p - ¯ x ) min α T ¯ x - β = ... = min μ T p -ν ν -μ T ¯ x + 1 CGLP dual yields first intersection The lift-and-project primal of (BP-CGLP) p j yields x 0 + x 1 as the intersection from : min ω s.t. ( D) j x 0 + x 1 + ω ( ¯ x - p )= ¯ x (BP-L&P) p j 5.| Finding p Any proper convex combination of points in the relative interior of each term of D j is in the relative interior of P j Any point in the relative interior of P j has a convex combination of points in the relative interior of each term of D j For any cut, all we need is a point in the relative interior of each term of D j Using , we can get a cut satisfying property if P j is full-dimensional. On each term of the disjunction, we identify inequalities always satisfied at equality and get a point maximizing the minimum slack on the remaining ones. Let p be a combination in proportion to the minimum slacks of those points. 6.| Results Similar performance in the first round and better performance in the second Gap closed (%) by adding split cuts on fractional variables with standard (S) and RP CGLP (R) and resolving on MIPLIB: 1 round 2 rounds S R S+S S+R R+S R+R bm23 5.6 5.7 12.2 13.4 11.4 13.6 lseu 4.2 4.2 14.8 31.2 17.0 31.2 mod008 0.1 0.1 1.6 1.9 0.5 1.9 p0033 2.6 2.2 5.6 11.0 6.6 11.0 p0040 6.7 6.7 11.5 11.4 11.0 11.4 p0201 0.0 0.0 16.9 16.5 15.6 0.0 p0282 48.4 62.9 54.3 71.5 74.0 78.3 p0291 50.5 46.9 84.9 84.9 91.0 91.3 sentoy 8.2 7.8 15.2 16.3 14.3 17.0 Varied CGLPs on ¯ x and p int (P j ) have good properties and yield the same cuts The cut is a combination of facets from P j separating ¯ x and laying between p and ¯ x . Contraposes the concern by Cadoux and Lemar´ echal [2013] with reverse polar normalization due to its unboundedness. Extends properties in Buchheim et al. [2008] to lift-and-project formulations, both bringing the supporting hyperplane method from Veinott [1967] to MIP. RP CGLP is better for multiple root cuts The feasible set is invariant to p , hence changing it only affects cut evaluation. 7.| Next steps Beyond finding just some p int (P j ): find p yielding one, facet-defining cut find such points yielding disjoint cuts If P j is not full-dimensional, we could integrate facial decomposition [Borwein and Wolkowicz, 1981] to the CGLP. Generating p and solving the CGLP is expensive. Learning what we need from both, we should aim for slim equivalents: | References E. Balas and T. Kis. On the relationship between standard intersection cuts, lift-and-project cuts, and generalized intersection cuts. Mathematical Programming (Articles in Advance), 2016. E. Balas and M. Perregaard. Lift-and-project for mixed 0–1 programming: recent progress. Discrete Applied Mathematics, 123:129–154, 2002. E. Balas, S. Ceria, and G. Cornu´ ejols. A lift- and-project cutting plane algorithm for mixed 0-1 programs. Math. Progr., 58:295–324, 1993. J. Borwein and H. Wolkowicz. Regularizing the abstract convex program. Journal of Mathematical Analysis and Applications, 83(2):495 – 530, 1981. C. Buchheim, F. Liers, and M. Oswald. Local cuts revisited. Ops. Res. Letters, 36:430–433, 2008. F. Cadoux and C. Lemar´ echal. Reflections on generating (disjunctive) cuts. EURO J. Comput. Optim., 1(1-2):51–69, 2013. S. Ceria and J. Soares. Disjunctive cuts for mixed 0-1 programming: duality and lifting. GSB, 1997. M. Fischetti, A. Lodi, and A. Tramontani. On the separation of disjunctive cuts. Mathematical Programming A, 128:205–230, 2011. A.F. Jr. Veinott. The supporting hyperplane method for unimodal programming. Operations Research, 15(1):147–152, 1967.
Transcript
Page 1: Thiago Serra Egon Balas & Francois¸ Margot · Thiago Serra tserra@cmu.edu@thserraAdvisors: Egon Balas & Francois¸ Margot 1.jIntroduction Lift-and-project resembles a Swiss army

Reformulating the disjunctive cut generating linear programThiago Serra [email protected] @thserra Advisors: Egon Balas & Francois Margot

1.| Introduction

Lift-and-project resembles a Swiss armyknife to define and generate valid cuts.

But why using just one blade to cut in asingle way? Perhaps we could avoidgetting increasingly more parallel andshallow cuts by diversifying in the CGLP.

2.| Background

Consider a binary linear program

min{cTx : Ax ≥ b, x ∈ {0,1}n}with linear relaxation

P := {x : Ax ≥ b, 0 ≤ x ≤ 1}:= {x : Ax ≥ b}

having a fractional basic solution

x = arg min{cTx : x ∈ P}, 0 < xj < 1.

Feasible solutions are covered by

Dj :={x : Ax ≥ b, − xj ≥ 0}∪{x : Ax ≥ b, xj ≥ 1},

which convexifies as Pj := conv(Dj).From Farkas lemma, any valid αTx ≥ βfor Pj is found in the (α, β)-projection of

α −uT A +u0ej = 0α −vT A −v0ej = 0β −uT b ≤ 0β −vT b −v0 ≤ 0

u, v , u0, v0 ≥ 0

(C)j

For Pj, Balas et al. [1993] show that allnon-dominated inequalities are found in

α −uT A +u0ej = 0α −vT A −v0ej = 0β −uT b = 0β −vT b −v0 = 0

u, v , u0, v0 ≥ 0

(C)j

and it has as dual the lift-and-projectdescription of x ∈ Pj as x = x0 + x1 for

Ax0 −bx00 ≥ 0

−x0j ≥ 0

Ax1 −bx10 ≥ 0

x1j −x1

0 ≥ 0x0

0 + x10 = 1

(D)j

with the nonnegativity of x00 , x

10 ≥ 0

relaxed [Ceria and Soares, 1997].

We look for αTx ≥ β separating x with aCut Generating Linear Program (CGLP)minimizing αTx − β and bounding (C)j :

min αT x − βs.t. (C)j

uTe + vTe + u0 + v0 = 1(CGLP)j

However, the cut may be strictlydominated [Fischetti et al., 2011]:

For a split on j = 1, (CGLP)j yields:(c1) if k ≤ 8; (c2) is k = 8; (c3) if k ≥ 8

3.| Reverse polar CGLP

For some p ∈ Pj, we propose thefollowing formulation:

min αTp − βs.t. (C)j

β − αT x = 1(RP-CGLP)pj

Using (C)j instead, (RP-CGLP)pj is an

extended formulation of the reverse polar(Pj−x

)−:={

y : yT (x − x) ≥ 1 ∀x ∈ Pj}

.

Following the reverse polar notation, wedenote cuts in the form yT (x − x) ≥ 1.

¶ Cuts from (RP-CGLP)pj define

supporting hyperplanes of Pj

x

p

Among parallel inequalities, gettingcloser to p reduces ‖y‖ since distance to x× cut norm = 1, thus reducing yT (p − x).

Ü No cutting plane is strictly dominated

How facets of Pj are combined in cuttingplanes from (RP-CGLP)p

j ?

Let the facet-defining inequalities of Pj

define F ={

(γ i)T (x − x) ≥ δi}

i∈F with Ffinite for A,b rational. W.l.o.g., let uspartition F = F + ∪ F 0 ∪ F−, where:

δi =

1, i ∈ F +

0, i ∈ F 0

−1, i ∈ F−

A cut yT (x − x) ≥ 1 from (RP-CGLP)pj

has a nonnegative combination {λi}i∈F :

∑i∈F

λiγi =y∑

i∈F+

λi −∑i∈F−

λi =1

· For a cut yT (x − x) ≥ 1 from(RP-CGLP)p

j , either λi = 0 or(γ i)T (p − x)− δi = 0 ∀i ∈ F 0 ∪ F−

If neither holds, we could find aface-defining cut with strictly betterobjective value by reducing such λi andaccordingly other multipliers λj, j ∈ F +.

¸ If p ∈ int(Pj), a cut from (RP-CGLP)pj

is a combination of facet-defininginequalities separating x

If p ∈ int(Pj), then λi = 0 ∀i ∈ F 0 ∪ F−.

¹ For a cut yT (x − x) ≥ 1 from(RP-CGLP)p

j , it holds thatyT (p − x) = (γ i)T (p − x) ∀i ∈ F + : λi > 0

If facet i has larger slack than facet j in aproper combination, there is aface-defining cut with strictly bettervalue where λj ← λj + λi and λi = 0.

Î A cut yT (x − x) ≥ 1 from (RP-CGLP)pj

is a combination of inequalities in Fthat first intersect the ray from x to p

The objective becomes minαT (p − x)− 1by replacing β. Points defining the sameray with x yield the same cuts. For thoseclosest to x , (γ i)T (p − x)− δi = 0 ∀i ∈ F +.

4.| Equivalent formulations

A similar reformulation is proposedby Balas and Perregaard [2002]:

min αT x − βs.t. (C)j

αT (p − x) = 1(BP-CGLP)pj

Ü They prove ¶ and º for (BP-CGLP)pj

Inspired by Cadoux and Lemarechal[2013], we denote as the polar CGLP thefollowing bounded CGLP formulation:

min αT x − βs.t. (C)j

αTp − β = 1(P-CGLP)pj

Ü Property ¶ follows similarly. Withobjective restated as minαT (x − p)− 1,getting closer to p increases ‖α‖.

Buchheim et al. [2008] enumeratesextreme points to define a GeneralizedIntersection Cut (GIC) equivalent [Balasand Kis, 2016] of formulation (P-CGLP)p

j ,but replacing Pj with P∩{0,1}n.

» (RP-CGLP)pj , (BP-CGLP)p

j , and(P-CGLP)p

j yield the same cuts

Let us denote cuts separating x asµTx ≥ ν, ‖ν‖ = 1. For each CGLP, there isa scale factor θ > 0 : (α, β) = θ(µ, ν).

(RP-CGLP)pj : θ = 1

ν−µT x

minαTp − β = min θ(µTp − ν) = min µT p−νν−µT x

(P-CGLP)pj : θ = 1

µT p−ν

minαT x −β = min θ(µT x − ν) = max ν−µT xµT p−ν

(BP-CGLP)pj : θ = 1

(µ)T (p−x)

minαT x − β = . . . = min µT p−νν−µT x + 1

¼ CGLP dual yields first intersection

The lift-and-project primal of (BP-CGLP)pj

yields x0 + x1 as the intersection from º:

min ω

s.t. (D)jx0 + x1 + ω(x − p) = x

(BP-L&P)pj

5.| Finding p

½ Any proper convex combination ofpoints in the relative interior of eachterm of Dj is in the relative interior of Pj

¾ Any point in the relative interior of Pj

has a convex combination of points inthe relative interior of each term of Dj

Ü For any cut, all we need is a point inthe relative interior of each term of Dj

Using ½, we can get a cut satisfyingproperty ¸ if Pj is full-dimensional.

On each term of the disjunction, weidentify inequalities always satisfied atequality and get a point maximizing theminimum slack on the remaining ones.

Let p be a combination in proportion tothe minimum slacks of those points.

6.| Results

Similar performance in the first roundand better performance in the second

Gap closed (%) by adding split cuts onfractional variables with standard (S) andRP CGLP (R) and resolving on MIPLIB:

1 round 2 roundsS R S+S S+R R+S R+R

bm23 5.6 5.7 12.2 13.4 11.4 13.6lseu 4.2 4.2 14.8 31.2 17.0 31.2mod008 0.1 0.1 1.6 1.9 0.5 1.9p0033 2.6 2.2 5.6 11.0 6.6 11.0p0040 6.7 6.7 11.5 11.4 11.0 11.4p0201 0.0 0.0 16.9 16.5 15.6 0.0p0282 48.4 62.9 54.3 71.5 74.0 78.3p0291 50.5 46.9 84.9 84.9 91.0 91.3sentoy 8.2 7.8 15.2 16.3 14.3 17.0

Varied CGLPs on x and p ∈ int(Pj) havegood properties and yield the same cuts

The cut is a combination of facets from Pj

separating x and laying between p and x .

Ü Contraposes the concern by Cadouxand Lemarechal [2013] with reverse polarnormalization due to its unboundedness.

Ü Extends properties in Buchheim et al.[2008] to lift-and-project formulations,both bringing the supporting hyperplanemethod from Veinott [1967] to MIP.

RP CGLP is better for multiple root cuts

The feasible set is invariant to p, hencechanging it only affects cut evaluation.

7.| Next steps

Beyond finding just some p ∈ int(Pj):

Ü find p yielding one, facet-defining cut

Ü find such points yielding disjoint cuts

If Pj is not full-dimensional, we couldintegrate facial decomposition [Borweinand Wolkowicz, 1981] to the CGLP.

Generating p and solving the CGLP isexpensive. Learning what we need fromboth, we should aim for slim equivalents:

| References

E. Balas and T. Kis. On the relationship betweenstandard intersection cuts, lift-and-project cuts,and generalized intersection cuts. MathematicalProgramming (Articles in Advance), 2016.

E. Balas and M. Perregaard. Lift-and-project formixed 0–1 programming: recent progress.Discrete Applied Mathematics, 123:129–154, 2002.

E. Balas, S. Ceria, and G. Cornuejols. A lift-and-project cutting plane algorithm for mixed0-1 programs. Math. Progr., 58:295–324, 1993.

J. Borwein and H. Wolkowicz. Regularizing theabstract convex program. Journal of MathematicalAnalysis and Applications, 83(2):495 – 530, 1981.

C. Buchheim, F. Liers, and M. Oswald. Local cutsrevisited. Ops. Res. Letters, 36:430–433, 2008.

F. Cadoux and C. Lemarechal. Reflections ongenerating (disjunctive) cuts. EURO J. Comput.Optim., 1(1-2):51–69, 2013.

S. Ceria and J. Soares. Disjunctive cuts for mixed0-1 programming: duality and lifting. GSB, 1997.

M. Fischetti, A. Lodi, and A. Tramontani. On theseparation of disjunctive cuts. MathematicalProgramming A, 128:205–230, 2011.

A.F. Jr. Veinott. The supporting hyperplanemethod for unimodal programming. OperationsResearch, 15(1):147–152, 1967.

Recommended