0% found this document useful (0 votes)

26 views37 pages

Practical Guidelines For Solving Difficult Mixed Integer Linear Programs

...

Uploaded by

Miloš Milenković

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

26 views37 pages

Practical Guidelines For Solving Difficult Mixed Integer Linear Programs

...

Uploaded by

Miloš Milenković

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 37

1 Practical Guidelines for Solving Difficult Mixed Integer Linear

2 Programs
3 Ed Klotz† • Alexandra M. Newman‡
†IBM, 926 Incline Way, Suite 100, Incline Village, NV 89451
‡Division of Economics and Business, Colorado School of Mines, Golden, CO 80401
[email protected] • [email protected]

4 Abstract

5 Even with state-of-the-art hardware and software, mixed integer programs can require hours,
6 or even days, of run time and are not guaranteed to yield an optimal (or near-optimal, or any!)
7 solution. In this paper, we present suggestions for appropriate use of state-of-the-art optimizers
8 and guidelines for careful formulation, both of which can vastly improve performance.

9 “Problems worthy of attack prove their worth by hitting back.”

10 –Piet Hein, Grooks 1966

11 “Everybody has a plan until he gets hit in the mouth.”

12 –Mike Tyson

13 Keywords: mixed integer linear programming, memory use, run time, tight formulations, cuts,
14 heuristics, tutorials

15 1 Introduction

16 Operations research practitioners have been formulating and solving integer programs since the
17 1950s. As computer hardware has improved (Bixby and Rothberg, 2007), practitioners have taken
18 the liberty to formulate increasingly detailed and complex problems, assuming that the correspond-
19 ing instances can be solved. Indeed, state-of-the-art ptimizers such as CPLEX (IBM, 2012), Gurobi
20 (Gurobi, 2012), MOPS (MOPS, 2012), Mosek (MOSEK, 2012), and Xpress-MP (FICO, 2012) can
21 solve many practical large-scale integer programs effectively. However, even if these “real-world”
22 problem instances are solvable in an acceptable amount of time (seconds, minutes or hours, de-
23 pending on the application), other instances require days or weeks of solution time. Although not
24 a guarantee of tractability, carefully formulating the model and tuning standard integer program-
25 ming algorithms often result in significantly faster solve times, in some cases, admitting a feasible
26 or near-optimal solution which could otherwise elude the practitioner.

1
27 In this paper, we briefly introduce integer programs and their corresponding commonly used
28 algorithm, show how to assess optimizer performance on such problems through the respective
29 algorithmic output, and demonstrate methods for improving that performance through careful for-
30 mulation and algorithmic parameter tuning. Specifically, there are many mathematically equivalent
31 ways in which to express a model, and each optimizer has its own set of default algorithmic parame-
32 ter settings. Choosing from these various model expressions and algorithmic settings can profoundly
33 influence solution time. Although it is theoretically possible to try each combination of parameter
34 settings, in practice, random experimentation would require vast amounts of time and would be
35 unlikely to yield significant improvements. We therefore guide the reader to likely performance-
36 enhancing parameter settings given fixed hardware, e.g., memory limits, and suggest methods for
37 avoiding performance failures a priori through careful model formulation. All of the guidelines we
38 present here apply to the model in its entirety. Many relaxation and decomposition methods, e.g.,
39 Lagrangian Relaxation, Benders’ Decomposition, and Column Generation (Dantzig-Wolfe Decom-
40 position), have successfully been used to make large problems more tractable by partitioning the
41 model into subproblems and solving these iteratively. A description of these methods is beyond
42 the scope of our paper; the practitioner should first consider attempting to improve algorithmic
43 performance or tighten the existing model formulation, as these approaches are typically easier and
44 less time consuming than reformulating the model and applying decomposition methods.
45 The reader should note that we assume basic familiarity with fundamental mathematics, such
46 as matrix algebra, and with optimization, in particular, with linear programming and the concepts
47 contained in Klotz and Newman (To appear). We expect that the reader has formulated linear
48 integer programs and has a conceptual understanding of how the corresponding problems can be
49 solved. Furthermore, we present an algebraic, rather than a geometric, tutorial, i.e., a tutorial based
50 on the mathematical structure of the problem and corresponding numerical algorithmic output,
51 rather than based on graphical analysis. The interested reader can refer to basic texts such as
52 Rardin (1998) and Winston (2004) for more detailed introductions to mathematical programming,
53 including geometric interpretations.
54 We have attempted to write this paper to appeal to a diverse audience. Readers with limited
55 mathematical programming experience who infrequently use optimization software and do not
56 wish to learn the details regarding how the underlying algorithms relate to model formulations
57 can still benefit from this paper by learning how to identify sources of slow performance based
58 on optimizer output. This identification will allow them to use the tables in the paper that list
59 potential performance problems and parameter settings that address them. More experienced
60 practitioners who are interested in the way in which the optimizer algorithm relates to the model

2
61 formulation will gain insight into new techniques for improving model formulations, including those
62 different from the ones discussed in this paper. While intended primarily for practitioners seeking
63 performance enhancements to practical models, theoretical researchers may still benefit. The same
64 guidelines that can help tighten specific practical models can also help in the development of the
65 theory associated with fundamental algorithmic improvements in integer programming, e.g., new
66 cuts and new techniques for preprocessing.
67 The remainder of the paper is organized as follows: In Section 2, we introduce integer programs,
68 the branch-and-bound algorithm, and its variants. Section 3 provides suggestions for successful al-
69 gorithm performance. Section 4 presents guidelines for and examples of tight formulations of integer
70 programs that lead to faster solution times. Section 5 concludes the paper with a summary. Section
71 2, with the exception of the tables, may be omitted without loss of continuity for the practitioner
72 interested only in formulation and algorithmic parameter tuning without detailed descriptions of
73 the algorithms themselves. To illustrate the concepts we present in this paper, we show output logs
74 resulting from having run a commercial optimizer on a standard desktop machine. Unless otherwise
75 noted, this optimizer is CPLEX 12.2.0.2, and the machine possesses four single-core 3.0 gigahertz
76 Xeon chips and 8 gigabytes of memory.

77 2 Fundamentals

78 Consider the following system in which C is a set of indices on our variables x such that xj , j ∈ C
79 are nonnegative, continuous variables, and I is a set of indices on the variables x such that xj , j ∈ I
80 are nonnegative, integer variables. Correspondingly, cC and AC are the objective function and left-
81 hand-side constraint coefficients, respectively, on the nonnegative, continuous variables, and cI
82 and AI are the objective function and left-hand-side constraint coefficients, respectively, on the
83 nonnegative, integer variables. For the constraint set, the right-hand-side constants, b, are given as
84 an m × 1 column vector.

(PM IP ) : min cTC xC + cTI xI

subject to AC xC + AI xI = b
86

xC , xI ≥ 0, xI integer

87 Three noteworthy special cases of this standard mixed integer program are (i) the case in which
88 xI is binary, (ii) the case in which cC , AC , and xC do not exist and xI is general integer, and (iii)
89 the case in which cC , AC , and xC do not exist and xI is binary. Note that (iii) is a special case of

3
90 (i) and (ii). We refer to the first case as a mixed binary program, the second case as a pure integer
91 program, and the third case as a binary program. These cases can benefit from procedures such
92 as probing on binary variables (Savelsbergh, 1994), or even specialized algorithms. For example,
93 binary programs lend themselves to some established techniques in the literature that do not exist
94 if the algorithm is executed on an integer program. These techniques are included in most standard
95 branch-and-bound optimizers; however, some features that are specific to binary-only models, e.g.,
96 the additive algorithm of Balas (1965), can be lacking.
97 Branch-and-bound uses intelligent enumeration to arrive at an optimal solution for a (mixed)
98 integer program or any special case thereof. This involves construction of a search tree. Each node
99 in the tree consists of the original constraints in (PM IP ), along with some additional constraints on
100 the bounds of the integer variables, xI , to induce those variables to assume integer values. Thus,
101 each node is also a mixed integer program (MIP). At each node of the branch-and-bound tree, the
102 algorithm solves a linear programming relaxation of the restricted problem, i.e., the MIP with all
103 its variables relaxed to be continuous.
104 The root node at the top of the tree is (PM IP ) with the variables xI relaxed to assume continuous
105 values. Branch-and-bound begins by solving this problem. If the root node linear program (LP)
106 is infeasible, then the original problem (which is more restricted than its linear programming
107 relaxation) is also infeasible, and the algorithm terminates with no feasible solution. Similarly, if the
108 optimal solution to the root node LP has no integer restricted variables with fractional values, then
109 the solution is optimal for (PM IP ) as well. The most likely case is that the algorithm produces an
110 optimal solution for the relaxation with some of the integer-restricted variables assuming fractional
111 values. In this case, such a variable, xj = f , is chosen and branched on, i.e., two subproblems are
112 created – one with a restriction that xj ≤ ⌊f ⌋ and the other with a restriction that xj ≥ ⌈f ⌉. These
113 subproblems are successively solved, which results in one of the following three outcomes:
114 Subproblem Solution Outcomes (for a minimization problem)

115 • (i) The subproblem is optimal with all variables in I assuming integer values. In
116 this case, the algorithm can update its best integer feasible solution; this update tightens
117 the upper bound on the optimal objective value. Because the algorithm only seeks a single
118 optimal solution, no additional branches are created from this node; examining additional
119 branches cannot yield a better integer feasible solution. Therefore, the node is fathomed or
120 pruned.

121 • (ii) The subproblem is infeasible. In this case, no additional branching can restore
122 feasibility. As in (i), the node is fathomed.

4
123 • (iii) The subproblem has an optimal solution, but with some of the integer-
124 restricted variables in I assuming fractional values. There are two cases:

125 ⋆ a. The objective function value is dominated by the objective of the best integer feasible
126 solution. In other words, the optimal node LP objective is no better than the previously
127 established upper bound on the optimal objective for (PM IP ). In this case, no additional
128 branching can improve the objective function value of the node, and, as in (i), the node
129 is fathomed.

130 ⋆ b. The objective function value is not dominated by that of the best integer feasible
131 solution. The algorithm then processes the node in that it chooses a fractional xj ′ =
132 f ′ ; j ′ ∈ I to branch on by creating two child nodes and their associated subproblems –
133 one with a restriction that xj ′ ≤ ⌊f ′ ⌋ and the other with a restriction that xj ′ ≥ ⌈f ′ ⌉.
134 These restrictions are imposed on the subproblem in addition to any others from previous
135 branches in the same chain stemming from the root; each of these child subproblems is
136 subsequently solved. Note that while most implementations of the algorithm choose a
137 single integer variable from which to create two child nodes, the algorithm’s convergence
138 only requires that the branching divides the feasible region of the current node in a
139 mutually exclusive manner. Thus, branching on groups of variables or expressions of
140 variables is also possible.

141 Due to the exponential growth in the size of such a tree, exhaustive enumeration would quickly
142 become hopelessly computationally expensive for MIPs with even dozens of variables. The effective-
143 ness of the branch-and-bound algorithm depends on its ability to prune nodes. Effective pruning
144 relies on the fundamental property that the objective function value of each child node is either the
145 same as or worse than that of the parent node (both for the MIP at the node and the associated
146 LP relaxation). This property holds because every child node consists of the MIP in the parent
147 node plus an additional constraint (typically, the bound constraint on the branching variable).
148 As the algorithm proceeds, it maintains the incumbent integer feasible solution with the best
149 objective function determined thus far in the search. The algorithm performs updates as given in
150 (i) of Subproblem Solution Outcomes. The updated incumbent objective value provides an upper
151 bound on the optimal objective value. A better incumbent increases the number of nodes that can
152 be pruned in case (iii), part (a) by more easily dominating objective function values elsewhere in
153 the tree.
154 In addition, the algorithm maintains an updated lower bound on the optimal objective for
155 (PM IP ). The objective of the root node LP establishes a lower bound on the optimal objective

5
156 because its feasible region contains all integer feasible solutions to (PM IP ). As the algorithm
157 proceeds, it dynamically updates the lower bound by making use of the property that child node
158 objectives are no better than those of their parent. Because a better integer solution can only be
159 produced by the children of the currently unexplored nodes, this property implies that the optimal
160 objective value for (PM IP ) can be no better than the best unexplored node LP objective value.
161 As the algorithm continues to process nodes, the minimum LP objective of the unexplored nodes
162 can dynamically increase, improving the lower bound. When the lower bound meets the upper
163 bound, the algorithm terminates with an optimal solution. Furthermore, once an incumbent has
164 been established, the algorithm uses the difference between the upper bound and lower bound to
165 measure the quality of the solution relative to optimality. Thus, on difficult models with limited
166 computation time available, practitioners can configure the algorithm to stop as soon as it has
167 an integer feasible solution within a specified percentage of optimality. Note that most other
168 approaches to solving integer programs (e.g., tabu search, genetic algorithms) lack any sort of
169 bound, although it may be possible to derive one from the model instance. However, even if it is
170 possible to derive a bound, it is likely to be weak, and it probably remains static. Note that in the
171 case of a maximization problem, the best integer solution provides a lower bound on the objective
172 function value and the objective of the root node LP establishes an upper bound on the optimal
173 objective; the previous discussion holds, but with this reversal in bounds. Unless otherwise noted,
174 our examples are minimization problems, as given by our standard form in (PM IP ).
175 Figure 1 provides a tree used to solve a hypothetical integer program of the form (PM IP ) with
176 the branch-and-bound algorithm. Only the relevant subset of solution values is given at each node.
177 The numbers in parentheses outside the nodes denote the order in which the nodes are processed, or
178 examined. The inequalities on the arcs indicate the bound constraint placed on an integer-restricted
179 variable in the original problem that possesses a fractional value in a subproblem.

180 Node (1) is the root node. Its objective function value provides a lower bound on the mini-
181 mization problem. Suppose x1 , an integer-restricted variable in the original problem, possesses a
182 fractional value (3.5) at the root node solve. To preclude this fractional value from recurring in
183 any subsequent child node solve, we create two subproblems, one with the restriction that x1 ≤ 3,
184 i.e., x1 ≤ ⌊3.5⌋, and the other with the restriction that x1 ≥ 4, i.e., x1 ≥ ⌈3.5⌉. This is a mutually
185 exclusive and collectively exhaustive set of outcomes for x1 (and, hence, the original MIP) given
186 that x1 is an integer-restricted variable in the original problem.
187 Node (2) is the child node that results from branching down on variable x1 at node (1). Among
188 possibly others, x7 is an integer-restricted variable that assumes a fractional value when this sub-

6
(1)

root
3 x1 = 3.5 x1
≤ ≥
(2) x1 4 (3)

x7 = 2.3 x9 = 0.1
2 0
x7

x9
≤ ≤
≥

≥
x7 x9

1
3
(4) (5) (6) (7)

Integer x3 = 0.6 ??? Fathomed

0
x3
≤
≥
x3
1
(8) (9)

x8 = 0.3 Infeasible
x8
0
≤

≥
x8

(10) (11)

Integer ???

Figure 1: Branch-and-bound algorithm

189 problem at node (2) is solved; the solve consists of the root node problem and the additional
190 restriction that x1 ≤ 3. Because of this fractional value, we create two subproblems emanating
191 from node (2) in the same way in which we create them from node (1). The subproblem solve at
192 node (4), i.e., the solve consisting of the root node subproblem plus the two additional restrictions
193 that x1 ≤ 3 and x7 ≤ 2, results in an integer solution. At this point, we can update the upper
194 bound. That is, the optimal solution for this problem, an instance of (PM IP ), can never yield an
195 objective worse than that of the best feasible solution obtained in the tree.
196 At any point in the tree, nodes that require additional branching are considered active, or
197 unexplored. Nodes (6) and (11) remain unexplored. Additional processing has led to pruned nodes
198 (4), (7), and (9), either because the subproblem solve was infeasible, e.g., node (9), or because the
199 objective function value was worse than that of node (4), regardless of whether or not the resulting
200 solution was integer. As the algorithm progresses, it establishes an incumbent solution at node
201 (10). Because nodes (6) and (11) remain unexplored, improvement on the current incumbent can
202 only come from the solutions of the subproblems at nodes (6) and (11) or their descendants. The
203 descendants have an objective function value no better than that of either of these two nodes;
204 therefore, the optimal solution objective is bounded by the minimum of the optimal LP objectives

7
205 of nodes (6) and (11). Without loss of generality, assume node (11) possesses the lesser objective.
206 That objective value then provides a lower bound on the optimal objective for (PM IP ). We can
207 continue searching through the tree in this fashion, updating lower and upper bounds, until either
208 the gap is acceptably small, or until all the nodes have been processed.
209 The previous description of the branch-and-bound algorithm focuses on its fundamental steps.
210 Advances in the last 20 years have extended the algorithm from branch and bound to branch and
211 cut. Branch and cut, the current choice of most integer programming solvers, follows the same
212 steps as branch and bound, but it also can add cuts. Cuts consist of constraints involving linear
213 expressions of one or more variables that are added at the nodes to further improve performance. As
214 long as these cuts do not remove any integer feasible solutions, their addition does not compromise
215 the correctness of the algorithm. If done judiciously, the addition of such cuts can yield significant
216 performance improvements.

217 3 Guidelines for Successful Algorithm Performance

218 There are four common reasons that integer programs can require a significant amount of solution
219 time:

220 • (i) There is lack of node throughput due to troublesome linear programming node solves.

221 • (ii) There is lack of progress in the best integer solution, i.e., the upper bound.

222 • (iii) There is lack of progress in the best lower bound.

223 • (iv) There is insufficient node throughput due to numerical instability in the problem data
224 or excessive memory usage.

225 By examining the output of the branch-and-bound algorithm, one can often identify the cause(s)
226 of the performance problem. Note that integer programs can exhibit dramatic variations in run
227 time due to seemingly inconsequential changes to a problem instance. Specifically, differences such
228 as reordering matrix rows or columns, or solving a model with the same optimizer, but on a different
229 operating system, only affect the computations at very low-order decimal places. However, because
230 most linear programming problems drawn from practical sources have numerous alternate optimal
231 basic solutions, these slight changes frequently suffice to alter the path taken by the primal or dual
232 simplex method. The fractional variables eligible for branching are basic in the optimal node LP
233 solution. Therefore, alternate optimal bases can result in different branching variable selections.
234 Different branching selections, in turn, can cause significant performance variation if the model

8
235 formulation or optimizer features are not sufficiently robust to consistently solve the model quickly.
236 This notion of performance variability in integer programs is discussed in more detail in Danna
237 (2008) and Koch et al. (2011). However, regardless of whether an integer program is consistently
238 or only occasionally difficult to solve, the guidelines described in this section can help address
239 the performance problem. We now discuss each potential performance bottleneck and suggest an
240 associated remedy.

241 3.1 Lack of Node Throughput Due to Troublesome Linear Programming Node
242 Solves

243 Because processing each node in the branch-and-bound tree requires the solution of a linear pro-
244 gram, the choice of a linear programming algorithm can profoundly influence performance. An
245 interior point method may be used for the root node solve; it is less frequently used than the sim-
246 plex method at the child nodes because it lacks a basis and hence, the ability to start with an initial
247 solution, which is important when processing tens or hundreds of thousands of nodes. However,
248 conducting different runs in which the practitioner invokes the primal or the dual simplex method
249 at the child nodes is a good idea. Consider the following two node logs, the former corresponding
250 to solving the root and child node linear programs with the dual simplex method and the latter
251 with the primal simplex method.

252

253 Node Log #1: Node Linear Programs Solved with Dual Simplex

Nodes Cuts/ ItCnt

Node Left Objective IInf Best Integer Best Node
0 0 -89.0000 6 -89.0000 5278
0 0 -89.0000 6 Fract: 4 12799
0 2 -89.0000 6 -89.0000 12799
1 1 infeasible -89.0000 20767
2 2 -89.0000 5 -89.0000 27275
3 1 infeasible -89.0000 32502
...
8 2 -89.0000 8 -89.0000 65717
9 1 infeasible -89.0000 73714
...
Solution time = 177.33 sec. Iterations = 73714 Nodes = 10 (1)

9
254

255

256 Node Log #2: Node Linear Programs Solved with Primal Simplex

Nodes Cuts/ ItCnt

Node Left Objective IInf Best Integer Best Node
0 0 -89.0000 5 -89.0000 6603
0 0 -89.0000 5 Fract: 5 7120
0 2 -89.0000 5 -89.0000 7120
1 1 infeasible -89.0000 9621
2 2 -89.0000 5 -89.0000 10616
3 1 infeasible -89.0000 12963
...
8 2 -89.0000 8 -89.0000 21522
9 1 infeasible -89.0000 23891
...
Solution time = 54.37 sec. Iterations = 23891 Nodes = 10 (1)

257

258 The iteration count for the root node solve shown in Node Log #1 that occurred without
259 any advanced start information indicates 5,278 iterations. Computing the average iteration count
260 across all node LP solves, there are 11 solves (10 nodes, and 1 extra solve for cut generation at node
261 0) and 73,714 iterations, which were performed in a total of 177 seconds. The summary output in
262 gray indicates in parentheses that one unexplored node remains. So, the average solution time per
263 node is approximately 17 seconds, and the average number of iterations per node is about 6,701.
264 In Node Log #2, the solution time is 54 seconds, at which point the algorithm has performed 11
265 solves, and the iteration count is 23,891. The average number of iterations per node is about 2,172.
266 In Node Log #1, the 10 child node LPs require more iterations, 6,844, on average, than the
267 root node LP (which requires 5,278), despite the advanced basis at the child node solves that was
268 absent at the root node solve. Any time this is true, or even when the average node LP iteration
269 count is more than 30-50% of the root node iteration count, an opportunity for improving node
270 LP solve times exists by changing algorithms or algorithmic settings. In Node Log #2, the 10
271 child node LPs require 1,729 iterations, on average, which is much fewer than those required by
272 the root node solve, which requires 6,603 (solving the LP from scratch). Hence, switching from the

10
273 dual simplex method in Node Log #1 to the primal simplex method in Node Log #2 increases
274 throughput, i.e., decreases the average number of iterations required to solve a subproblem in the
275 branch-and-bound tree.
276 The different linear programming algorithms can also benefit by tuning the appropriate opti-
277 mizer parameters. See Klotz and Newman (To appear) for a detailed discussion of this topic.

278 3.2 Lack of Progress in the Best Integer Solution

279 An integer programming algorithm may struggle to obtain good feasible solutions. Node Log #3
280 illustrates a best integer solution found before node 300 of the solve that has not improved by node
281 7800 of the same solve:

282

283 Node Log #3: Lack of Progress in Best Integer Solution

284 Nodes Cuts/ ItCnt Gap

285 Node Left Objective IInf Best Integer Best Node
286 ...
287 300 229 22.6667 40 31.0000 22.0000 4433 29.03%
288 400 309 cutoff 31.0000 22.3333 5196 27.96%
289 500 387 26.5000 31 31.0000 23.6667 6164 26.88%
290 ...
291 7800 5260 28.5000 23 31.0000 25.6667 55739 17.20%

292

293 Many state-of-the-art optimizers have built-in heuristics to determine initial and improved in-
294 teger solutions. However, it is always valuable for the practitioner to supply the algorithm with an
295 initial solution, no matter how obvious it may appear to a human. Such a solution may provide
296 a better starting point than what the algorithm can derive on its own, and algorithmic heuristics
297 may perform better in the presence of an initial solution, regardless of the quality of its objective
298 function value. In addition, the faster progress in the cutoff value associated with the best inte-
299 ger solution may enable the optimizer features such as probing to fix additional variables, further
300 improving performance. Common tactics to find such starting solutions include the following:

301 • Provide an obvious solution based on specific knowledge of the model. For example, models
302 with integer penalty variables may benefit from a starting solution with a significant number
303 (or even all) of the penalty variables set to non-zero values.

11
304 • Solve a related, auxiliary problem to obtain a solution (e.g., via the Feasopt method in
305 CPLEX, which looks for feasible solutions by minimizing infeasibilities), provided that the
306 gain from the starting solution exceeds the auxiliary solve time.

307 • Use the solution from a previous solve for the next solve when solving a sequence of models.

308 To see the advantages of providing a starting point, compare Node Log #5 with Node Log
309 #4. Log #4 shows that CPLEX with default settings takes about 1589 seconds to find a first
310 feasible solution, with an associated gap of 4.18%. Log #5 illustrates the results obtained by
311 solving a sequence of five faster optimizations (see Lambert et al. (to appear) for details) to obtain
312 a starting solution with a gap of 2.23%. The total computation time to obtain the starting solution
313 is 623 seconds. So, the time to obtain the first solution is faster by providing an initial feasible
314 solution, and if we let the algorithm with the initial solution run for an additional 1589 − 623 = 966
315 seconds, the gap for the instance with the initial solution improves to 1.53%.

316

317 Node Log #4: No initial practitioner-supplied solution

Root relaxation solution time = 131.45 sec.

Nodes Cuts/
Node Left Objective IInf Best Integer Best Node ItCnt Gap

0 0 1.09590e+07 2424 1.09590e+07 108111

0 0 1.09570e+07 2531 Cuts: 4 108510
0 0 1.09405e+07 2476 Cuts: 2 109208
Heuristic still looking.
Heuristic still looking.
Heuristic still looking.
Heuristic still looking.
Heuristic still looking.
0 2 1.09405e+07 2476 1.09405e+07 109208
Elapsed real time = 384.09 sec. (tree size = 0.01 MB)
1 3 1.08913e+07 2488 1.09405e+07 109673
2 4 1.09261e+07 2326 1.09405e+07 109977
...
1776 1208 1.05645e+07 27 1.09164e+07 474242

12
1814 1246 1.05588e+07 31 1.09164e+07 478648
1847 1277 1.05554e+07 225 1.09164e+07 484687
* 1880+ 1300 1.04780e+07 1.09164e+07 491469 4.18%
1880 1302 1.05474e+07 228 1.04780e+07 1.09164e+07 491469 4.18%
Elapsed real time = 1589.38 sec. (tree size = 63.86 MB)

318

319

320 Node Log #5: An initial solution supplied by the practitioner

Root relaxation solution time = 93.92 sec.

Nodes Cuts/
Node Left Objective IInf Best Integer Best Node ItCnt Gap

* 0+ 0 1.07197e+07 108111 ---

0 0 1.09590e+07 2424 1.07197e+07 1.09590e+07 108111 2.23%
0 0 1.09570e+07 2531 1.07197e+07 Cuts: 4 108538 2.21%
...

485 433 1.09075e+07 2398 1.07197e+07 1.08840e+07 244077 1.53%

487 434 1.08237e+07 2303 1.07197e+07 1.08840e+07 244350 1.53%
497 439 1.08637e+07 1638 1.07197e+07 1.08840e+07 245391 1.53%
Elapsed real time = 750.11 sec. (tree size = 32.61 MB)
501 443 1.08503e+07 1561 1.07197e+07 1.08840e+07 245895 1.53%
...
Elapsed real time = 984.03 sec. (tree size = 33.00 MB)
1263 674 1.08590e+07 2574 1.07197e+07 1.08840e+07 314814 1.53%

321

322 In the absence of a readily identifiable initial solution, various branching strategies can aid in
323 obtaining initial and subsequent solutions. These branching strategies may be based purely on the
324 algebraic structure of the model. For example, by using depth-first search, the branch-and-bound
325 algorithm never defers processing a node until it has been pruned. This strategy helps find integer

13
326 feasible solutions sooner, although it potentially slows progress in the best bound. (Recall, the best
327 lower bound for a minimization problem is updated once all nodes with relaxation objective value
328 equal to the lower bound have been processed.) In other cases, branching strategies may involve
329 specific aspects of the model. For example, branching up, i.e., processing the subproblem associated
330 with the greater bound as a restriction on its branch, in the presence of many set partitioning
P
331 constraints ( i xi = 1, xi binary) not only fixes the variable on the associated branch in the
332 constraint to 1, but it also fixes all other variables in the constraint to a value of 0 in the children
333 of the current node. By contrast, branching down does not yield the ability to fix any additional
334 variables.
335 Improvements to the model formulation can also yield better feasible solutions faster. Differ-
336 entiation in the data, e.g., by adding appropriate discounting factors to cost coefficients in the
337 objective function, helps the algorithm distinguish between dominated and dominating solutions,
338 which expedites the discovery of improving solutions.

339 3.3 Lack of Progress in the Best Bound

340 The branch-and-bound depiction in Figure 1 and the corresponding discussion illustrate how the
341 algorithm maintains and updates a lower bound on the objective function value for the minimization
342 integer program. (Note that this would correspond to an upper bound for a maximization problem.)
343 The ability to update the best bound effectively depends on the best objective function value of
344 all active subproblems, i.e., the associated LP objective function value of the nodes that have not
345 been fathomed. If successive subproblems, i.e., subproblems corresponding to nodes lying deeper
346 in the tree, do not possess significantly worse objective function values, the bound does not readily
347 approach the true objective function value of the original integer program. Furthermore, the greater
348 the number of active, i.e., unfathomed, nodes deeper in the tree, the smaller the chance of a tight
349 bound, which always corresponds to the weakest (lowest, for a minimization problem) objective
350 function value of any active node. These objective function values, and the associated bounds they
351 generate, in turn, depend on the strength of the model formulation, i.e., the difference between
352 the polyhedron associated with the LP relaxation of (PM IP ) and the polyhedron consisting of the
353 convex hull of all integer feasible solutions to (PM IP ). Figure 2 provides an illustration. The region
354 P1 represents the convex hull of all integer feasible solutions of the MIP, while P2 represents the
355 feasible region of the LP relaxation. Adding cuts yields the region P3 , which contains all integer
356 solutions of the MIP, but contains only a subset of the fractional solutions feasible for P2 .
357 Node log #6 exemplifies progress in best integer solution but not in the best bound:

358

14
âT1 x ≤ b̂1
âT2 x ≤ b̂2
P1 := conv{x ∈ Zn : Ax ≤ b, x ≥ 0}
P2 := {x ∈ Rn : Ax ≤ b, x ≥ 0}
P3 := P2 ∩ {x ∈ Rn : Âx ≤ b̂}
P2
P3 Cuts must satisfy
1) âTi x ≤ b̂i ∀x ∈ P1 (validity)

P1 2) ∃ x ∈ P2 : âTi x > b̂i (separation)

âT3 x ≤ b̂3

Figure 2: Convex hull

359 Node Log #6: Progress in Best Integer Solution but not in the Best Bound

360 Nodes Cuts/ ItCnt Gap

361 Node Left Objective IInf Best Integer Best Node
362

363 300 296 2018.0000 27 3780.0000 560.0000 3703 85.19%

364 * 300+ 296 0 2626.0000 560.0000 3703 78.67%
365 * 393 368 0 2590.0000 560.0000 4405 78.38%
366 400 372 560.0000 291 2590.0000 560.0000 4553 78.38%
367 500 472 810.0000 175 2590.0000 560.0000 5747 78.38%
368 ...
369 * 7740+ 5183 0 1710.0000 560.0000 66026 67.25%
370 7800 5240 1544.0000 110 1710.0000 560.0000 66279 67.25%
371 7900 5325 944.0000 176 1710.0000 560.0000 66801 67.25%
372 8000 5424 1468.0000 93 1710.0000 560.0000 67732 67.25%

373

374 To strengthen the bound, i.e., to make its value closer to that of the optimal objective function
375 value of the integer program, we can modify the integer program by adding special constraints.
376 These constraints, or cuts, do not excise any integer solutions that are feasible in the unmodified
377 integer program. A cut that does not remove any integer solutions is valid. However, the cuts
378 remove portions of the feasible region that contain fractional solutions. If the removed area contains
379 the fractional solution resulting from the LP relaxation of the integer program, we say the cut

15
380 is useful (Rardin, 1998), or that the cut separates the fractional solution from the resulting LP
381 relaxation feasible region. In this case, the cut improves the bound by increasing the original LP
382 objective. There are various problem structures that lend themselves to different types of cuts.
383 Thus, we have a general sense of cuts that could be useful. However, without the LP relaxation
384 solution, it is difficult to say a priori which cuts are definitely useful.

x1 x2

x3 x4 x5

EX
Figure 3: Conflict Graph for numerical example (PBinary )

385 Let us consider the following numerical example, in this case, for ease of illustration, a maxi-
386 mization problem:

EX
(PBinary ) max 3x1 + 2x2 + x3 + 2x4 + x5 (1)

subject to x1 + x2 ≤ 1 (2)

x1 + x3 ≤ 1 (3)

x2 + x3 ≤ 1 (4)

4x3 + 3x4 + 5x5 ≤ 10 (5)

x1 + 2x4 ≤ 2 (6)

3x2 + 4x5 ≤ 5 (7)

xi binary ∀i (8)

387
EX
A cover cut based on the knapsack constraint of (PBinary ), 4x3 +3x4 +5x5 ≤ 10, is x3 +x4 +x5 ≤ 2.
388 That is, at most two of the three variables can assume a value of 1 while maintaining feasibility of the
389 knapsack constraint (5). Adding this cut is valid since it is satisfied by all integer solutions feasible
390 for the constraint. It also separates the fractional solution (x1 = 0, x2 = 0, x3 = 1, x4 = 31 , x5 = 1)
391 from the LP relaxation feasible region. Now consider the three packing constraints, (2) − (4):

16
392 x1 + x2 ≤ 1, x1 + x3 ≤ 1, and x2 + x3 ≤ 1. We can construct a conflict graph (see Figure 3) for the
393 whole model, with each vertex corresponding to a binary variable and each edge corresponding to
394 a pair of variables, both of which cannot assume a value of 1 in any feasible solution. A clique is a
395 set of vertices such that every two in the set are connected by an edge. At most one variable in a
396 clique can equal 1. Hence, the vertices associated with x1 , x2 and x3 form a clique, and we derive
397 the cut: x1 + x2 + x3 ≤ 1. In addition, constraints (6) and (7) generate the edges {1, 4} and {2, 5}
398 in the conflict graph, revealing the cuts x1 + x4 ≤ 1 and x2 + x5 ≤ 1. One could interpret these
399 cuts as either clique cuts from the conflict graph, or cover cuts derived directly from constraints
400 (6) and (7). Note that not only does each of these clique cuts separate fractional solutions from
401 the LP relaxation feasible region (as did the cover cut above), but they are also useful in that they
402 remove the LP relaxation solution ( 21 , 1 1 3 7
2, 2, 4, 8) from the feasible region.
403 The derivations of both clique and cover cuts rely on identifying a linear expression of variables
404 that assumes an integral value in any integer feasible solution, then determining the integer upper
405
EX
(right-hand-side) limit on the expression. In the case of the cover cut for our example (PBinary ),
406 x3 , x4 and x5 form a cover, which establishes that x3 + x4 + x5 ≥ 3 is infeasible for any integer
407 solution to the model. Therefore, x3 + x4 + x5 ≤ 2 is valid for any integer feasible solution to
408
EX
(PBinary ). Similarly, the clique in the conflict graph identifies the integral expression x1 + x2 + x3
409 and establishes that x1 + x2 + x3 ≥ 2 is infeasible for any integer solution to the model. Therefore,
410
EX
x1 + x2 + x3 ≤ 1 is valid for any integer feasible solution to (PBinary ). This cut removes fractional
411 solutions such as (x1 = 21 , x2 = 21 , x3 = 12 ). Making use of fractional infeasibility relative to integer
412 expressions is a useful technique for deriving additional cuts, and is a special case of disjunctive
413 programming (Balas, 1998).
414 Another mechanism to generate additional cuts includes the examination of the complementary
415 system, i.e., one in which a binary variable xi is substituted with 1 − xi . Consider a constraint
P
416 similar to the knapsack constraint, but with the inequality reversed: i ai xi ≥ b (with ai , b > 0).
P
417 Let x̄i = 1 − xi . Multiplying the inequality on the knapsack-like constraint by -1 and adding i ai
P P P
418 to both sides, we obtain: i ai − i ai xi ≤ −b + i ai . Substituting the complementary variables
P P
419 yields: i ai x̄i ≤ −b+ i ai . Note that when the right hand side is negative, the original constraint
420 is infeasible. Otherwise, this yields a knapsack constraint on x̄i from which cuts can be derived.
421 Cover cuts involving the x̄i can then be translated into cuts involving the original xi variables.
422 We summarize characteristics of these and other potentially helpful cuts in Table 1. A detailed
423 discussion of each of these cuts is beyond the scope of this paper; see Achterberg (2007) or Wolsey
424 (1998) for more details, as well as extensive additional references. State-of-the-art optimizers tend
425 to implement cuts that are based on general polyhedral theory that applies to all integer programs,

17
426 or on special structure that occurs on a sufficiently large percentage of practical models. Table 1
427 can help the practitioner distinguish cuts that a state-of-the-art optimizer is likely to implement
428 from those that are specific to particular types of models, and are less likely to be implemented
429 in a generic optimizer (and, hence, more likely to help performance if the practitioner uses his
430 knowledge to derive them).

Cut name Mathematical description of cut Structure of original MILP

that generates the cut
P
Clique† z ≤1 Packing constraints
Pi i
Cover† i zi ≤ b, b integer Knapsack constraints
P ′ P ′′
Disjunctive∗ Constraint derived from an LP solution ′
i ai xi ≥ b or
′′
i ai xi ≥ b ,
xi continuous or integer
Mixed Integer Rounding∗ Use of floors and ceilings of coefficients aC xC + aI xI = b,
and integrality of original variables x≥0
P
Generalized Upper Bound† i xi ≤ b, b integer Knapsack constraints
with precedence or packing
P
Implied Bound† xi ≤ abi i ai xi ≤ bz, x ≥ 0
Gomory∗ Mixed integer rounding applied to āC xC + āI/k xI/k + xk = b̄,
a simplex tableau row ā associated xk integer, x ≥ 0
with optimal node LP basis
Zero-half∗ λT Ax ≤ ⌊λT b⌋, Constraints containing integer
λi ∈ {0, 1/2} variables and coefficients
Flow Cover† Linear combination of flow and binary Fixed charge network
variables involving a single node
Flow Path† Linear combination of flow and binary Fixed charge network
variables involving a path of nodes
Multicommodity flow† Linear combination of flow and binary Fixed charge network
variables involving nodes in a network cut

Table 1: Different types of cuts and their characteristics, where z is binary unless otherwise noted,
and x is continuous; ∗ based on general polyhedral theory; †based on specific, commonly occurring
problem structure

431 Adding cuts does not always help branch-and-bound performance. While it can remove integer
432 infeasibilities, it also results in more constraints in each node LP. More constraints can increase
433 the time required to solve these linear programs. Without a commensurate speed-up in solution
434 time associated with processing fewer nodes, cuts may not be worth adding. Some optimizers have
435 internal logic to automatically assess the trade-offs between adding cuts and node LP solve time.
436 However, if the optimizer lacks such logic or fails to make a good decision, the practitioner may need
437 to look at the branch-and-bound output in order to assess the relative increase in performance due
438 to fewer examined nodes and the potential decrease in the rate at which the algorithm processes
439 the nodes. In other cases, the computational effort required to derive the cuts needed to effectively

18
440 solve the model may exceed the performance benefit they provide. Similar to node LP solve time
441 and node throughput, a proper comparison of the reduction in solution time the cuts provide with
442 the time spent calculating them may be necessary. (See Achterberg (2007).)
443 Most optimizers offer parameter settings that can improve progress of the best node, either
444 by strengthening the formulation or by enabling more node pruning. Features that are commonly
445 available include:

446 • (i) Best Bound node selection By selecting the node with the minimal relaxation objective
447 value, the algorithm updates the best node value faster. However, by considering node LP
448 objective values while ignoring the number of integer infeasibilities, best bound node selection
449 may cause the optimizer to find fewer integer feasible solutions. Therefore, best bound node
450 selection is most likely to help performance on models in which the optimizer finds integer
451 feasible solutions easily, but has trouble making sufficient progress in the best node.

452 • (ii) Strong branching By running a modest number of dual simplex iterations on multiple
453 branching variable candidates at each node, the algorithm can exploit any infeasible branches
454 to tighten additional variable bounds, resulting in a stronger formulation of the MIP at
455 the node in question, and faster pruning of its descendants. Strong branching increases the
456 computation at each node, so the performance improvement from the additional node pruning
457 must compensate for the diminished rate of node throughput to make this a reasonable feature
458 to employ.

459 • (iii) Probing By fixing a binary variable to a value of 0 or 1 and propagating this bound
460 change to other variables through the intersecting constraints, the optimizer can often identify
461 binary variables that can only assume one value in any feasible solution. For example, if fixing
462 a binary variable to 0 establishes that (PM IP ) is infeasible, then the variable must be 1 in
463 any integer feasible solution. Probing computation time primarily occurs as a preprocessing
464 step before starting the branch-and-bound algorithm. Identifying binary variables to fix can
465 tighten the formulation and improve node throughput by reducing the size of the problem.
466 However, it can be computationally expensive, so the practitioner must compare the time
467 spent performing the initial probing computations with the subsequent performance gains.

468 • (iv) More aggressive levels of cut generation Generating more cuts can further tighten
469 the formulation. However, the practitioner must properly assess the trade-off between the
470 tighter formulation and the potentially slower rate of node processing due to the additional
471 constraints in the node LPs.

19
472 If alternate parameter settings are insufficient to yield progress in the best node, the following
473 guidelines, while requiring more work, can help address this performance problem:

474 • (i) Careful model formulation It is sometimes possible to use alternate variable definitions.
475 For example, in Bertsimas and Stock Patterson (1998), the authors use variables to denote
476 whether an aircraft (flight) has arrived at a sector in the airspace by time period t, and
477 postulate that the variables represented in this manner “define connectivity constraints that
478 are facets of the convex hull of solutions,” which greatly improves the tractability of their
479 model. Similarly, in a model designed to determine a net present value-maximizing schedule
480 for extracting three-dimensional notional blocks of material in an open pit mine, we can define
481 xbt = 1 if block b is extracted by time period t, 0 otherwise, as opposed to the more intuitive
482 x̂bt = 1 if block b is extracted at time period t, 0 otherwise (Lambert et al., to appear). The
483 definitions in these two references result in models with significant differences in performance,
484 as illustrated theoretically and empirically.

485 • (ii) Careful use of elastic variables, i.e., variables that relax a constraint by allow-
486 ing for violations (which are then penalized in the objective) Adding elastic variables
487 can result in MIPs that remove the infeasibilities on integer expressions essential to standard
488 cut generation. This leads to a weaker model formulation in which most cut generation
489 mechanisms are disabled. If the use of elastic variables is necessary, consider first minimizing
490 the sum of the elastic variables, then optimizing the original objective while constraining the
491 elastic variable values to their minimized values.

492 3.4 Data and Memory Problems

493 Because the optimizer solves linear programs at each node of the branch-and-bound tree, the
494 practitioner must be careful to avoid the numerical performance issues described in Section 3 of
495 Klotz and Newman (To appear). Specifically, it is important to avoid large differences in orders
496 of magnitude in data to preclude the introduction of unnecessary round-off error. Such differences
497 of input values create round-off error in floating point calculations which makes it difficult for the
498 algorithm to distinguish between this error and a legitimate value. If the algorithm makes the
499 wrong distinction, it arrives at an incorrect solution. Integer programs may contain the construct
500 “if z = 0, then x = 0. Otherwise, x can be arbitrarily large.” Arbitrarily large values of x can be
501 carelessly modeled with a numerical value designed to represent infinity (often referred to as “big
502 M ” in the literature). In reality, the value for this variable can be limited by other constraints in
503 the problem; if so, we reduce its value, as in the following:

20
x − 100000000000z ≤ 0 (9)

0 ≤ x ≤ 5000; z binary (10)

504 In this case, we should use a coefficient of 5000 on z, which allows us to eliminate the explicit
505 upper bound on x as well. In addition to improving the scaling of the constraint, this change to
506 the numerical value enables the optimizer to better identify legitimate solutions to the conditions
507 being modeled. For example, the unmodified constraint accepts values of z = 10−8 and x =
508 1000 as an integer feasible solution. Most optimizers use an integrality tolerance and, by default,
509 accept an integrality violation of this order of magnitude. Therefore, the big M coefficient on the
510 original constraint enables the optimizer to accept a solution that, while feasible in a finite precision
511 computing environment, does not satisfy the intended meaning of the constraint. See Camm et al.
512 (1990) for further discussion.
513 Branch-and-bound can be generalized to other logic, which is important because it removes the
514 urge to use these numerically problematic “big M ’s” by allowing, for example, direct branching
515 on an indicator constraint. The indicator formulation of (9) is z = 0 ⇒ x ≤ 0. An indicator
516 infeasibility that requires branching occurs when a node relaxation solution has z = 0 but x > 0.
517 The indicator branches would be: x ≤ 0 and z = 1. By contrast, large values in (9) or elsewhere
518 in the model (whether truly infinite or some big M approximation) can result in a wide range
519 of coefficients that can easily lead to numerical problems. So, using indicators eliminates these
520 potentially large values from the matrix coefficients used to approximate an infinite value. For the
521 case in which the large values impose meaningful limits in the model, the indicator formulation
522 moves the coefficients from the matrix into the variable bounds, which improves the numerical
523 characteristics of the model.
524 Indicator constraints also support more general conditions, e.g., z = 0 ⇒ aT x ≤ b. In this
525 case, the indicator branches would be aT x ≤ b and z = 1. However, relaxations of indicator
526 constraints remove the constraint completely and can therefore be potentially weaker than their
527 less numerically stable big M counterpart. As of this writing, recent improvements in indicator
528 preprocessing in CPLEX have helped address this drawback.
529 Integer programs require at least as much memory as their linear programming equivalents.
530 Running out of memory is therefore as frequent, if not more frequent, a problem when trying to
531 solve integer programs, as opposed to linear programs. The same suggestions as those that appear
532 in Subsection 3.3 of Klotz and Newman (To appear) apply.

21
533 Table 2 provides suggestions for the branch-and-bound settings to use under the circumstances
534 mentioned in this section.

Characteristic Recognition Suggested tactic(s)

• Troublesome LPs • Large iteration counts per • Switch algorithms between primal
node, especially regarding and dual simplex; if advanced starts do
root node solve not help simplex, consider barrier method
• Lack of progress in best • Little or no change in best • Use best estimate or depth-first search
integer integer solution in log after • Apply heuristics more frequently
hundreds of nodes • Supply an initial solution
• Apply discount factors in the objective
• Branch up or down to resolve
integer infeasibilities
• Lack of progress in best • Little or no change in • Use breadth-first search
node best node in log after • Use aggressive probing
hundreds of nodes • Use aggressive algorithmic cut generation
• Apply strong branching
• Derive cuts a priori
• Reformulate with different variables
• Data and memory problems • Slow progress in node solves • Avoid large differences in size of data
• Out of memory error • Reformulate “big M ” constraints
• Rectify LP problems, e.g., degeneracy
• Apply memory emphasis setting
• Buy more memory

Table 2: Under various circumstances, different formulations and algorithmic settings have a greater
chance of faster solution time on an integer programming problem instance.

535 4 Tighter Formulations

536 When optimizer parameter settings (including aggressive application of cuts) fail to yield the desired
537 improvements, the practitioner may obtain additional performance gains by adding cuts more
538 specific to the model. The cuts added by the optimizer typically rely either on general polyhedral
539 theory that applies to all MIPs, or on special structure that appears in a significant percentage of
540 MIPs. In some cases, the cuts needed to improve performance rely on special structure specific
541 to individual MIPs. These less applicable cuts are unlikely to be implemented in any state-of-
542 the-art optimizer. In such cases, the practitioner may need to formulate his own cuts, drawing
543 on specific model knowledge. One can find a staggering amount of theory on cut derivation in
544 integer programming (Grötschel, 2004). While more knowledge of sophisticated cut theory adds
545 to the practitioner’s quiver of tactics to improve performance, run time enhancements can be
546 effected with some fairly simple techniques, provided the practitioner uses them in a disciplined,

22
547 well organized fashion. To that end, this section describes guidelines for identifying cuts that can
548 tighten a formulation of (PM IP ) and yield significant performance improvements. These guidelines
549 can help both novice practitioners and those who possess extensive familiarity with the underlying
550 theories of cut generation. See Rebennack et al. (2012) for an example of adding cuts based on
551 specific model characteristics.
552 Before tightening the formulation, the practitioner must identify elements of the model that
553 make it difficult, specifically, those that contain the constraints and variables from which useful
554 cuts can be derived. The following steps can help in this regard.
555 Determining How a MIP Can Be Difficult to Solve

556 • (i) Simplify the model if necessary. For example, try to identify any constraints or inte-
557 grality restrictions that are not involved in the slow performance by systematically removing
558 constraints and integrality restrictions and solving the resulting model. Such filtering can
559 be done efficiently by grouping similar constraints and variables and solving model instances
560 with one or more groups omitted. If the model remains difficult to solve after discarding a
561 group of constraints, the practitioner can tighten the formulation without considering those
562 constraints. Or, he can try to reproduce the problem with a smaller instance of the model.

563 • (ii) Identify the constraints that prevent the objective from improving. With a
564 minimization problem, this typically means identifying the constraints that force activities
565 to be performed. In other words, practical models involving nonnegative cost minimization
566 inevitably have some constraints that prevent the trivial solution of zero from being viable.

567 • (iii) Determine how removing integrality restrictions allows the root node relax-
568 ation objective to improve. In weak formulations, the root node relaxation objective
569 tends to be significantly better than the optimal objective of the associated MIP. The vari-
570 ables with fractional solutions in the root node relaxation help identify the constraints and
571 variables that motivate additional cuts. Many models have a wealth of valid cuts that could
572 be added purely by examining the model. But, many of those cuts may actually help little
573 in tightening the formulation. By focusing on how relaxing integrality allows the objective to
574 improve, the practitioner focuses on identifying the cuts that actually tighten the formulation.

575 Having identified the constraints and variables most likely to generate good cuts, the practitioner
576 faces numerous ways to derive the cuts. While a sophisticated knowledge of the literature provides
577 additional opportunities for tightening formulations, practitioners with limited knowledge of the
578 underlying theory can still effectively tighten many formulations using some fairly simple techniques.

23
579 Model Characteristics from which to Derive Cuts

580 • (i) Linear or logical combinations of constraints By combining constraints, one can
581 often derive a single constraint in which fractional values can be rounded to produce a tighter
582 cut. The clique cuts previously illustrated with the conflict graph provide an example of
583 how to identify constraints to combine. The conflict graph in that example occurs in a
584 sufficient number of practical MIPs so that many state-of-the-art optimizers use it. But,
585 other MIPs may have different graphs associated with their problem structure that do not
586 occur frequently. Identifying such graphs and implementing the associated cuts can often
587 tighten the formulation and dramatically improve performance.

588 • (ii) The optimization of one or more related models By optimizing a related model
589 that requires much less time to solve, the practitioner can often extract useful information
590 to apply to the original model. For example, minimizing a linear expression involving integer
591 variables and integer coefficients can provide a cut on that expression. This frequently helps
592 on models with integer penalty variables.

593 • (iii) Use of the incumbent solution objective value Because cuts are often based on in-
594 feasibility, models with soft constraints that are always feasible can present unique challenges
595 for deriving cuts. However, while any solution is feasible, the incumbent solution objective
596 value allows the practitioner to derive cuts based on the implicit, dynamic constraint defined
597 by the objective function and the incumbent objective value.

598 • (iv) Disjunctions Wolsey (1998) provides a description of deriving cuts from disjunctions,

599 which were first developed by Balas (1998). In general, suppose X1 = x : aT x ≥ b and X2 =
n o
600 x : âT x ≥ b̂ . Let u be the componentwise maximum of a and â, i.e., uj = max {aj , âj }.
n o
601 And, let ū = min b, b̂ . Then
uT x ≥ ū (11)

602 is valid for X1 ∪ X2 , which implies it is also valid for the convex hull of X1 and X2 . These
603 properties of disjunctions can be used to generate cuts in practice.

604 • (v) The exploitation of infeasibility As previously mentioned, cover, clique and other
605 cuts can be viewed as implicitly using infeasibility to identify cuts to tighten a formulation
606 of (PM IP ). Generally, for any linear expression involving integer variables with integer coef-
607 ficients and an integer right hand side b, if aT x ≤ b can be shown to be infeasible, then the
608 constraint aT x ≥ b + 1 provides a valid cut.

24
609 We now consider a simple example to illustrate the use of disjunctions to derive cuts. Most
610 state-of-the-art optimizers support mixed integer rounding cuts, both on constraints explicitly in
611 the model, and as Gomory cuts based on implicit constraints derived from the simplex tableau rows
612 of the node LP subproblems. So, practitioners typically do not need to apply disjunctions to derive
613 cuts on constraints like the one in the example we describe below. However, we use this simple
614 example to aid in the understanding of the more challenging example we present subsequently. In
615 the first instance, we illustrate the derivation of a mixed integer rounding cut on the constraint:

4x1 + 3x2 + 5x3 = 10 (12)

616

x1 , x2 , x3 ≥ 0, integer (13)

617 Dividing by the coefficient of x1 , we have

3 5 5
x1 + x2 + x3 = (14)
4 4 2

618 Now, we separate the left and right hand sides into integer and fractional components, and let x̂
619 represent the integer part of the left hand side:

1 1 1 1
x + x + x3 − x2 + x3 = 2 + = 3 − (15)
| 1 {z2 } 4 4 2 2
x̂

620 We examine a disjunction on the integer expression x̂. If x̂ ≤ 2, the terms with fractional coefficients
621 on the left hand side of (15) must be greater than or equal to the first fractional term in the right-
622 hand-side expressions. Similarly, the terms with fractional coefficients on the left hand side must
623 be less than or equal to the second fractional term in the right-hand-side expressions if x̂ ≥ 3.
624 Using the nonnegativity of the x variables to simplify the constraints implied by the disjunction,
625 we conclude:

−1 1 1
x̂ ≤ 2 ⇒ x2 + x3 ≥ ⇒ x3 ≥ 2 (16)
4 4 2
626

−1 1 −1
x̂ ≥ 3 ⇒ x2 + x3 ≤ ⇒ x2 ≥ 2 (17)
4 4 2

25
627 So, either x3 ≥ 2 or x2 ≥ 2. We can then use the result of (11) to derive the cut

x2 + x3 ≥ 2 (18)

628 Note that this eliminates the fractional solution (2, 31 , 15 ), which satisfies the original constraint,
629 (12). Note also that by inspection the only two possible integer solutions to this constraint are
630 (1, 2, 0) and (0, 0, 2). Both satisfy (18), establishing that the cut is valid. (Dividing (12) by the
631 coefficient on x2 or x3 instead of x1 results in a similar mixed integer rounding cut.)
632 This small example serves to illustrate the derivation of a mixed integer rounding cut on a
633 small constraint; state-of-the-art optimizers such as CPLEX would have been able to identify this
634 cut. However, disjunctions are more general, and can yield performance-improving cuts on models
635 for which the optimizer’s cuts do not yield sufficiently good performance. For example, consider
636 the following single-constraint knapsack model. Cornuejols et al. (1997) originally generated this
637 instance. (See Aardal and Lenstra (2004) for additional information on these types of models.) We
638 wish to either find a feasible solution or prove infeasibility for the single-constraint integer program:

13429x1 + 26850x2 + 26855x3 + 40280x4 + 40281x5 + 53711x6 + 53714x7 + 67141x8 = 45094583

639

xj ≥ 0, integer, j = 1, . . . , 8

640 Running CPLEX 12.2.0.2 with default settings results in no conclusion after over 7 hours and
641 2 billion nodes, as illustrated in Node Log #7:

642

643 Node Log #7

644 Nodes Cuts/

645 Node Left Objective IInf Best Integer Best Node ItCnt Gap
646 ...
647 2054970910 13066 0.0000 1 0.0000 25234328
648 Elapsed real time = 27702.98 sec. (tree size = 2.70 MB, solutions = 0)
649 2067491472 14446 0.0000 1 0.0000 25388082
650 2080023238 12892 0.0000 1 0.0000 25542160
651 2092548561 15366 0.0000 1 0.0000 25696280
652 ...
653 -------
654 Total (root+branch&cut) = 28302.29 sec.

26
655

656

657 MIP - Node limit exceeded, no integer solution.

658 Current MIP best bound = 0.0000000000e+00 (gap is infinite)
659 Solution time = 28302.31 sec. Iterations = 25787898 Nodes = 2100000004 (16642)

660

661 However, note that all the coefficients in the model are very close to integer multiples of the
662 coefficient of x1 . Therefore, we can separate the left hand side into the part that is an integer
663 multiple of this coefficient, and the much smaller remainder terms:

13429 (x1 + 2x2 + 2x3 + 3x4 + 3x5 + 4x6 + 4x7 + 5x8 ) (19)
| {z }
x̂
−8x2 − 3x3 − 7x4 − 6x5 − 5x6 − 2x7 − 4x8 (20)

= 3358 ∗ 13429 + 1 = 3359 ∗ 13429 − 13428 (21)

664 This constraint resembles the one from which we previously derived the mixed integer rounding
665 cut. But, instead of separating the integer and fractional components, we separate the components
666 that are exact multiples of the coefficient of x1 from the remaining terms. We now perform the
667 disjunction on x̂ in an analogous manner, again using the nonnegativity of the variables.

x̂ ≤ 3358 ⇒ −8x2 − 3x3 − 7x4 − 6x5 − 5x6 − 2x7 − 4x8 ≥ 1 (22)

| {z }
≤0

668 Thus, if x̂ ≤ 3358, the model is infeasible. Therefore, infeasibility implies that x̂ ≥ 3359 is a
669 valid cut. We can derive an additional cut from the other side of the disjunction on x̂:

x̂ ≥ 3359 ⇒ −8x2 − 3x3 − 7x4 − 6x5 − 5x6 − 2x7 − 4x8 ≤ −13428 (23)

670 This analysis shows that constraints (24) (using the infeasibility argument above) and (25)
671 (multiplying (23) through by -1) are globally valid cuts.

x1 + 2x2 + 2x3 + 3x4 + 3x5 + 4x6 + 4x7 + 5x8 ≥ 3359 (24)

8x2 + 3x3 + 7x4 + 6x5 + 5x6 + 2x7 + 4x8 ≥ 13428 (25)

672 Adding these cuts enables CPLEX 12.2.0.2 to easily identify that the model is infeasible (see Node

27
673 Log #8). Summarizing this example, concepts (iv) and (v), the use of disjunctions and exploiting
674 infeasibility, helped generate cuts that turned a challenging MIP into one that was easily solved.

675

676 Node Log #8

677

678 Nodes Cuts/

679 Node Left Objective IInf Best Integer Best Node ItCnt Gap
680

681 0 0 0.0000 1 0.0000 1

682 0 0 0.0000 2 MIRcuts: 1 3
683 0 0 0.0000 2 MIRcuts: 1 5
684 0 0 cutoff 5
685 Elapsed real time = 0.23 sec. (tree size = 0.00 MB, solutions = 0)
686 Mixed integer rounding cuts applied: 1
687 ...
688 MIP - Integer infeasible.
689 Current MIP best bound is infinite.
690 Solution time = 0.46 sec. Iterations = 5 Nodes = 0

691

692 The second practical example we consider is a rather large maximization problem, and illustrates
693 concepts (ii) and (v): the optimization of one or more related models and the exploitation of
694 infeasibility, respectively. The example involves a collection of n objects with some measure of
695 distance between them. The model selects k < n of the objects in a way that maximizes the sum
696 of the distances between the selected object, i.e., the k most diverse objects are selected. The most
697 direct model formulation involves binary variables and a quadratic objective. Let dij ≥ 0 be the
698 known distance between object i and object j, and let xi be a binary variable that is 1 if object i
699 is selected, and 0 otherwise. The formulation follows:

n X
X n
(M IQP ) max dij xi xj
i=1 j=i+1
700
n
X
subject to xj ≤ k
j=1

28
701
xj binary

702 Because this article focuses on linear and linear-integer models, we consider an equivalent linear
703 formulation that recognizes that the product of binary variables is itself a binary variable (Watters,
704 1967). We replace each product of binaries xi xj in (M IQP ) with a binary variable zij , and add
705 constraints to express the relationship between x and z in a mixed integer linear program (MILP):

n X
X n
(M ILP ) max dij zij (26)
j=1 i=1
i<j
n
X
subject to xj ≤ k (27)
j=1
zij ≤ xi ∀ i, j (28)

zij ≤ xj ∀ i, j (29)

xi + xj ≤ 1 + zij ∀ i, j (30)

xj , zij binary ∀ i, j (31)

706 The constraints (28), (29) and (30) exist for indices (i, j), i < j because the selection of both i and
707 j is equivalent to the selection of both j and i. Hence, the model only defines zij variables with
708 i < j. Note that if xi or xj = 0, then constraints (28) and (29) force zij to 0, while (30) imposes
709 no restriction on zij . Similarly, if both xi and xj = 1, (28) and (29) impose no restriction on zij ,
710 while (30) forces zij to 1. So, regardless of the values of xi and xj , zij = xi xj , and we can replace
711 occurrences of xi xj with zij to obtain the linearized reformulation above.
712 This linearized model instance with n = 60 and k = 24 possesses 1830 binary variables, and 5311
713 constraints. Due to the large branch-and-bound tree resulting from this instance, we set CPLEX’s
714 file parameter to instruct CPLEX to efficiently swap the memory associated with the branch-and-
715 bound tree to disk. This enables the run to proceed further than with default settings in which
716 CPLEX stores the tree in physical memory. All other parameter settings remain at defaults, so
717 CPLEX makes use of all four available processors. CPLEX runs for just over four hours (see Node
718 Log #9), terminating when the size of the swap file for the branch-and-bound tree exceeds memory
719 limits, i.e., at the point at which CPLEX has processed over 4 million nodes and the solution has
720 an objective value of 3483.0000, proven to be within 51.32% of optimal. This level of performance
721 indicates significant potential for improvement. Although we do not provide the output here, the
722 original MIQP formulation in (M IQP ) performs even worse.

723

29
724 Node Log #9

725 Nodes Cuts/

726 Node Left Objective IInf Best Integer Best Node ItCnt Gap
727

728 * 0+ 0 0.0000 2247 ---

729 0 0 7640.4000 1830 0.0000 7640.4000 2247 ---
730 * 0+ 0 19.0000 7640.4000 2247 ---
731

732 ...
733

734 * 0+ 0 3185.0000 7445.4286 2286 133.77%

735 0 2 7628.5333 1829 3185.0000 7445.4286 2286 133.77%
736 Elapsed real time = 4.09 sec. (tree size = 0.01 MB, solutions = 8)
737 35 37 6579.2308 1378 3185.0000 7445.4286 6615 133.77%
738 ...
739 4332613 3675298 4936.6750 1099 3483.0000 5270.8377 1.78e+08 51.33%
740 4341075 3682375 3889.4643 714 3483.0000 5270.4545 1.79e+08 51.32%
741

742 ...
743 CPLEX Error 1803: Failure on temporary file write.
744

745 Solution pool: 25 solutions saved.

746

747 MIP - Error termination, no tree: Objective = 3.4830000000e+03

748 Current MIP best bound = 5.2704102564e+03 (gap = 1787.41, 51.32%)
749 Solution time = 15031.18 sec. Iterations = 178699476 Nodes = 4342299 (3682262)

750

751 Experimentation with non-default parameter settings as described in Section 3 yields modest
752 performance improvements, but does not come close to enabling CPLEX to find an optimal solution
753 to the model.
754 We carefully examine a smaller model instance with n = 3 and k = 2 to assess how removing
755 integrality restrictions yields an artificially high objective function value:

30
max 3z12 + 4z13 + 5z23

subject to x 1 + x 2 + x3 ≤ 2

z12 − x1 ≤ 0

z12 − x2 ≤ 0

x1 + x2 ≤ 1 + z12

z13 − x1 ≤ 0

z13 − x3 ≤ 0

x1 + x3 ≤ 1 + z13

z23 − x2 ≤ 0

z23 − x3 ≤ 0

x2 + x3 ≤ 1 + z23

x1 , x2 , x3 , z12 , z13 , z23 binary

756 The optimal solution to this MILP consists of setting z23 = x2 = x3 = 1, yielding an objective
757 value of 5. By contrast, relaxing integrality enables a fractional solution consisting of setting all
758 x and z variables to 2/3, yielding a much better objective value of 8. Note that the difference
759 between the MILP and its relaxation occurs when the zij variables assume values strictly less than
760 1. When any zij = 1, the corresponding xi and xj variables are forced to 1 by constraints (28)
761 and (29) for both the MILP and its LP relaxation. By contrast, when 0 ≤ zij < 1, xi or xj must
762 assume a value of 0 in the MILP, but not in the relaxation. Thus, in the LP relaxation, we can set
763 more of the z variables to positive values than in the MILP. This raises the question of how many
764 z variables we can set to 1 in the MILP. In the optimal solution, only z23 assumes a value of 1.
765 So, can we set two of the z variables to 1 and find a feasible solution to the MILP? To answer this
766 question, pick any two z variables and set them to 1. Since each z variable is involved in similar
767 types of constraints, without loss of generality, we set z12 and z13 to 1. From the constraints:

31
z12 − x1 ≤ 0

z12 − x2 ≤ 0

z13 − x1 ≤ 0

z13 − x3 ≤ 0

768 we see that x1 , x2 , and x3 must all be set to 1. But this violates the constraint that the x variables
769 can sum to at most 2. For any of the other two distinct pairs of z variables in this smaller model,
770 all three x variables are forced to a value of 1 since for the MILP:

zij > 0 ⇐⇒ xi = xj = 1 (32)

771 Thus, any distinct pair of z variables set to 1 forces three x variables to 1, violating the constraint
772 that x1 + x2 + x3 ≤ 2. Hence, in any integer feasible solution, at most one z variable can be set to
773 1. This implies that the constraint:

z12 + z13 + z23 ≤ 1

774 is a globally valid cut. And, we can see that it cuts off the optimal solution of the LP relaxation
775 consisting of setting each z variable to 2/3.
776 We now generalize this to (M ILP ), in which the x variables can sum to at most k. We
777 wish to determine the number of z variables we can set to 1 in (M ILP ) without forcing the
778 sum of the x variables to exceed k. Suppose we set k of the x variables to 1. Since (32) holds
779 for all pairs of x variables, without loss of generality, consider an integer feasible solution with
780 x1 = x2 = · · · = xk = 1, and xk+1 = · · · = xn = 0. From (32), zij = 1 if and only if 1 ≤ i ≤ k,
781 1 ≤ j ≤ k, and i < j. We can therefore count the number of z variables that equal 1 when
782 x1 = x2 = · · · = xk = 1. Specifically, there are k(k − 1) pairs (i, j) with i 6= j, but only half of them
783 have i < j. So, at most k(k − 1)/2 of the zij variables can be set to 1 when k of the x variables are
784 set to 1. In other words,

n X
X n
zij ≤ k(k − 1)/2
i=1 j=i+1

785 is a globally valid cut.

32
786 Adding this cut to the instance with n = 60 and k = 24 enables CPLEX to solve the model to
787 optimality in just over 2 hours and 30 minutes on the same machine using settings identical to those
788 from the previous run without the cut. (See Node Log #10.) Note that the cut tightened the
789 formulation significantly, as can be seen by the much better root node objective value of 4552.4000,
790 which compares favorably to the root node objective value of 7640.4000 on the instance without
791 the cut. Furthermore, the cut enabled CPLEX to add numerous zero-half cuts to the model that
792 it could not with the original formulation. The zero-half cuts resulted in additional progress in the
793 best node value that was essential to solving the model to optimality in a reasonable amount of
794 time.

795

796 Node Log #10

Nodes Cuts/
Node Left Objective IInf Best Integer Best Node ItCnt Gap

* 0+ 0 0.0000 1161 ---

0 0 4552.4000 750 0.0000 4552.4000 1161 ---
* 0+ 0 6.0000 4552.4000 1161 ---
...
* 0+ 0 3477.0000 3924.7459 37882 12.88%
0 2 3924.7459 1281 3477.0000 3924.7459 37882 12.88%
Elapsed real time = 51.42 sec. (tree size = 0.01 MB, solutions = 31)
1 3 3919.3378 1212 3477.0000 3924.7459 39886 12.88%
2 4 3910.8201 1243 3477.0000 3924.7459 42289 12.88%
3 5 3910.8041 1144 3477.0000 3919.3355 44070 12.72%
...
125571 7819 cutoff 3590.0000 3599.7046 60456851 0.27%
Elapsed real time = 9149.19 sec. (tree size = 234.98 MB, solutions = 43)
Nodefile size = 196.38 MB (168.88 MB after compression)
*126172 7231 integral 0 3591.0000 3599.7046 60571398 0.24%
127700 5225 cutoff 3591.0000 3598.0159 60769494 0.20%
131688 6 cutoff 3591.0000 3592.5939 60980430 0.04%

Zero-half cuts applied: 2244

33
Solution pool: 44 solutions saved.

MIP - Integer optimal solution: Objective = 3.5910000000e+03

Solution time = 9213.79 sec. Iterations = 60980442 Nodes = 131695

797

798 Given the modest size of the model, a run time of 2.5 hours to optimality suggests potential
799 for additional improvements in the formulation. However, by adding one globally valid cut, we see
800 a dramatic performance improvement nonetheless. Furthermore, the derivation of this cut draws
801 heavily on the guidelines proposed for tightening the formulation. By using a small instance of
802 the model, we can easily identify how removal of integrality restrictions enables the objective to
803 improve. Furthermore, we use infeasibility to derive the cut: by recognizing that the simplified
804 MILP model is infeasible when z12 + z13 + z23 ≥ 2, we show that z12 + z13 + z23 ≤ 1 is a valid cut.

805 5 Conclusion

806 Today’s hardware and software allow practitioners to formulate and solve increasingly large and
807 detailed models. However, optimizers have become less straightforward, often providing many
808 methods for implementing their algorithms to enhance performance given various mathematical
809 structures. Additionally, the literature regarding methods to increase the tractability of mixed
810 integer linear programming problems contains a high degree of theoretical sophistication. Both of
811 these facts might lead a practitioner to conclude that developing the skills necessary to successfully
812 solve difficult mixed integer programs is too time consuming or difficult. This paper attempts to
813 refute that perception, illustrating that practitioners can implement many techniques for improving
814 performance without expert knowledge in the underlying theory of integer programming, thereby
815 enabling them to solve larger and more detailed models with existing technology.

816 Acknowledgements

817 Dr. Klotz wishes to acknowledge all of the CPLEX practitioners over the years, many of whom
818 have provided the wide variety of models that revealed the guidelines described in this paper. He
819 also wishes to thank the past and present CPLEX development, support, and sales and marketing
820 teams who have contributed to the evolution of the product. Professor Newman wishes to thank
821 former doctoral students Chris Cullenbine, Brian Lambert, Kris Pruitt, and Jennifer Van Dinter at

34
822 the Colorado School of Mines for their helpful comments; she also wishes to thank her colleagues
823 Jennifer Rausch (Jeppeson, Englewood, Colorado) and Professor Josef Kallrath (BASF-AG, Lud-
824 wigshafen, Germany) for helpful comments on an earlier draft. Both authors thank an anonymous
825 referee for his helpful comments that improved the paper.
826 Both authors wish to remember Lloyd Clarke (February 14, 1964-September 20, 2007). His
827 departure from the CPLEX team had consequences that extended beyond the loss of an important
828 employee and colleague.

35
829 References
830 Aardal, K. and Lenstra, A., 2004. “Hard equality constrained integer knapsacks.” Mathematics of
831 Operations Research, 3(29): 724–738.

832 Achterberg, T., 2007. Constraint Integer Programming (Ph.D. Dissertation), Technical University
833 Berlin, Berlin.

834 Balas, E., 1965. “An additive algorithm for solving linear programs with zero-one variables.” Op-
835 erations Research, 13(4): 517–546.

836 Balas, E., 1998. “Disjunctive programming: Properties of the convex hull of feasible points.” Dis-
837 crete Applied Mathematics Tech. Report MSRR 348, Carnegie Mellon University, 89(1-3): 3–44.

838 Bertsimas, D. and Stock Patterson, S., 1998. “The air traffic flow management problem with enroute
839 capacities.” Operations Research, 46(3): 406–422.

840 Bixby, R. and Rothberg, E., 2007. “Progress in computational mixed integer programming – a look
841 back from the other side of the tipping point.” Annals of Operations Research, 149(1): 37–41.

842 Camm, J., Raturi, A. and Tadisina, S., 1990. “Cutting big M down to size.” Interfaces, 20(5):
843 61–66.

844 Cornuejols, G., Urbaniak, R., Weismantel, R. and Wolsey, L., 1997. “Decomposition of integer
845 programs and of generating sets.” Lecture Notes in Computer Science; Proceedings of the 5th
846 Annual European Symposium on Algorithms, 1284: 92–102.

847 Danna, E., 2008. “Performance variability in mixed integer programming.” Presentation at MIP
848 2008 Workshop, Columbia University.

849 FICO, 2012. Xpress-MP Optimization Suite, Minneapolis, MN.

850 Grötschel, ed., 2004. The Sharpest Cut: The Impact of Manfred Padberg and His Work, MPS-SIAM
851 Series on Optimization.

852 Gurobi, 2012. Gurobi Optimizer, Houston, TX.

853 IBM, 2012. ILOG CPLEX, Incline Village, NV.

854 Klotz, E. and Newman, A., To appear. “Practical guidelines for solving difficult linear programs.”
855 Surveys in Operations Research and Management Science, doi:10.1016/j.sorms.2012.11.001.

856 Koch, T., Achterberg, T., Andersen, E., Bastert, O., Berthold, T., Bixby, R., Danna, E., Gamrath,
857 G., Gleixner, A., Heinz, S., Lodi, A., Mittelmann, H., Ralphs, T., Salvagnin, D., Steffy, D. and
858 Wolter, K., 2011. “MIPLIB 2010.” Mathematical Programming Computation, 3(2): 103–163.

859 Lambert, W., Brickey, A., Newman, A. and Eurek, K., to appear. “Open pit block sequencing
860 formulations: A tutorial.” Interfaces.

861 MOPS, 2012. MOPS, Paderborn, Germany.

862 MOSEK, 2012. MOSEK Optimization Software, Copenhagen, Denmark.

863 Rardin, R., 1998. Optimization in Operations Research, Prentice Hall, chap. 6.

36
864 Rebennack, S., Reinelt, G. and Pardalos, P., 2012. “A tutorial on branch and cut algorithms for
865 the maximum stable set problem.” International Transactions in Operational Research, 19(1-2):
866 161–199.

867 Savelsbergh, M., 1994. “Preprocessing and probing techniques for mixed integer programming prob-
868 lems.” INFORMS Journal on Computing, 6(4): 445–454.

869 Watters, L., 1967. “Reduction of integer polynomial programming to zero-one linear programming
870 problems.” Operations Research, 15(6): 1171–1174.

871 Winston, W., 2004. Operations Research: Applications and Algorithms, Brooks/Cole, Thompson
872 Learning.

873 Wolsey, L., 1998. Integer Programming, Wiley.

(Textbooks in Mathematics) Jeffrey Paul Wheeler - An Introduction To Optimization. With Applications in Machine Learning and Data Analytics-CRC Press (2024)
No ratings yet
(Textbooks in Mathematics) Jeffrey Paul Wheeler - An Introduction To Optimization. With Applications in Machine Learning and Data Analytics-CRC Press (2024)
475 pages
2010 TRANSMISSION Automatic Transmission (RE0F08B) - Cube
100% (2)
2010 TRANSMISSION Automatic Transmission (RE0F08B) - Cube
233 pages
Ideas Rule The World - Sam Adeyemi
92% (13)
Ideas Rule The World - Sam Adeyemi
60 pages
Alexander Schrijver - Theory of Linear and Integer Programming PDF
100% (2)
Alexander Schrijver - Theory of Linear and Integer Programming PDF
484 pages
Métodos numéricos aplicados a Ingeniería: Casos de estudio usando MATLAB
From Everand
Métodos numéricos aplicados a Ingeniería: Casos de estudio usando MATLAB
Héctor Jorquera González
5/5 (1)
Ampl Tutorial
No ratings yet
Ampl Tutorial
19 pages
Atamtürk and Savelsbergh - 2005 - Integer-Programming Software Systems
No ratings yet
Atamtürk and Savelsbergh - 2005 - Integer-Programming Software Systems
58 pages
IP Lecture Notes
No ratings yet
IP Lecture Notes
95 pages
Powell Modern Approach To Teaching Optimization Submitted November 9 2023
No ratings yet
Powell Modern Approach To Teaching Optimization Submitted November 9 2023
93 pages
Optimization For Engineering Design
100% (1)
Optimization For Engineering Design
438 pages
Lect ModConvOpt
No ratings yet
Lect ModConvOpt
590 pages
C. B. Gupta - Optimization Techniques in Operation Research-I.K. International (2020)
No ratings yet
C. B. Gupta - Optimization Techniques in Operation Research-I.K. International (2020)
381 pages
OptCourseNotes18 19
No ratings yet
OptCourseNotes18 19
51 pages
Linear and Nonlinear Programming Essentials
From Everand
Linear and Nonlinear Programming Essentials
Tanushri Kaniyar
No ratings yet
Mdobook
No ratings yet
Mdobook
642 pages
Powell MATO Final Jan 23 2024
No ratings yet
Powell MATO Final Jan 23 2024
129 pages
Surveys in Operations Research and Management Science. Vol. 18. No 1-2
No ratings yet
Surveys in Operations Research and Management Science. Vol. 18. No 1-2
53 pages
Mdobook-J R R A MARTINS
No ratings yet
Mdobook-J R R A MARTINS
450 pages
Ben-Tal A., Nemirovski A. - Lectures on modern convex optimization-Technion, Georgia Institute of Technology (2023)
No ratings yet
Ben-Tal A., Nemirovski A. - Lectures on modern convex optimization-Technion, Georgia Institute of Technology (2023)
732 pages
Progress in Linear Programming-Based Algorithms For Integer Programming - Johnson Et Al.
No ratings yet
Progress in Linear Programming-Based Algorithms For Integer Programming - Johnson Et Al.
22 pages
Wolsey IntegerProgramming
No ratings yet
Wolsey IntegerProgramming
20 pages
Discrete Optimization: National Engineering School of Tunis
No ratings yet
Discrete Optimization: National Engineering School of Tunis
51 pages
Practical Optimization Methods With Mathematica Applications PDF
100% (1)
Practical Optimization Methods With Mathematica Applications PDF
729 pages
Mixed Integer Programming: Models and Methods - Nicolai Pisaruk
No ratings yet
Mixed Integer Programming: Models and Methods - Nicolai Pisaruk
261 pages
1. From linear to conic programming
No ratings yet
1. From linear to conic programming
37 pages
OQM Lecture Note - Part 1 Introduction To Mathematical Optimisation
No ratings yet
OQM Lecture Note - Part 1 Introduction To Mathematical Optimisation
10 pages
Liberti or Modelling SW Slides
No ratings yet
Liberti or Modelling SW Slides
158 pages
Lecture Notes On Operations Research 3OR
No ratings yet
Lecture Notes On Operations Research 3OR
51 pages
Lecture Notes (7) : Eme7102 Engineering Research Methodology
No ratings yet
Lecture Notes (7) : Eme7102 Engineering Research Methodology
32 pages
Ancău, Mircea - Practical Optimization With MATLAB (2019, Cambridge Scholars Publishing)
No ratings yet
Ancău, Mircea - Practical Optimization With MATLAB (2019, Cambridge Scholars Publishing)
292 pages
Programare Liniara
No ratings yet
Programare Liniara
20 pages
Learning Optimal Objective Values For MILP.18321v1
No ratings yet
Learning Optimal Objective Values For MILP.18321v1
12 pages
Second Edition: Optimization For Engineering Design
No ratings yet
Second Edition: Optimization For Engineering Design
16 pages
Math 404 - W01 - Intro
No ratings yet
Math 404 - W01 - Intro
28 pages
Math Prog
100% (1)
Math Prog
122 pages
Chapter 6 - Integer Programming (Part 1)
No ratings yet
Chapter 6 - Integer Programming (Part 1)
22 pages
MINLP
No ratings yet
MINLP
123 pages
Accelerating Benders Decomposition Algorithmic Enh PDF
No ratings yet
Accelerating Benders Decomposition Algorithmic Enh PDF
41 pages
Milp Base Paper
No ratings yet
Milp Base Paper
4 pages
Simplex Algorithm
No ratings yet
Simplex Algorithm
8 pages
Handbooks in Operations Research and Management Science - Vol 12 Discrete Optimization - (Elsevier) - 2005
No ratings yet
Handbooks in Operations Research and Management Science - Vol 12 Discrete Optimization - (Elsevier) - 2005
606 pages
Lecture 9
No ratings yet
Lecture 9
11 pages
Optimization Algorithms and Hierarchical Convergence
From Everand
Optimization Algorithms and Hierarchical Convergence
Pasquale De Marco
No ratings yet
Scheduling Optimisation of Chemical Process Plant
No ratings yet
Scheduling Optimisation of Chemical Process Plant
223 pages
MB0048 Set 1 & 2
No ratings yet
MB0048 Set 1 & 2
14 pages
Download Optimizations and Programming: Linear, Non-linear, Dynamic, Stochastic and Applications with Matlab Abdelkhalak El Hami ebook All Chapters PDF
100% (4)
Download Optimizations and Programming: Linear, Non-linear, Dynamic, Stochastic and Applications with Matlab Abdelkhalak El Hami ebook All Chapters PDF
66 pages
Chapter 8 Using Excel Solver Solve Mathematical Model Modul Dss
No ratings yet
Chapter 8 Using Excel Solver Solve Mathematical Model Modul Dss
45 pages
50 Years of Integer Programming 1958-2008 PDF
100% (3)
50 Years of Integer Programming 1958-2008 PDF
811 pages
MOSEK
No ratings yet
MOSEK
399 pages
Parallel Computing Optimization
No ratings yet
Parallel Computing Optimization
16 pages
Solving Optimization Problems Using The Matlab Opt
No ratings yet
Solving Optimization Problems Using The Matlab Opt
50 pages
Tristan Gally, Marc Pfetsch Et Stefan Ulbrich - A Framework for Solving Mixed-Integer Semidefinite Programs (2017)
No ratings yet
Tristan Gally, Marc Pfetsch Et Stefan Ulbrich - A Framework for Solving Mixed-Integer Semidefinite Programs (2017)
40 pages
2024 03 07 Lecture Notes
No ratings yet
2024 03 07 Lecture Notes
43 pages
Seeding Methodsfor Solving General MINLP
No ratings yet
Seeding Methodsfor Solving General MINLP
42 pages
Linear Programming Using Matlab: Nikolaos Ploskas & Nikolaos Samaras
No ratings yet
Linear Programming Using Matlab: Nikolaos Ploskas & Nikolaos Samaras
15 pages
Analysis of Practical Applications Using The Linea
No ratings yet
Analysis of Practical Applications Using The Linea
9 pages
Optimizatio With Matlab
No ratings yet
Optimizatio With Matlab
49 pages
Mastering Dynamic Programming in Python
From Everand
Mastering Dynamic Programming in Python
Ed A Norex
No ratings yet
Advanced Backend Code Optimization
From Everand
Advanced Backend Code Optimization
Sid Touati
No ratings yet
Mastering Data Structures and Algorithms in Python & Java
From Everand
Mastering Data Structures and Algorithms in Python & Java
Sachin Naha
No ratings yet
Mastering Generic Programming in C++: Unlock the Secrets of Expert-Level Skills
From Everand
Mastering Generic Programming in C++: Unlock the Secrets of Expert-Level Skills
Larry Jones
No ratings yet
Advanced Guide to Dynamic Programming in Python: Techniques and Applications
From Everand
Advanced Guide to Dynamic Programming in Python: Techniques and Applications
Adam Jones
No ratings yet
Elements of Statistical Learning
From Everand
Elements of Statistical Learning
Swarnalata Verma
No ratings yet
A Decision Making Tool For The Determination of The Distribution Center Location in Humanitarian Logistics Network
No ratings yet
A Decision Making Tool For The Determination of The Distribution Center Location in Humanitarian Logistics Network
17 pages
Fuzzy SA For Project Time Cost Trade Off - Milenkovic
No ratings yet
Fuzzy SA For Project Time Cost Trade Off - Milenkovic
14 pages
Impact of Travel Time Uncertainties On The Solution Cost of A Two Echelon Vehicle Routing Problem With Sinchronization
No ratings yet
Impact of Travel Time Uncertainties On The Solution Cost of A Two Echelon Vehicle Routing Problem With Sinchronization
23 pages
Inland Intermodal Terminals and Freight Logistics Hubs PDF
No ratings yet
Inland Intermodal Terminals and Freight Logistics Hubs PDF
29 pages
Optimization of The Rail Freight Car Fleet Sizing Problem
No ratings yet
Optimization of The Rail Freight Car Fleet Sizing Problem
7 pages
A Survey of Optimization Models For Train Routing and Scheduling PDF
No ratings yet
A Survey of Optimization Models For Train Routing and Scheduling PDF
25 pages
Integrated Operations Planning and Revenue Management For Rail Freight Transportation PDF
No ratings yet
Integrated Operations Planning and Revenue Management For Rail Freight Transportation PDF
20 pages
Basics in Finance 9 Questions Homework WarmUps Solutions - PITANJ ZA KOLOKVIJUM ISPIT
No ratings yet
Basics in Finance 9 Questions Homework WarmUps Solutions - PITANJ ZA KOLOKVIJUM ISPIT
13 pages
2013 IEEE ITSC A Hierarchical MPC For Optimizing Intermodal Container Terminal Operations JLNabais PDF
No ratings yet
2013 IEEE ITSC A Hierarchical MPC For Optimizing Intermodal Container Terminal Operations JLNabais PDF
7 pages
Bining Neural Network Model With Seasonal Time Series ARIMA Model
No ratings yet
Bining Neural Network Model With Seasonal Time Series ARIMA Model
17 pages
Analytic Model Driven Organizational Design and Experimentation in Adaptive Command and Control
No ratings yet
Analytic Model Driven Organizational Design and Experimentation in Adaptive Command and Control
12 pages
Parallel Simulated Annealing Algorithms: N Different Nodes of The Network. The Two Algorithms Have
No ratings yet
Parallel Simulated Annealing Algorithms: N Different Nodes of The Network. The Two Algorithms Have
6 pages
Chacarera Del Rancho - Flauta
No ratings yet
Chacarera Del Rancho - Flauta
1 page
SEM 5 - Comps, IOT, CYBER, CS - Software Engineering - 2024 MAY To 2022 DEC PYQ - Aeraxia - in
No ratings yet
SEM 5 - Comps, IOT, CYBER, CS - Software Engineering - 2024 MAY To 2022 DEC PYQ - Aeraxia - in
4 pages
DC Traction Power Supply: Powerful, Efficient and Safe
No ratings yet
DC Traction Power Supply: Powerful, Efficient and Safe
12 pages
Đề Listening & Reading Vol 9
No ratings yet
Đề Listening & Reading Vol 9
44 pages
Manual CableOS Support - TAC - PP - v14
No ratings yet
Manual CableOS Support - TAC - PP - v14
20 pages
Sword and Pyramid Guide1
No ratings yet
Sword and Pyramid Guide1
9 pages
Sagar Samvad Book-2
No ratings yet
Sagar Samvad Book-2
105 pages
Innocence Taken Damien Kaine 1 1st Edition Victoria M Patton all chapter instant download
100% (2)
Innocence Taken Damien Kaine 1 1st Edition Victoria M Patton all chapter instant download
47 pages
Diffusion and Osmosis Worksheet
100% (2)
Diffusion and Osmosis Worksheet
5 pages
How To Build 60W Bass Amplifier (Circuit Diagram)
No ratings yet
How To Build 60W Bass Amplifier (Circuit Diagram)
9 pages
Health Safety Health: Key Performance Indicator Month of October
100% (2)
Health Safety Health: Key Performance Indicator Month of October
1 page
Avr 700
No ratings yet
Avr 700
60 pages
Sample Synopsis
No ratings yet
Sample Synopsis
10 pages
Android:An Open Handset Alliance Project
No ratings yet
Android:An Open Handset Alliance Project
23 pages
MF2507I002310474-4
No ratings yet
MF2507I002310474-4
8 pages
My Ielts
No ratings yet
My Ielts
2 pages
Cyber Laws in Pakistan
No ratings yet
Cyber Laws in Pakistan
28 pages
Eddf Sid 07 01
No ratings yet
Eddf Sid 07 01
8 pages
GME
100% (2)
GME
4 pages
RSL Guitar Syllabus Guide 2018 DIGITAL 31may2019 PDF
No ratings yet
RSL Guitar Syllabus Guide 2018 DIGITAL 31may2019 PDF
87 pages
Coffee Shop Business Plan
67% (3)
Coffee Shop Business Plan
35 pages
Minilec Catalouge PDF
No ratings yet
Minilec Catalouge PDF
64 pages
Depression Journal PDF
No ratings yet
Depression Journal PDF
24 pages
GDS Online Engagement Application: Personal Details
No ratings yet
GDS Online Engagement Application: Personal Details
2 pages
First Language Acquisition-2023
No ratings yet
First Language Acquisition-2023
61 pages
Ias 40 - Investment Property - 2
No ratings yet
Ias 40 - Investment Property - 2
53 pages
Qatalum Corporate Profile
No ratings yet
Qatalum Corporate Profile
27 pages
Radiology Tech Skills Checklist
No ratings yet
Radiology Tech Skills Checklist
2 pages

Practical Guidelines For Solving Difficult Mixed Integer Linear Programs

Uploaded by

Practical Guidelines For Solving Difficult Mixed Integer Linear Programs

Uploaded by

1 Practical Guidelines for Solving Difficult Mixed Integer Linear

9 “Problems worthy of attack prove their worth by hitting back.”

11 “Everybody has a plan until he gets hit in the mouth.”

(PM IP ) : min cTC xC + cTI xI

Integer x3 = 0.6 ??? Fathomed

Figure 1: Branch-and-bound algorithm

217 3 Guidelines for Successful Algorithm Performance

222 • (iii) There is lack of progress in the best lower bound.

Nodes Cuts/ ItCnt

Nodes Cuts/ ItCnt

278 3.2 Lack of Progress in the Best Integer Solution

283 Node Log #3: Lack of Progress in Best Integer Solution

284 Nodes Cuts/ ItCnt Gap

317 Node Log #4: No initial practitioner-supplied solution

Root relaxation solution time = 131.45 sec.

0 0 1.09590e+07 2424 1.09590e+07 108111

320 Node Log #5: An initial solution supplied by the practitioner

Root relaxation solution time = 93.92 sec.

* 0+ 0 1.07197e+07 108111 ---

485 433 1.09075e+07 2398 1.07197e+07 1.08840e+07 244077 1.53%

339 3.3 Lack of Progress in the Best Bound

P1 2) ∃ x ∈ P2 : âTi x > b̂i (separation)

Figure 2: Convex hull

360 Nodes Cuts/ ItCnt Gap

363 300 296 2018.0000 27 3780.0000 560.0000 3703 85.19%

4x3 + 3x4 + 5x5 ≤ 10 (5)

3x2 + 4x5 ≤ 5 (7)

Cut name Mathematical description of cut Structure of original MILP

492 3.4 Data and Memory Problems

0 ≤ x ≤ 5000; z binary (10)

Characteristic Recognition Suggested tactic(s)

535 4 Tighter Formulations

4x1 + 3x2 + 5x3 = 10 (12)

617 Dividing by the coefficient of x1 , we have

13429x1 + 26850x2 + 26855x3 + 40280x4 + 40281x5 + 53711x6 + 53714x7 + 67141x8 = 45094583

643 Node Log #7

644 Nodes Cuts/

657 MIP - Node limit exceeded, no integer solution.

= 3358 ∗ 13429 + 1 = 3359 ∗ 13429 − 13428 (21)

x̂ ≤ 3358 ⇒ −8x2 − 3x3 − 7x4 − 6x5 − 5x6 − 2x7 − 4x8 ≥ 1 (22)

x1 + 2x2 + 2x3 + 3x4 + 3x5 + 4x6 + 4x7 + 5x8 ≥ 3359 (24)

8x2 + 3x3 + 7x4 + 6x5 + 5x6 + 2x7 + 4x8 ≥ 13428 (25)

676 Node Log #8

678 Nodes Cuts/

681 0 0 0.0000 1 0.0000 1

xj , zij binary ∀ i, j (31)

725 Nodes Cuts/

728 * 0+ 0 0.0000 2247 ---

734 * 0+ 0 3185.0000 7445.4286 2286 133.77%

745 Solution pool: 25 solutions saved.

747 MIP - Error termination, no tree: Objective = 3.4830000000e+03

x1 , x2 , x3 , z12 , z13 , z23 binary

zij > 0 ⇐⇒ xi = xj = 1 (32)

z12 + z13 + z23 ≤ 1

785 is a globally valid cut.

796 Node Log #10

* 0+ 0 0.0000 1161 ---

Zero-half cuts applied: 2244

MIP - Integer optimal solution: Objective = 3.5910000000e+03

849 FICO, 2012. Xpress-MP Optimization Suite, Minneapolis, MN.

852 Gurobi, 2012. Gurobi Optimizer, Houston, TX.

853 IBM, 2012. ILOG CPLEX, Incline Village, NV.

861 MOPS, 2012. MOPS, Paderborn, Germany.

862 MOSEK, 2012. MOSEK Optimization Software, Copenhagen, Denmark.

873 Wolsey, L., 1998. Integer Programming, Wiley.

You might also like