Second-Order Cone Programming Algorithm
Definition of Second-Order Cone Programming
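A second-order cone programming problem has the form
$$\min_x f^T x$$
subject to the constraints
$$\begin{aligned} \|A_{sc}(i)\,x - b_{sc}(i)\| &\le d_{sc}^T(i)\,x - \gamma(i) \\ A\,x &\le b \\ A_{eq}\,x &= b_{eq} \\ lb \le x &\le ub. \end{aligned}$$
f, x, b, beq, lb, and ub are vectors, and A and Aeq are matrices. For each i, the matrix Asc(i), the vectors bsc(i) and dsc(i), and the scalar γ(i) are in a second-order cone constraint that you create using secondordercone.
In other words, the problem has a linear objective function and linear constraints, as well as a set of second-order cone constraints of the form $\|A_{sc}(i)\,x - b_{sc}(i)\| \le d_{sc}^T(i)\,x - \gamma(i)$.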
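As a minimal sketch of this problem form, the following code solves a small three-variable problem with one second-order cone constraint and bounds. The data values (f, Asc, bsc, dsc, gamma, lb, ub) are assumptions chosen only for illustration.

```matlab
% Maximize x1 + 2*x2 subject to sqrt(x1^2 + (x2/2)^2) <= x3 and 0 <= x3 <= 2.
f = [-1; -2; 0];                  % coneprog minimizes f'*x
Asc = diag([1, 1/2, 0]);
bsc = zeros(3,1);
dsc = [0; 0; 1];
gamma = 0;
% Cone constraint: norm(Asc*x - bsc) <= dsc'*x - gamma
socConstraints = secondordercone(Asc, bsc, dsc, gamma);

lb = [-Inf; -Inf; 0];
ub = [ Inf;  Inf; 2];
% This example has no linear inequality or equality constraints.
[x, fval] = coneprog(f, socConstraints, [], [], [], [], lb, ub);
```

For this data, the optimal objective value can be checked by hand: substituting u = x1 and v = x2/2 reduces the problem to maximizing u + 4v over a disk of radius 2, which gives 2·sqrt(17).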
coneprog Algorithm
The solver uses the algorithm described in Andersen, Roos, and Terlaky [1]. This method is an interior-point algorithm similar to the interior-point linprog algorithm.
Standard Form
The algorithm starts by placing the problem in standard form. The algorithm adds nonnegative slack variables so that the problem has the form
$$\min_x f^T x$$
subject to the constraints
$$A\,x = b, \qquad x \in K.$$
The solver expands the sizes of the linear coefficient vector f and linear constraint matrix A to account for the slack variables.
The region K is the cross product of Lorentz cones (Equation 1) and the nonnegative orthant. To convert each convex cone
$$\|A_{sc}(i)\,x - b_{sc}(i)\| \le d_{sc}^T(i)\,x - \gamma(i)$$
to a Lorentz cone (Equation 1), create a column vector of variables t1, t2, …, tn+1:
$$t_1 = d_{sc}^T x - \gamma, \qquad t_{2:(n+1)} = A_{sc}\,x - b_{sc}.$$
Here, the number of variables n for each cone i is the number of rows in Asc(i). By its definition, the variable vector t satisfies the inequality
$$\|t_{2:(n+1)}\| \le t_1. \tag{1}$$
Equation 1 is the definition of a Lorentz cone in (n+1) variables. The variables t appear in the problem in place of the variables x in the convex region K.
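The change of variables can be sketched numerically. The cone data and the test point x below are assumptions chosen for illustration; the code builds t from x and checks the Lorentz cone inequality.

```matlab
% Illustrative (assumed) cone data for a problem with two variables.
Asc = [1 0; 0 2];
bsc = [0; 1];
dsc = [1; 1];
gamma = -1;
x = [1; 1];                 % test point, chosen for illustration

n = size(Asc, 1);           % number of rows in Asc
t = zeros(n + 1, 1);
t(1) = dsc' * x - gamma;    % t_1 = dsc'*x - gamma
t(2:n+1) = Asc * x - bsc;   % t_{2:(n+1)} = Asc*x - bsc

% x satisfies the original cone constraint exactly when t lies in the Lorentz cone.
inCone = norm(t(2:end)) <= t(1);
```

Here t = [3; 1; 1], and norm([1; 1]) is about 1.41, so the point lies inside the cone.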
Internally, the algorithm also uses a rotated Lorentz cone in the reformulation of cone constraints, but this topic does not address that case. For details, see Andersen, Roos, and Terlaky [1].
When adding slack variables, the algorithm negates variables, as needed, and adds appropriate constants so that:
Variables with only one bound have a lower bound of zero.
Variables with two bounds have a lower bound of zero and, using a slack variable, have no upper bound.
Variables without bounds are placed in a Lorentz cone with a slack variable as the constrained variable. This slack variable is not part of any other expression, objective or constraint.
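As a worked illustration of the first two rules, consider hypothetical scalar variables whose bounds are chosen only for this example. The substitutions below are one way to satisfy the rules. They are a sketch, and not necessarily the exact transformation that the solver applies.

```latex
% Hypothetical bounds, for illustration only.
\begin{aligned}
&\text{Upper bound only, } x \le 5: && x = 5 - x', \quad x' \ge 0. \\
&\text{Two bounds, } 2 \le x \le 5: && x = 2 + x', \quad x' + w = 3, \quad x' \ge 0,\ w \ge 0.
\end{aligned}
```

In the second case, the slack variable w absorbs the upper bound, so the shifted variable x' has only a lower bound of zero.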
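Dual Problem
The dual cone is
$$K^* = \{\, s : s^T x \ge 0 \ \ \forall x \in K \,\}.$$
The dual problem is
$$\max_y b^T y$$
such that
$$A^T y + s = f$$
for some
$$s \in K^*.$$
A dual optimal solution is a point (y,s) that satisfies the dual constraints and maximizes the dual objective.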
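The dual constraints give a lower bound on the primal objective. As a short derivation, assume that x is feasible for the standard-form primal problem (A x = b with x in K) and that (y,s) is feasible for the dual problem (A^T y + s = f with s in K*):

```latex
f^T x - b^T y = (A^T y + s)^T x - (A x)^T y = s^T x \ge 0 .
```

The final inequality is the definition of the dual cone. So every dual feasible point bounds the primal objective from below, and the two objective values agree exactly when s^T x = 0.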
Homogeneous Self-Dual Formulation
To handle potentially infeasible or unbounded problems, the algorithm adds two more variables τ and κ and formulates the problem as homogeneous (equal to zero) and self-dual.
$$\begin{aligned} A x - b\tau &= 0 \\ A^T y + s - f\tau &= 0 \\ -f^T x + b^T y - \kappa &= 0 \end{aligned} \tag{2}$$
along with the constraints
$$(x;\tau) \in \bar{K}, \qquad (s;\kappa) \in \bar{K}^*. \tag{3}$$
Here, $\bar{K}$ is the cone K adjoined with the nonnegative real line, which is the space for (x;τ). Similarly, $\bar{K}^*$ is the cone $K^*$ adjoined with the nonnegative real line, which is the space for (s;κ). In this formulation, the following lemma shows that τ is the scaling for feasible solutions, and κ is the indicator of an infeasible problem.
Lemma ([1] Lemma 2.1)
Let (x, τ, y, s, κ) be a feasible solution of Equation 2 along with the constraints in Equation 3.
$x^T s + \tau\kappa = 0$.
If τ > 0, then (x, y, s)/τ is a primal-dual optimal solution of the standard form second-order cone problem.
If κ > 0, then at least one of these strict inequalities holds:
$b^T y > 0$
$f^T x < 0$.
If the first inequality holds, then the standard form, primal second-order cone problem is infeasible. If the second inequality holds, then the standard form, dual second-order cone problem is infeasible.
In summary, for feasible problems, the variable τ scales the solution between the original standard form problem and the homogeneous self-dual problem. For infeasible problems, the final iterate (x, y, s, τ, κ) provides a certificate of infeasibility for the original standard form problem.
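The first claim of the lemma follows from the linear equations alone. As a sketch, assume that Equation 2 consists of the three equations A x − bτ = 0, A^T y + s − fτ = 0, and −f^T x + b^T y − κ = 0. Then:

```latex
\begin{aligned}
x^T s &= x^T (f\tau - A^T y)        && \text{(second equation)} \\
      &= \tau f^T x - (A x)^T y \\
      &= \tau f^T x - \tau b^T y    && \text{(first equation)} \\
      &= -\tau\kappa                && \text{(third equation)}
\end{aligned}
```

Because the cone constraints give x^T s ≥ 0 and τκ ≥ 0, both terms must be zero. In particular, τ and κ cannot both be positive.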
Start Point
The start point for the iterations is the feasible point:
x = 1 for each nonnegative variable, 1 for the first variable in each Lorentz cone, and 0 otherwise.
y = 0.
s = (1,0,…,0) for each cone, 1 for each nonnegative variable.
τ = 1.
κ = 1.
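The start point can be sketched for a small, assumed variable layout. The cone sizes, the number of nonnegative variables, the number of constraint rows, and the ordering of cone variables before nonnegative variables are all hypothetical choices for illustration.

```matlab
% Assumed layout: two Lorentz cones of sizes 3 and 2, followed by
% 4 nonnegative variables; the constraint matrix has 5 rows.
coneSizes = [3 2];
numNonneg = 4;
numRows = 5;

x0 = [];
for n = coneSizes
    x0 = [x0; 1; zeros(n - 1, 1)];   % (1,0,...,0) for each Lorentz cone
end
x0 = [x0; ones(numNonneg, 1)];       % 1 for each nonnegative variable
s0 = x0;                             % s follows the same pattern as x
y0 = zeros(numRows, 1);
tau0 = 1;
kappa0 = 1;
```

Each cone block (1,0,…,0) lies strictly inside its Lorentz cone, and each nonnegative variable is strictly positive, so this point is in the interior of the cone.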
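Central Path
The algorithm attempts to follow the central path, which is the parameterized solution to the following equations for γ decreasing from 1 toward 0.
$$\begin{aligned} A x - b\tau &= \gamma\,(A x_0 - b\tau_0) \\ A^T y + s - f\tau &= \gamma\,(A^T y_0 + s_0 - f\tau_0) \\ -f^T x + b^T y - \kappa &= \gamma\,(-f^T x_0 + b^T y_0 - \kappa_0) \\ X S e &= \gamma \mu_0 e \\ \tau\kappa &= \gamma \mu_0. \end{aligned} \tag{4}$$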
Each variable with a 0 subscript indicates the start point of the variable.
The variables X and S are arrow head matrices formed from the x and s vectors, respectively. For a vector x = [x1,x2,…,xn], the arrow head matrix X has the definition
$$X = \operatorname{mat}(x) = \begin{bmatrix} x_1 & x_2 & \cdots & x_n \\ x_2 & x_1 & 0 & \vdots \\ \vdots & 0 & \ddots & 0 \\ x_n & \cdots & 0 & x_1 \end{bmatrix}.$$
By its definition, X is symmetric.
The variable e is the vector with a 1 in each cone coordinate corresponding to the x1 Lorentz cone coordinate.
The variable μ0 has the definition
$$\mu_0 = \frac{x_0^T s_0 + \tau_0 \kappa_0}{k + 1},$$
where k is the number of nonzero elements in x0.
The central path begins at the start point and ends at an optimal solution to the homogeneous self-dual problem.
Andersen, Roos, and Terlaky [1] show in Lemma 3.1 that the complementarity condition $x^T s = 0$, where x and s are in a product of Lorentz cones L, is equivalent to the condition
$$X_i S_i e_i = S_i X_i e_i = 0$$
for every cone i. Here Xi = mat(xi), xi is the variable associated with the Lorentz cone i, Si = mat(si), and ei is the unit vector [1,0,0,…,0] of the appropriate dimension. This discussion shows that the central path satisfies the complementarity condition at its end point.
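The arrow head matrix can be sketched in a few lines. The helper name arrow and the vectors x and s are hypothetical, chosen for illustration, and the helper assumes a column vector with at least two entries.

```matlab
% Arrow head matrix of a column vector x = [x1; x2; ...; xn], n >= 2.
arrow = @(x) [x(1), x(2:end)'; x(2:end), x(1) * eye(numel(x) - 1)];

x = [2; 1; 1];          % illustrative vectors for one 3-variable cone
s = [3; -1; 0];
X = arrow(x);
S = arrow(s);
e = [1; 0; 0];          % unit vector for this cone

symmetric = isequal(X, X');
% S*e is the first column of S, which is s, so X*S*e equals X*s.
% The first entry of X*s is the inner product x'*s.
XSe = X * S * e;

% At the start point x0 = s0 = e, both arrow head matrices are the identity.
startOK = isequal(arrow(e) * arrow(e) * e, e);
```

The first entry of X*S*e is x'*s, which is why a zero value of X*S*e implies the complementarity condition.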
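Search Direction
To obtain points near the central path as the parameter γ decreases from 1 toward 0, the algorithm uses Newton's method. The variables to find are labeled (x, τ, y, s, κ). Let dx represent the search direction for the x variables, and so on. Then the Newton step solves the following linear system, derived from Equation 4.
$$\begin{aligned} A d_x - b\,d_\tau &= (\gamma - 1)(A x_0 - b\tau_0) \\ A^T d_y + d_s - f\,d_\tau &= (\gamma - 1)(A^T y_0 + s_0 - f\tau_0) \\ -f^T d_x + b^T d_y - d_\kappa &= (\gamma - 1)(-f^T x_0 + b^T y_0 - \kappa_0) \\ X_0 d_s + S_0 d_x &= -X_0 S_0 e + \gamma \mu_0 e \\ \tau_0 d_\kappa + \kappa_0 d_\tau &= -\tau_0 \kappa_0 + \gamma \mu_0. \end{aligned}$$
The algorithm obtains its next point by taking a step in the d direction.
$$\begin{bmatrix} x_1 \\ \tau_1 \\ y_1 \\ s_1 \\ \kappa_1 \end{bmatrix} = \begin{bmatrix} x_0 \\ \tau_0 \\ y_0 \\ s_0 \\ \kappa_0 \end{bmatrix} + \alpha \begin{bmatrix} d_x \\ d_\tau \\ d_y \\ d_s \\ d_\kappa \end{bmatrix}$$
for some step $\alpha \in [0,1]$.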
For both numerical stability and accelerated convergence, the algorithm scales the step according to a suggestion in Nesterov and Todd [8]. Also, the algorithm corrects the step according to a variant of Mehrotra's predictor-corrector [7]. (For further details, see Andersen, Roos, and Terlaky [1].)
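The following is a minimal sketch of one unscaled Newton step, without the Nesterov-Todd scaling or the predictor-corrector adjustment. It assumes a problem that has only nonnegative variables (so the arrow head matrices reduce to diagonal matrices), assumes the linearization X0*ds + S0*dx for the complementarity equations, and uses made-up data A, b, and f. It is an illustration of the linear algebra, not the solver's implementation.

```matlab
% Illustrative problem with nonnegative variables only (assumed data).
A = [1 1 1];  b = 2;  f = [1; 2; 3];
[m, n] = size(A);

% Start point and centering parameter.
x0 = ones(n,1);  s0 = ones(n,1);  y0 = zeros(m,1);  tau0 = 1;  kappa0 = 1;
gamma = 0.5;
mu0 = (x0'*s0 + tau0*kappa0) / (nnz(x0) + 1);

% Residuals of the homogeneous model at the current point.
rP = A*x0 - b*tau0;
rD = A'*y0 + s0 - f*tau0;
rG = -f'*x0 + b'*y0 - kappa0;

% For nonnegative variables, X0 and S0 are diagonal and e is all ones.
X0 = diag(x0);  S0 = diag(s0);  e = ones(n,1);

% Unknown ordering: d = [dx; dtau; dy; ds; dkappa].
M = [A,           -b,          zeros(m,m),  zeros(m,n),  zeros(m,1);
     zeros(n,n),  -f,          A',          eye(n),      zeros(n,1);
     -f',          0,          b',          zeros(1,n),  -1;
     S0,          zeros(n,1),  zeros(n,m),  X0,          zeros(n,1);
     zeros(1,n),  kappa0,      zeros(1,m),  zeros(1,n),  tau0];
rhs = [(gamma-1)*rP; (gamma-1)*rD; (gamma-1)*rG; ...
       -X0*S0*e + gamma*mu0*e; -tau0*kappa0 + gamma*mu0];
d = M \ rhs;

dx = d(1:n);  dtau = d(n+1);  dy = d(n+2:n+1+m);
ds = d(n+m+2:2*n+m+1);  dkappa = d(end);

% A full step (alpha = 1) scales the linear residuals by gamma.
rP1 = A*(x0+dx) - b*(tau0+dtau);
rD1 = A'*(y0+dy) + (s0+ds) - f*(tau0+dtau);
```

After a full step, the primal and dual residuals equal γ times their starting values, which matches the first two lines of the central path equations. In practice, the step α is shortened as needed so that the new point stays inside the cone.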
Step Solver Variations
The preceding discussion relates to the LinearSolver option with the value 'augmented' specified. The solver has other values that change the step calculation to suit different types of problems.
'auto' (default): coneprog chooses the step solver. If the problem is sparse, the step solver is 'prodchol'. Otherwise, the step solver is 'augmented'.
'normal': The solver uses a variant of the 'augmented' step that is suitable when the problem is sparse. See Andersen, Roos, and Terlaky [1].
'schur': The solver uses a modified Schur complement method for handling a sparse problem with a few dense columns. This method is also suitable for large cones. See Andersen [2].
'prodchol': The solver uses the methods described in Goldfarb and Scheinberg ([4] and [5]) for handling a sparse problem with a few dense columns. This method is also suitable for large cones.
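As a brief sketch, you can select a step solver by setting the LinearSolver option before calling coneprog. The choice of 'prodchol' here is only an example; which value performs best depends on the sparsity and size of the problem.

```matlab
% Choose the product-form Cholesky step solver instead of the default 'auto'.
options = optimoptions('coneprog', 'LinearSolver', 'prodchol');
% Pass options as the last input argument, for example:
% x = coneprog(f, socConstraints, A, b, Aeq, beq, lb, ub, options);
```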
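Iterative Display and Stopping Conditions
At each iteration k, the algorithm computes three relative convergence measures:
Primal infeasibility
$$\mathrm{Infeas}_P^k = \frac{\|A x^k - b\tau^k\|}{\max\left(1, \|A x^0 - b\tau^0\|\right)}$$
Dual infeasibility
$$\mathrm{Infeas}_D^k = \frac{\|A^T y^k + s^k - f\tau^k\|}{\max\left(1, \|A^T y^0 + s^0 - f\tau^0\|\right)}$$
Gap infeasibility
$$\mathrm{Infeas}_G^k = \frac{|-f^T x^k + b^T y^k - \kappa^k|}{\max\left(1, |-f^T x^0 + b^T y^0 - \kappa^0|\right)}$$
You can view these three statistics at the command line by specifying iterative display.
options = optimoptions('coneprog','Display','iter');
All three should approach zero when the problem is feasible and the solver converges. For a feasible problem, the variable $\kappa^k$ approaches zero, and the variable $\tau^k$ approaches a positive constant.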
One stopping condition is related to the gap infeasibility. The solver stops when the following optimality measure decreases below the optimality tolerance.
$$\frac{|f^T x^k - b^T y^k|}{\tau^k + |b^T y^k|} = \frac{|f^T x^k/\tau^k - b^T y^k/\tau^k|}{1 + |b^T y^k/\tau^k|}$$
This statistic measures the precision of the objective value.
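The convergence measures can be sketched directly from their descriptions. This example assumes that each infeasibility measure is the residual of the corresponding homogeneous self-dual equation, relative to its value at the start point, and that the optimality measure is the objective gap relative to τ plus the dual objective magnitude. All data and iterates are hypothetical values chosen for illustration.

```matlab
% Hypothetical standard-form data and iterates (illustration only).
A = [1 1 1];  b = 2;  f = [1; 2; 3];
x0 = ones(3,1);  y0 = 0;  s0 = ones(3,1);  tau0 = 1;  kappa0 = 1;   % start point
xk = [1.9; 0.05; 0.05];  yk = 0.95;  sk = [0.05; 1.05; 2.05];
tauk = 1;  kappak = 0.01;                                           % iterate k

% Relative residuals of the three homogeneous equations.
primalInfeas = norm(A*xk - b*tauk) / max(1, norm(A*x0 - b*tau0));
dualInfeas   = norm(A'*yk + sk - f*tauk) / max(1, norm(A'*y0 + s0 - f*tau0));
gapInfeas    = abs(-f'*xk + b'*yk - kappak) / max(1, abs(-f'*x0 + b'*y0 - kappa0));

% Optimality measure compared against the optimality tolerance.
optMeasure = abs(f'*xk - b'*yk) / (tauk + abs(b'*yk));
```

In this made-up iterate, the primal and dual residuals are essentially zero, while the gap and optimality measures are still well above a typical tolerance, so an interior-point iteration would continue.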
The solver also stops and declares the problem to be infeasible under the following conditions. The three relative infeasibility measures are less than c = ConstraintTolerance, and
$$\tau^k \le c \max(1, \kappa^k).$$
If $b^T y^k > 0$, then the solver declares that the primal problem is infeasible. If $f^T x^k < 0$, then the solver declares that the dual problem is infeasible.
The algorithm also stops when
$$\mu^k \le c\,\mu^0$$
and
$$\tau^k \le c \min(1, \kappa^k).$$
In this case, coneprog reports that the problem is numerically unstable (exit flag -10).
The remaining stopping condition occurs when at least one infeasibility measure is greater than ConstraintTolerance and the computed step size is too small. In this case, coneprog reports that the search direction became too small and no further progress could be made (exit flag -7).
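As a minimal sketch, you can set the tolerances that these stopping conditions use and then inspect the exit flag that coneprog returns. The problem data and tolerance values here are assumptions chosen for illustration.

```matlab
% Tolerances used by the stopping conditions (values chosen for illustration).
options = optimoptions('coneprog', 'Display', 'off', ...
    'ConstraintTolerance', 1e-6, 'OptimalityTolerance', 1e-6);

% Small illustrative problem: one cone constraint plus bounds.
f = [-1; -2; 0];
socConstraints = secondordercone(diag([1, 1/2, 0]), zeros(3,1), [0; 0; 1], 0);
lb = [-Inf; -Inf; 0];
ub = [ Inf;  Inf; 2];

[x, fval, exitflag] = coneprog(f, socConstraints, [], [], [], [], lb, ub, options);

switch exitflag
    case 1
        msg = 'Solver converged to a solution.';
    case -7
        msg = 'Search direction became too small.';
    case -10
        msg = 'Problem is numerically unstable.';
    otherwise
        msg = sprintf('Solver stopped with exit flag %d.', exitflag);
end
```

This feasible, bounded problem is expected to converge, so the first case applies.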
References
[1] Andersen, E. D., C. Roos, and T. Terlaky. On implementing a primal-dual interior-point method for conic quadratic optimization. Math. Program., Ser. B 95, pp. 249–277 (2003).
[2] Andersen, K. D. A modified Schur-complement method for handling dense columns in interior-point methods for linear programming. ACM Transactions on Mathematical Software (TOMS), 22(3):348–356, 1996.
[3] Ben-Tal, Aharon, and Arkadi Nemirovski. Convex Optimization in Engineering: Modeling, Analysis, Algorithms. (1998).
[4] Goldfarb, D. and K. Scheinberg. A product-form Cholesky factorization method for handling dense columns in interior point methods for linear programming. Mathematical Programming, 99(1):1–34, 2004.
[5] Goldfarb, D. and K. Scheinberg. Product-form Cholesky factorization in interior point methods for second-order cone programming. Mathematical Programming, 103(1):153–179, 2005.
[6] Luo, Zhi-Quan, Jos F. Sturm, and Shuzhong Zhang. Duality and Self-Duality for Conic Convex Programming. (1996).
[7] Mehrotra, Sanjay. “On the Implementation of a Primal-Dual Interior Point Method.” SIAM Journal on Optimization 2, no. 4 (November 1992): 575–601.
[8] Nesterov, Yu. E., and M. J. Todd. “Self-Scaled Barriers and Interior-Point Methods for Convex Programming.” Mathematics of Operations Research 22, no. 1 (February 1997): 1–42.