multiobjective optimization algorithms
multiobjective optimization definition
there are two optimization toolbox™ multiobjective solvers: fgoalattain
and fminimax
.
fgoalattain
addresses the problem of reducing a set of nonlinear functions fi(x) below a set of goals f*i. since there are several functions fi(x), it is not always clear what it means to solve this problem, especially when you cannot achieve all the goals simultaneously. therefore, the problem is reformulated to one that is always well-defined.the unscaled goal attainment problem is to minimize the maximum of fi(x) – f*i.
there is a useful generalization of the unscaled problem. given a set of positive weights wi, the goal attainment problem tries to find x to minimize the maximum of
(1) this minimization is supposed to be accomplished while satisfying all types of constraints: c(x) ≤ 0, ceq(x) = 0, a·x ≤ b, aeq·x = beq, and l ≤ x ≤ u.
if you set all weights equal to 1 (or any other positive constant), the goal attainment problem is the same as the unscaled goal attainment problem. if the f*i are positive, and you set all weights as wi = f*i, the goal attainment problem becomes minimizing the relative difference between the functions fi(x) and the goals f*i.
in other words, the goal attainment problem is to minimize a slack variable γ, defined as the maximum over i of the expressions in equation 1. this implies the expression that is the formal statement of the goal attainment problem:
such that f(x) – w·γ ≤ f*, c(x) ≤ 0, ceq(x) = 0, a·x ≤ b, aeq·x = beq, and l ≤ x ≤ u.
fminimax
addresses the problem of minimizing the maximum of a set of nonlinear functions, subject to all types of constraints:such that c(x) ≤ 0, ceq(x) = 0, a·x ≤ b, aeq·x = beq, and l ≤ x ≤ u.
clearly, this problem is a special case of the unscaled goal attainment problem, with f*i = 0 and wi = 1.
algorithms
goal attainment method
this section describes the goal attainment method of gembicki [3]. this method uses a set of design goals, , associated with a set of objectives, f(x) = {f1(x),f2(x),...,fm(x)}. the problem formulation allows the objectives to be under- or overachieved, enabling the designer to be relatively imprecise about the initial design goals. the relative degree of under- or overachievement of the goals is controlled by a vector of weighting coefficients, w = {w1,w2,...,wm}, and is expressed as a standard optimization problem using the formulation
(2) |
such that
the term wiγ introduces an element of slackness into the problem, which otherwise imposes that the goals be rigidly met. the weighting vector, w, enables the designer to express a measure of the relative tradeoffs between the objectives. for instance, setting the weighting vector w equal to the initial goals indicates that the same percentage under- or overachievement of the goals, f*, is achieved. you can incorporate hard constraints into the design by setting a particular weighting factor to zero (i.e., wi = 0). the goal attainment method provides a convenient intuitive interpretation of the design problem, which is solvable using standard optimization procedures. illustrative examples of the use of the goal attainment method in control system design can be found in fleming ( and ).
the goal attainment method is represented geometrically in the figure below in two dimensions.
figure 7-1, geometrical representation of the goal attainment method
specification of the goals, , defines the goal point, p. the weighting vector defines the direction of search from p to the feasible function space, λ(γ). during the optimization γ is varied, which changes the size of the feasible region. the constraint boundaries converge to the unique solution point f1s, f2s.
algorithm improvements for the goal attainment method
the goal attainment method has the advantage that it can be posed as a nonlinear programming problem. characteristics of the problem can also be exploited in a nonlinear programming algorithm. in sequential quadratic programming (sqp), the choice of merit function for the line search is not easy because, in many cases, it is difficult to “define” the relative importance between improving the objective function and reducing constraint violations. this has resulted in a number of different schemes for constructing the merit function (see, for example, schittkowski ). in goal attainment programming there might be a more appropriate merit function, which you can achieve by posing equation 2 as the minimax problem
(3) |
where
following the argument of brayton et al. [1] for minimax optimization using sqp, using the merit function of equation 30 for the goal attainment problem of equation 3 gives
(4) |
when the merit function of equation 4 is
used as the basis of a line search procedure, then, although ψ(x,γ) might
decrease for a step in a given search direction, the function max
λi might
paradoxically increase. this is accepting a degradation in the worst
case objective. since the worst case objective is responsible for
the value of the objective function γ, this
is accepting a step that ultimately increases the objective function
to be minimized. conversely, ψ(x,γ) might
increase when max
λi decreases,
implying a rejection of a step that improves the worst case objective.
following the lines of brayton et al. [1], a solution is therefore to set ψ(x) equal to the worst case objective, i.e.,
(5) |
a problem in the goal attainment method is that it is common to use a weighting coefficient equal to 0 to incorporate hard constraints. the merit function of equation 5 then becomes infinite for arbitrary violations of the constraints.
to overcome this problem while still retaining the features of equation 5, the merit function is combined with that of equation 31, giving the following:
(6) |
another feature that can be exploited in sqp is the objective function γ. from the kkt equations it can be shown that the approximation to the hessian of the lagrangian, h, should have zeros in the rows and columns associated with the variable γ. however, this property does not appear if h is initialized as the identity matrix. h is therefore initialized and maintained to have zeros in the rows and columns associated with γ.
these changes make the hessian, h, indefinite.
therefore h is set to have zeros in the rows and
columns associated with γ, except for the
diagonal element, which is set to a small positive number (e.g., 1e
-10).
this allows use of the fast converging positive definite qp method
described in quadratic programming solution.
the preceding modifications have been implemented in and have been found to make the method more robust. however, because of the rapid convergence of the sqp method, the requirement that the merit function strictly decrease sometimes requires more function evaluations than an implementation of sqp using the merit function of equation 30.
minimizing the maximum objective
fminimax
uses a goal attainment method.
it takes goals of 0, and weights of 1. with this formulation, the
goal attainment problem becomes
which is the minimax problem.
parenthetically, you might expect fminimax
to
turn the multiobjective function into a single objective. the function
f(x) = max(f1(x),...fj(x))
references
[1] brayton, r. k., s. w. director, g. d. hachtel, and l. vidigal, “a new algorithm for statistical circuit design based on quasi-newton methods and function splitting,” ieee transactions on circuits and systems, vol. cas-26, pp 784-794, sept. 1979.
[2] fleming, p.j. and a.p. pashkevich, computer aided control system design using a multi-objective optimisation approach, control 1985 conference, cambridge, uk, pp. 174-179.
[3] gembicki, f.w., “vector optimization for control with performance and parameter sensitivity indices,” ph.d. dissertation, case western reserve univ., cleveland, oh, 1974.
[4] grace, a.c.w., “computer-aided control system design using optimization techniques,” ph.d. thesis, university of wales, bangor, gwynedd, uk, 1989.
[5] han, s.p., “a globally convergent method for nonlinear programming,” journal of optimization theory and applications, vol. 22, p. 297, 1977.
[6] madsen, k. and h. schjaer-jacobsen, “algorithms for worst case tolerance optimization,” ieee trans. of circuits and systems, vol. cas-26, sept. 1979.
[7] powell, m.j.d., “a fast algorithm for nonlinear constrained optimization calculations,” numerical analysis, ed. g.a. watson, lecture notes in mathematics, vol. 630, springer verlag, 1978.
see also
|