NAG Library Function Document
nag_opt_handle_solve_dfls (e04ffc)
Note: this function uses optional parameters to define choices in the problem specification and in the details of the algorithm. If you wish to use default
settings for all of the optional parameters, you need only read Sections 1 to 10 of this document. If, however, you wish to reset some or all of the settings please refer to Section 11 for a detailed description of the algorithm and to Section 12 for a detailed description of the specification of the optional parameters.
1
Purpose
nag_opt_handle_solve_dfls (e04ffc) is a derivative-free solver from the NAG optimization modelling suite for small to medium-scale nonlinear least squares problems with bound constraints.
2
Specification
#include <nag.h> |
#include <nage04.h> |
|
3
Description
nag_opt_handle_solve_dfls (e04ffc) serves as a solver for compatible problems stored as a handle. The handle points to an internal data structure which defines the problem and serves as a means of communication for functions in the suite.
nag_opt_handle_solve_dfls (e04ffc) is aimed at minimizing a sum of a squares objective function subject to bound constraints:
Here the
are smooth nonlinear functions called residuals and
and
are
-dimensional vectors defining bounds on the variables. Typically, in a calibration or data fitting context, the residuals will be defined as the difference between a data point and a nonlinear model (see
Section 2.2.3 in the e04 Chapter Introduction)
To define a compatible problem handle, you must call
nag_opt_handle_init (e04rac) followed by
nag_opt_handle_set_nlnls (e04rmc) to initialize it and optionally call
nag_opt_handle_set_simplebounds (e04rhc) to define bounds on the variables. If
nag_opt_handle_set_simplebounds (e04rhc) is not called, all the variables will be considered free by the solver. It should be noted that
nag_opt_handle_solve_dfls (e04ffc) always assumes that the Jacobian of the residuals is dense, therefore defining a sparse structure for the residuals in the call to
nag_opt_handle_set_nlnls (e04rmc) will have no effect.
It is possible to fix some variables with the definition of the bounds. However, some constraints must be met in order to be able to call the solver:
- the number of non-fixed variables has to be at least
- for all non-fixed variable , the value of has to be at least twice the starting trust region radius (see the consistency constraint of the optional parameter ).
The solver is based on a derivative-free trust region framework. This type of method is well suited for small to medium-scale problems (around 100 variables) for which the derivatives are unavailable or not easy to compute and/or for which the function evaluations are expensive or noisy. For a detailed description of the algorithm see
Section 11. The algorithm behaviour and solver strategy can be modified by various optional parameters (see
Section 12) which can be set by
nag_opt_handle_opt_set (e04zmc) and
nag_opt_handle_opt_set_file (e04zpc) anytime between the initialization of the handle by
nag_opt_handle_init (e04rac) and a call to the solver. The default values for these optional parameters are chosen to work well in the general case but it is recommended to tune them to your particular problem. In particular, if the objective function is noisy, it is highly recommended to set the optional parameter
to SLOW to improve convergence.
Once the solver has finished, options may be modified for the next solve. The solver may be called repeatedly with various starting points and/or optional parameters.
4
References
Powell M J D (2009) The BOBYQA algorithm for bound constrained optimization without derivatives
Report DAMTP 2009/NA06 University of Cambridge
http://www.damtp.cam.ac.uk/user/na/NA_papers/NA2009_06.pdf
Zhang H, CONN A R and Scheinberg k (2010) A Derivative-Free Algorithm for Least-Squares Minimization SIAM J. Optim. 20(6) 3555–3576
5
Arguments
- 1:
– void *Input
-
On entry: the handle to the problem. It needs to be initialized by
nag_opt_handle_init (e04rac) and the objective must be declared as nonlinear least squares by a call to the function
nag_opt_handle_set_nlnls (e04rmc). The function
nag_opt_handle_set_simplebounds (e04rhc) can optionally be called to define box bounds. It
must not be changed between calls to the NAG optimization modelling suite.
- 2:
– function, supplied by the userExternal Function
-
objfun must evaluate the value of the nonlinear residuals
at a specified point
.
The specification of
objfun is:
- 1:
– IntegerInput
-
On entry:
, the number of variables in the problem, as set during the initialization of the handle by
nag_opt_handle_init (e04rac).
- 2:
– const doubleInput
-
On entry: , the vector of variable values at which the residuals are to be evaluated.
- 3:
– IntegerInput
-
On entry:
, the number of residuals in the problem, as set during the initialization of the handle by
nag_opt_handle_set_nlnls (e04rmc).
- 4:
– doubleOutput
-
On exit: the value of the residuals at .
- 5:
– Integer *Input/Output
-
On entry: a non-negative value.
On exit: may be used to request the solver to stop immediately. Specifically, if
then the value of
rx will be discarded and the solver will terminate immediately with
NE_USER_STOP otherwise, the solver will proceed normally.
- 6:
– Nag_Comm *
Pointer to structure of type Nag_Comm; the following members are relevant to
objfun.
- user – double *
- iuser – Integer *
- p – Pointer
The type Pointer will be
void *. Before calling
nag_opt_handle_solve_dfls (e04ffc) you may allocate memory and initialize these pointers with various quantities for use by
objfun when called from
nag_opt_handle_solve_dfls (e04ffc) (see
Section 3.3.1.1 in How to Use the NAG Library and its Documentation).
Note: objfun should not return floating-point NaN (Not a Number) or infinity values, since these are not handled by
nag_opt_handle_solve_dfls (e04ffc). If your code inadvertently
does return any NaNs or infinities,
nag_opt_handle_solve_dfls (e04ffc) is likely to produce unexpected results.
- 3:
– function, supplied by the userExternal Function
-
mon is provided to enable you to monitor the progress of the optimization and, if necessary, to halt the optimization process using argument
inform.
If no monitoring is required,
mon may be specified as
NULLFN.
mon is called at the end of every
step where
is controlled by the optional parameter
(default value
,
mon is never called).
The specification of
mon is:
- 1:
– IntegerInput
-
On entry: , the number of variables in the problem.
- 2:
– const doubleInput
-
On entry: the current best point.
- 3:
– Integer *Input/Output
-
On entry: a non-negative value.
On exit: may be used to request the solver to stop immediately. Specifically, if
then the value of
rx will be discarded and the solver will terminate immediately with
NE_USER_STOP otherwise, the solver will proceed normally.
- 4:
– const doubleInput
-
On entry: best objective value computed and various indicators (the values are as described in the main argument
rinfo).
- 5:
– const doubleInput
-
On entry: solver statistics at the end of the current iteration (the values are as described in the main argument
stats).
- 6:
– Nag_Comm *
Pointer to structure of type Nag_Comm; the following members are relevant to
mon.
- user – double *
- iuser – Integer *
- p – Pointer
The type Pointer will be
void *. Before calling
nag_opt_handle_solve_dfls (e04ffc) you may allocate memory and initialize these pointers with various quantities for use by
mon when called from
nag_opt_handle_solve_dfls (e04ffc) (see
Section 3.3.1.1 in How to Use the NAG Library and its Documentation).
- 4:
– IntegerInput
-
On entry:
, the number of variables in the problem. It must be unchanged from the value set during the initialization of the handle by
nag_opt_handle_init (e04rac).
Constraint:
.
- 5:
– doubleInput/Output
-
On entry: , the initial estimates of the variables .
On exit: the final values of the variables .
- 6:
– IntegerInput
-
On entry:
, the number of residuals in the problem. It must be unchanged from the value set during the definition of the objective structure by
nag_opt_handle_set_nlnls (e04rmc).
- 7:
– doubleOutput
-
On exit: the values of the residuals at the final point given in
x.
- 8:
– doubleOutput
-
On exit: optimal objective value and various indicators at the end of the final iteration as given in the table below:
|
objective function value (sum of the squared residuals); |
|
, the size of trust region at the end of the algorithm; |
|
the number of interpolation points used by the solver. |
|
reserved for future use. |
- 9:
– doubleOutput
-
On exit: solver statistics at the end of the final iteration as given in the table below:
|
number of calls to the objective function; |
|
if is activated, total time spent in the solver (including time spent evaluating the objective); |
|
if is activated, total time spent evaluating the objective function; |
|
number of steps. |
|
reserved for future use. |
- 10:
– Nag_Comm *
-
The NAG communication argument (see
Section 3.3.1.1 in How to Use the NAG Library and its Documentation).
- 11:
– NagError *Input/Output
-
The NAG error argument (see
Section 3.7 in How to Use the NAG Library and its Documentation).
6
Error Indicators and Warnings
- NE_ALLOC_FAIL
-
Dynamic memory allocation failed.
See
Section 2.3.1.2 in How to Use the NAG Library and its Documentation for further information.
- NE_BAD_PARAM
-
On entry, argument had an illegal value.
- NE_BOUND
-
Optional argument
,
,
and
.
Constraint: if
in coordinate
, then
.
Use
nag_opt_handle_opt_set (e04zmc) to set compatible option values.
- NE_HANDLE
-
The supplied
handle does not define a valid handle to the data structure for the NAG optimization modelling suite. It has not been initialized by
nag_opt_handle_init (e04rac) or it has been corrupted.
- NE_INT
-
There were unequal bounds.
Constraint: .
There were
unequal bounds and the optional argument
Constraint:
.
Use
nag_opt_handle_opt_set (e04zmc) to set compatible option values.
- NE_INTERNAL_ERROR
-
An internal error has occurred in this function. Check the function call and any array sizes. If the call is correct then please contact
NAG for assistance.
See
Section 2.7.6 in How to Use the NAG Library and its Documentation for further information.
- NE_NO_IMPROVEMENT
-
No progress, the solver was stopped after consecutive slow steps.
Use the optional argument to modify the maximum number of slow steps accepted.
The solver stopped after consecutive slow steps and a trust region above the tolerance set by .
- NE_NO_LICENCE
-
Your licence key may have expired or may not have been installed correctly.
See
Section 2.7.5 in How to Use the NAG Library and its Documentation for further information.
- NE_PHASE
-
The problem is already being solved.
- NE_REAL_2
-
Inconsistent optional arguments
and
.
Constraint:
.
Use
nag_opt_handle_opt_set (e04zmc) to set compatible option values.
Inconsistent optional arguments
and
.
Constraint:
.
Use
nag_opt_handle_opt_set (e04zmc) to set compatible option values.
- NE_REF_MATCH
-
The information supplied does not match with that previously stored.
On entry,
must match that given during the definition of the objective in the
handle, i.e.,
.
The information supplied does not match with that previously stored.
On entry,
must match that given during initialization of the
handle, i.e.,
.
- NE_RESCUE_FAILED
-
A rescue procedure has been called in order to correct damage from rounding errors when computing an update to a quadratic approximation of
, but no further progress could be made. Check your specification of
objfun and whether the function needs rescaling. Try a different initial
x.
- NE_SETUP_ERROR
-
The solver does not support the model defined in the handle.
It supports only nonlinear least squares problems with bound constraints.
- NE_TIME_LIMIT
-
The solver terminated after the maximum time allowed was exceeded.
Maximum number of seconds exceeded. Use option to reset the limit.
- NE_TOO_MANY_ITER
-
Maximum number of function evaluations exceeded.
- NE_TR_STEP_FAILED
-
The predicted reduction in a trust region step was non-positive. Check your specification of
objfun and whether the function needs rescaling. Try a different initial
x.
- NE_USER_STOP
-
User requested termination after a call to the objective function.
inform was set to a negative value within the user-supplied function
objfun.
User requested termination during a monitoring step.
inform was set to a negative value within the user-supplied function
mon
- NW_NOT_CONVERGED
-
The problem was solved to an acceptable level after consecutive slow iterations.
Use the optional argument to modify the maximum number of slow steps accepted.
The solver stopped after consecutive slow steps and a trust region below the tolerance set by .
7
Accuracy
The solver can declare convergence on two conditions:
(i) |
The trust region radius is below the tolerance set by the optional parameter . When this condition is met, the corresponding solution will generally be at a distance lower than of a local minimimum. |
(ii) |
The sum of the square of the residuals is below the tolerance set by the optional parameter . In a data fitting context, this condition means that the error between the observed data and the model is smaller than the requested tolerance. |
8
Parallelism and Performance
nag_opt_handle_solve_dfls (e04ffc) makes calls to BLAS and/or LAPACK routines, which may be threaded within the vendor library used by this implementation. Consult the documentation for the vendor library for further information.
Please consult the
x06 Chapter Introduction for information on how to control and interrogate the OpenMP environment used within this function. Please also consult the
Users' Note for your implementation for any additional implementation-specific information.
9.1
Description of the Printed Output
The solver can print information to give an overview of the problem and of the progress of the computation. The output may be sent to two independent
file ID
which are set by optional parameters and . Optional parameters , , and determine the exposed level of detail. This allows, for example, a detailed log file to be generated while the condensed information is displayed on the screen.
By default (, ), four sections are printed to the standard output: a header, a list of options, an iteration log and a summary.
Header
The header contains statistics about the problem. It should look like:
----------------------------------------------
E04FF, Derivative-free solver for data fitting
(nonlinear least squares problems)
----------------------------------------------
Problem statistics
Number of variables 10
Number of unconstrained variables 10
Number of fixed variables 0
Number of residuals 10
Optional parameters list
If
is set to
, a list of the optional parameters and their values is printed. The list shows all options of the solver, each displayed on one line. The line contains the option name, its current value and an indicator for how it was set. The options left at their defaults are noted by (d) and the ones set by the user are noted by (U). Note that the output format is compatible with the file format expected by
nag_opt_handle_opt_set_file (e04zpc). The output looks as follows:
Stats Time = Yes * U
Dfls Trust Region Tolerance = 1.00000E-07 * U
Dfls Max Objective Calls = 500 * d
Dfls Starting Trust Region = 1.10000E-01 * U
Dfls Number Interp Points = 0 * d
Iteration log
If
, the solver will print a summary line for each step. An iteration is considered successful when it yields a decrease of the objective sufficiently close to the decrease predicted by the quadratic model. The line shows the step number, the value of the objective function, the radius of the trust region and the cumulative number of objective function evaluations. The output looks as follow:
----------------------------------------
step | obj rho | nf |
----------------------------------------
1 | 3.82E+01 1.10E-01 | 13 |
2 | 3.55E+01 1.10E-01 | 14 |
3 | 3.05E+01 1.10E-01 | 15 |
4 | 2.15E+01 1.10E-01 | 18 |
Occasionally, the letter ‘s’ is printed at the end of the line indicating that the progress is considered slow by the slow convergence detection heuristic. After a certain number of consecutive slow steps, the solver is stopped. The limit on the number of slow iterations can be controlled by the optional parameter and the tolerance on the trust region radius before the solver can be stopped is driven by .
Summary
Once the solver finishes, a summary is produced:
Status: Converged, small trust region size.
Value of the objective 2.23746E-06
Number of objective function evaluations 107
Number of steps 51
Optionally, if
is set to
, the timings are printed:
Timings
Total time spent in the solver 0.056
Time spent in the objective evaluation 0.012
Additionally, if
is set to
, the solution is printed along with the bounds:
Computed Solution:
idx Lower bound Value Upper bound
1 -inf -1.00000E+00 inf
2 -inf -1.00000E+00 inf
3 -inf -1.00000E+00 inf
4 -inf -1.00000E+00 inf
10
Example
In this example, we minimize the Kowalik and Osborne function with bounds on some of the variables. In this problem, the number of variables
and the number of residuals
. The residuals
are computed by
where
The following bounds are defined on the variables
10.1
Program Text
Program Text (e04ffce.c)
10.2
Program Data
None.
10.3
Program Results
Program Results (e04ffce.r)
11
Algorithmic Details
This section contains a short description of the algorithm used in
nag_opt_handle_solve_dfls (e04ffc) which is based on Powell's method BOBYQA
Powell (2009) and the work of
Zhang et al. (2010). It is based on a model-based derivative-free trust region framework adapted to exploit least squares problems structure.
11.1
Derivative-free trust region algorithm
In this section, we are interested in generic problems of the form
where the derivatives of the objective function
are not easily available. A model-based derivative-free optimization (DFO) algorithm maintains a set of points
centred on an iterate
to build quadratic interpolation models of the objective
where
and
are built with the interpolation conditions
Note that if the number of interpolation points
is smaller than
, the model chosen is the one for which the hessian
is the closest to
in the Frobenius norm sense.
This model is iteratively optimized over a trust region, updated and moved around the new computed points. More precisely, it can be described as:
- DFO Algorithm
-
1. |
Initialization
Choose an initial interpolation set , trust region radius and build the first quadratic model . |
2. |
Iteration k
(i) |
Minimize the model in the trust region to obtain a step . |
(ii) |
If the step is too small, adjust the geometry of the interpolation set and the trust region size and restart the iteration. |
(iii) |
Evaluate the objective at the new point . |
(iv) |
Replace a far away point from by to create . |
(v) |
If the decrease of the objective is sufficient (successful step), choose , else choose . |
(vi) |
Choose and adjust the geometry of , if necessary. |
(vii) |
Build using the new interpolation set. |
(viii) |
Stop the algorithm if is below the chosen tolerance . |
|
In the rest of this documentation page, we call an iteration successful when the trial point
is accepted as the next iterate.
11.2
Bounds on the variables
The bounds on the variables are handled during the model optimization step (step
2(i) of
DFO Algorithm) with an active set method. If a bound is hit, it is fixed and step
2(i) is restarted. The set of active constraints is kept throughout the optimization, progressively fixing the corresponding variables.
11.3
Adaptation to nonlinear least squares problems
In the specific case where
is a sum of square
, a good approximation of the hessian of the objective can be
where
is the
by
first derivative matrix of
. This approximation is the main idea behind the Gauss–Newton and Levenberg–Marquardt methods. Following the work of
Zhang et al. (2010), it is possible to adapt it to the DFO framework. In
nag_opt_handle_solve_dfls (e04ffc), one quadratic model is built for each residual
We call
. To build the model of the objective
, we then choose
where
is chosen as
and
as
The constants
,
and
are chosen as proposed in
Zhang et al. (2010).
The first expression amounts to making a Gauss–Newton approximation when we are far from a stationary point, the second to a Levenberg–Marquardt approximation when we are close to a stationary point with small residuals while the third takes the full hessian into account.
nag_opt_handle_solve_dfls (e04ffc) integrates this method of building models into the framework presented in the algorithm
DFO Algorithm.
12
Optional Parameters
Several optional parameters in nag_opt_handle_solve_dfls (e04ffc) define choices in the problem specification or the algorithm logic. In order to reduce the number of formal arguments of nag_opt_handle_solve_dfls (e04ffc) these optional parameters have associated default values that are appropriate for most problems. Therefore, you need only specify those optional parameters whose values are to be different from their default values.
The remainder of this section can be skipped if you wish to use the default values for all optional parameters.
The optional parameters can be changed by calling
nag_opt_handle_opt_set (e04zmc) anytime between the initialization of the handle by
nag_opt_handle_init (e04rac) and the call to the solver. Modification of the arguments during intermediate monitoring stops is not allowed. Once the solver finishes, the optional parameters can be altered again for the next solve.
The option values may be retrieved by
nag_opt_handle_opt_get (e04znc).
The following is a list of the optional parameters available. A full description of each optional parameter is provided in
Section 12.1.
12.1
Description of the Optional Parameters
For each option, we give a summary line, a description of the optional parameter and details of constraints.
The summary line contains:
- the keywords, where the minimum abbreviation of each keyword is underlined;
- a parameter value,
where the letters , and denote options that take character, integer and real values respectively.
- the default value, where the symbol is a generic notation for machine precision (see nag_machine_precision (X02AJC)).
All options accept the value to return single options to their default states.
Keywords and character values are case and white space insensitive.
This special keyword may be used to reset all optional parameters to their default values. Any argument value given with this keyword will be ignored.
DFLS Maximum Slow Steps | | Default |
If , this argument defines the maximum number of consecutive slow iterations allowed. Set it to 0 to deactivate the slow iteration detection. The algorithm can stop in two situations:
and
with
NW_NOT_CONVERGED
with
NE_NO_IMPROVEMENT
Constraint: .
DFLS Max Objective Calls | | Default |
A limit on the number of objective function evaluations the solver is allowed to compute. If the limit is reached, the solver stops with
NE_TOO_MANY_ITER.
Constraint: .
DFLS Monitor Frequency | | Default |
If
, the solver calls the user defined monitoring function
mon at the end of every
th step.
Constraint: .
DFLS Number Interp Points | | Default |
The number of interpolation points in
(9) used to build the quadratic models. If
, the number of points is chosen to be
where
is the number of non-fixed variables.
Constraint: .
Consistency constraint, the solver stops with
NE_INT if not met:
.
DFLS Print Frequency | | Default |
If , the solver prints the iteration log to the appropriate units at the end of every th step.
Constraint: .
DFLS Small Residuals Tol | | Default |
This option defines the tolerance on the value of the residuals. Namely, the solver declares convergence if
.
Constraint: .
DFLS Starting Trust Region | | Default |
, the initial trust region radius. This argument should be set to about one tenth of the greatest expected overall change to a variable: the initial quadratic model will be constructed by taking steps from the initial x of length along each coordinate direction. The default value assumes that the variables have an order of magnitude 1.
Constraint: .
Consistency constraints, the solver stops with
NE_BOUND or
NE_REAL_2 if not met:
.
DFLS Trust Region Tolerance | | Default |
, the requested trust region radius. The algorithm declares convergence when the trust region radius reaches this limit. It should indicate the absolute accuracy that is required in the final values of the variables.
Constraint: .
Consistency constraints, the solver stops with
NE_BOUND or
NE_REAL_2 if not met:
.
DFLS Trust Region Slow Tol | | Default |
The minimal acceptable trust region radius for the solution to be declared as acceptable. The solver stops if:
and
Constraint: .
Consistency constraints, the solver stops with
NE_BOUND or
NE_REAL_2 if not met:
DFLS Trust Region Update | | Default |
Controls the speed at which the trust region is decreased after unsuccessful iterations. In smooth non-noisy cases, a fast decrease often leads to faster convergence.
However, in noisy cases, a slow decrease is recommended to avoid premature stops.
Constraint: or .
Infinite Bound Size | | Default |
This defines the ‘infinite’ bound in the definition of the problem constraints. Any upper bound greater than or equal to will
be regarded as (and similarly any lower bound less than or equal to will be regarded as ). Note that a modification of this optional parameter does not influence constraints which have already been defined; only the constraints formulated after the change will be affected.
Constraint: .
Monitoring File | | Default |
(See
Section 3.3.1.1 in How to Use the NAG Library and its Documentation for further information on NAG data types.)
If
, the
Nag_FileID number (as returned from
nag_open_file (x04acc))
for the secondary (monitoring) output. If set to
, no secondary output is provided. The information output to this file ID is controlled by
.
Constraint: .
Monitoring Level | | Default |
This argument sets the amount of information detail that will be printed by the solver to the secondary output. The meaning of the levels is the same as with .
Constraint: .
Print File | | Default
|
(See
Section 3.3.1.1 in How to Use the NAG Library and its Documentation for further information on NAG data types.)
If
, the
Nag_FileID number (as returned from
nag_open_file (x04acc),
stdout as the default)
for the primary output of the solver. If
, the primary output is completely turned off independently of other settings. The information output to this unit is controlled by
.
Constraint: .
This argument defines how detailed information should be printed by the solver to the primary and secondary output.
|
Output |
|
No output from the solver |
|
The Header and Summary. |
,,, |
Additionally, the Iteration log. |
Constraint: .
Print Options | | Default |
If , a listing of optional parameters will be printed to the primary output. It is always printed to the secondary output.
Constraint: or .
Print Solution | | Default |
If , the solution will be printed to the primary and secondary output.
Constraint: or .
This argument turns on timings of various parts of the algorithm to give a better overview of where most of the time is spent. This might be helpful for a choice of different solving approaches. It is possible to choose between CPU and wall clock time. Choice is equivalent to wall clock.
Constraint: , , or .
A limit on seconds that the solver can use to solve one problem. If during the convergence check this limit is exceeded, the solver will terminate with
NE_TIME_LIMIT error message.
Warning: the timings are not computed if is set to . The solver will therefore NOT be stopped if the time limit is exceeded in such a case.
Constraint: .