




Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Main discussion in this file is about CNLR, Model, Goal, Algorithm, Convergence Tests, Termination, Bootstrapping Estimates, Major Iteration
Typology: Study notes
1 / 8
This page cannot be seen from the preview
Don't miss anything!





1
CNLR is used to estimate the parameters of a function by minimizing a smooth nonlinear loss function (objective function) when the parameters are subject to a set of constraints.
Consider the model
f = f (x ~ (^) , θ ~ )
where θ ~ is a p × 1 parameter vector, x~ is an independent variable vector, and f is a function of x~ and θ~.
Find the estimate θ ~ ∗of θ ~ such that θ ~ ∗^ minimizes
F = F y f 1 , 6
subject to
l A q C
~ u
~ ≤ ~
% &
K
' K
( )
K
K
θ
θ
L ( (^) ~)
~
where F is the smooth loss function (objective function), which can be specified by the user. A (^) L is an m (^) L × p matrix of linear constraints, and C ( θ~ ) is an m (^) N × 1 vector of nonlinear constraint functions. (^) l ~ (^) ′^ = ( l ~ (^) ′ B (^) , l ~ (^) ′ L (^) , l ~′ N ), where (^) l ~ B ′ , (^) l ~ ′ L ,
and (^) l ~ (^) N^ ′^ represent the lower bounds, linear constraints and nonlinear constraints, respectively. The upper bound u~ is defined similarly.
CNLR uses the algorithms proposed and implemented in NPSOL by Gill, Murray, Saunders, and Wright. A description of the algorithms can be found in the User’s Guide for NPSOL , Version 4.0 (1986). The method used in NPSOL is a sequential quadratic programming (SQP) method. For an overview of SQP methods, see Gill, Murray, and Wright (1981), pp. 237–242. The basic structure of NPSOL involves major and minor iterations. Based on the given initial value θ~( )^0 of θ~ , the algorithm first selects an initial working set that includes bounds or general inequality constraints that lie within a crash tolerance (CRSHTOL). At the k th iteration, the algorithm starts with
This iteration searches for the direction Pk , which is the solution of a quadratic subproblem; that is, Pk , is found by minimizing
g P ′ k (^) + 21 P H P ′ k (1)
subject to
l
~(^^ k^ ) L u ~(^ ) N
≤ k
% &
K
'
K
( )
K
K
where g k is the gradient of F at θ ~^ (^ k^ ), the matrixp H k is a positive-definite quasi-
Newton approximation to the Hessian of the Lagrangian function, A (^) N is the Jacobian matrix of the nonlinear-constraint vector C evaluated at θ~(^ k^ ), and
satisfies the following conditions:
function
L F (^) i ci s (^) i c s i
i i i i
( ,θ λ~ ~ , ) s~ = ( )~θ − ( )θ ~ − ( )~θ
∑ ∑ λ ρ
2 (2)
The summation terms in (2) involve only the nonlinear constraints. The vector λ is an estimate of the Lagrange multipliers for the nonlinear constraints. The non-negative slack variables (^) ; @ si allow nonlinear inequality constraints to be
treated without introducing discontinuities. The solution of the QP subproblem defined in (1) provides a vector triple that serves as a direction search for θ~ ,
λ~ and s~. The non-negative vector of penalty parameters1 6 ρ is initialized to
zero at the beginning of the first major iteration. Function precision criteria are used as a measure of the accuracy with which the functions F and ci can be
evaluated.
g ′ ( (^) ~ θ 1 k^ +^16 ) P k (^) < − η g P ′ k k
where η is the Line Search Tolerance and 0 ≤ η < 1. The value of η determines the accuracy with which α (^) k approximates a stationary point of F along Pk. A smaller value of^ η^ produces a more accurate line search.
θ~ 1^ k^^ +^16 − θ~1 6 k^ = α (^) k P k ≤Step Limit.
After α (^) k is determined from the major iteration, the following conditions are checked:
α (^) k P k (^) ≤ r^ + k
~^ θ
1 6
|| g (^) z ( θ ~ k^ +^ )|| ≤ r + max +| F ( (^) ~θ k^ +^ )|,|| g ( θ~ k + )||
^
1 1 6 1 1 1 16 1 16
and
res (^) j ≤ FTOL, for all j ,
where g z is the projected gradient, g is the gradient of F with respect to the free parameters, res (^) j is the violation of the j th nonlinear constraint, FTOL is the Nonlinear Feasibility Tolerance, and r is the Optimality Tolerance.
If none of these three conditions are satisfied, the algorithm continues with the Minor Iteration to find a new search direction.
The following are termination conditions.
Bootstrapping is a nonparametric technique of estimating the standard error of a parameter estimate using repeated samples from the original data. This is done by sampling with replacement. CNLR computes and saves the parameter estimates for each sample generated. This results, for each parameter, in a sample of estimates from which the standard deviation is calculated. This is the estimated standard error. Mathematically, the bootstrap covariance matrix S for the p parameter estimates is
× sij 3 8 p p
where
s
m
ij ik i jk j k
m
i ik k
m
=
=
∑
∑
θ θ θ θ
θ θ
4 94 9 1
1
and θ$ ik is the CNLR parameter estimate of θ (^) i for the k th bootstrap sample and m
is the number of samples generated by the bootstrap. By default, m
1 6 .
The standard error for the j th parameter estimate is estimated by
s m
jj − 1
and the correlation between the i th and j th parameter estimates is estimated by
s s s
ij ii jj
The “95% Trimmed Range” values are the most extreme values that remain after trimming from the set of estimates for a parameter, the g largest, and the g smallest estimates, where g is the largest integer not exceeding 0.025 m.
Gill, P. E., Murray, W. M., and Wright, M. H. 1981. Practical Optimization. London: Academic Press.
Gill, P. E., Murray, W. M., Saunders, M. A., and Wright, M. H. 1984. Procedures for optimization problems with a mixture of bounds and general linear constraints. ACM Transactions on Mathematical Software , 10 (3): 282–296.
Gill, P. E., Murray, W. M., Saunders, M. A., and Wright, M. H. 1986. User’s guide for NPSOL ( version 4.0): A fortran package for nonlinear programming. Technical Report SOL 86–2, Department of Operations Research, Stanford University.