



An introduction to stochastic optimization methods, which are used when deterministic methods fail due to large numbers of control variables, highly nonlinear problem functions, and multiple local optima. It covers common issues in measuring performance and archiving solutions, and introduces the concept of dissimilarity archiving. The algorithms discussed in this course all employ some form of random search.




There are a great many optimization methods. Most of these are deterministic methods with some formal mathematical basis, broadly falling into two categories:
Optimality Criteria
The mathematical conditions for an optimal solution are established and then either:
Search Methods
Many search methods use first and second derivative (gradient and Hessian) information to direct the moves made.
Unfortunately, there are many optimization problems which cannot be satisfactorily solved using any deterministic optimization algorithms — inevitably many problems of practical interest fall into this category. The characteristics which make optimization problems difficult for deterministic algorithms are:
When these systematic search methods fail, one must resort to less conventional search techniques. There are a wide variety of such techniques. Many of them employ some form of random or stochastic search. Collectively these ‘unconventional’ optimization techniques are known as Heuristic Methods, where in this context:
A heuristic optimization method is a technique which seeks good (i.e. near-optimal) solutions at reasonable computational cost without being able to guarantee optimality.
The algorithms discussed in this course all employ some form of random search.
1.1 Performance Measures
It is not necessarily easy to measure the performance of a stochastic optimization method, because, unless exactly the same sequence of random numbers is used, the algorithm will not perform the same search on the same problem, even if given the same starting point. For this reason, before making any claims as to the performance of an algorithm on a given problem, several (at least 25, preferably 50 or more) runs should be made using different random number sequences (this is usually done by specifying different seeds to the random number generator used) and, if the starting point can affect the run, different initial solutions.
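As an illustration of the repeated-runs advice above, the following is a minimal sketch rather than a prescribed procedure. The objective `sphere`, the pure random search routine, the search box [-1, 1] and the summary statistics are all assumptions chosen for the example; only the idea of one run per seed comes from the text.

```python
import random
import statistics


def sphere(x):
    """Hypothetical test objective (to be minimized): sum of squares."""
    return sum(xi * xi for xi in x)


def random_search(objective, dim, evaluations, seed):
    """Pure random search on [-1, 1]^dim; returns the best objective value found."""
    rng = random.Random(seed)  # private generator, so a given seed reproduces the run
    best = float("inf")
    for _ in range(evaluations):
        x = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
        best = min(best, objective(x))
    return best


def repeated_runs(objective, dim=2, evaluations=200, n_runs=50):
    """Run the search once per seed and summarize the spread of the final results."""
    results = [random_search(objective, dim, evaluations, seed) for seed in range(n_runs)]
    return {
        "best": min(results),
        "worst": max(results),
        "mean": statistics.mean(results),
        "stdev": statistics.stdev(results),
    }


summary = repeated_runs(sphere)
```

Reporting the spread (best, worst, mean and standard deviation) rather than a single run makes it harder to mistake a lucky random number sequence for good algorithm performance.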
There are different ways in which the algorithm performance can be measured:
Figure 1.1: Hypothetical Performance Curves (objective function plotted against evaluations).
1.2.1 Best L Solutions
One obvious strategy is to store the best L solutions located (their control variable, objective function and constraint values), where L may be of the order of 25. This is easily implemented. The one potential disadvantage is that the best 25 solutions may well be very similar and therefore this ‘archive’ may not give very much information about the rest of the search space explored.
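A best-L archive of this kind might be sketched as follows. This is one possible implementation, not the course's own: the class name `BestLArchive`, the `offer` method and the use of a heap are assumptions made for illustration, and minimization of the objective is assumed.

```python
import heapq
import itertools


class BestLArchive:
    """Keeps the L lowest-objective solutions seen so far (minimization assumed)."""

    def __init__(self, capacity=25):
        self.capacity = capacity
        self._heap = []  # min-heap on -f, so the worst archived solution sits at index 0
        self._counter = itertools.count()  # tie-breaker, so solutions are never compared

    def offer(self, x, f, constraints=None):
        """Try to archive a solution; returns True if it was stored."""
        entry = (-f, next(self._counter), tuple(x), constraints)
        if len(self._heap) < self.capacity:
            heapq.heappush(self._heap, entry)
            return True
        if f < -self._heap[0][0]:  # better than the current worst archived solution
            heapq.heapreplace(self._heap, entry)
            return True
        return False

    def solutions(self):
        """Archived (x, f, constraints) tuples, best first."""
        return [(x, -neg_f, c) for neg_f, _, x, c in sorted(self._heap, reverse=True)]


archive = BestLArchive(capacity=3)
for value in [5.0, 1.0, 4.0, 2.0, 3.0]:
    archive.offer([value], f=value)
```

Keeping the worst archived solution at the top of the heap means each candidate is tested against it in constant time, which matters when every objective evaluation is offered to the archive.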
1.2.2 Best L Dissimilar Solutions
This disadvantage may be overcome by storing the best L solutions with a minimum level of dissimilarity. An obvious requirement is therefore a measure of dissimilarity between solutions. This is most readily defined in terms of the control variables. For instance, for continuous control variables, a simple measure of dissimilarity between two solutions $\mathbf{x}_A$ and $\mathbf{x}_B$ is:

$$D_{AB} = \sqrt{(\mathbf{x}_A - \mathbf{x}_B)^T (\mathbf{x}_A - \mathbf{x}_B)}, \qquad (1.1)$$

i.e. the Euclidean distance between the solutions in control variable space. If the individual control variables vary over significantly different ranges (for instance, $0\,\mathrm{m} \le x_1 \le 10\,\mathrm{m}$ and $0\,\mathrm{m} \le x_2 \le 0.001\,\mathrm{m}$), then it may be appropriate to rescale them so that within the optimization routine they can vary over the same range, e.g. $(0, 1)$ or $(-1, 1)$.

In addition, one needs to define two dissimilarity thresholds $D_{min}$ and $D_{sim}$, which are used as follows. Let $\mathbf{x}_K$, $K = 1, \ldots, l$, be the solutions currently in the archive and $\mathbf{x}_J$ the candidate solution. Let $\mathbf{x}_E$ be the archived solution most similar to $\mathbf{x}_J$, i.e. $D_{EJ} \le D_{KJ} \;\forall K = 1, \ldots, l$, and let $\mathbf{x}_G$ be the worst archived solution, i.e. $f(\mathbf{x}_G) \ge f(\mathbf{x}_K) \;\forall K = 1, \ldots, l$. Then $\mathbf{x}_J$ is archived in the following cases:

If $l < L$, archive $\mathbf{x}_J$ if $D_{KJ} > D_{min} \;\forall K = 1, \ldots, l$. (1.2)

If $l = L$, archive $\mathbf{x}_J$ if $D_{KJ} > D_{min} \;\forall K = 1, \ldots, L$ and $f(\mathbf{x}_J) < f(\mathbf{x}_G)$. (1.3) (Here $\mathbf{x}_J$ replaces $\mathbf{x}_G$ in the archive.)

If $D_{KJ} < D_{min}$ for some $K$, archive $\mathbf{x}_J$ if $f(\mathbf{x}_J) < f(\mathbf{x}_K) \;\forall K = 1, \ldots, l$, (1.4)

or, if it is not the best solution found so far, archive it if it is sufficiently similar to and better than $\mathbf{x}_E$: If $D_{KJ} < D_{min}$ for some $K$, archive $\mathbf{x}_J$ if $f(\mathbf{x}_J) < f(\mathbf{x}_E)$ and $D_{EJ} < D_{sim}$. (1.5) (In both cases $\mathbf{x}_J$ replaces $\mathbf{x}_E$ in the archive.)

Figures 1.2 and 1.3 show examples of this logic in action for the simple case of an optimization problem with just one control variable.

Using this logic a helpful picture of the search space can be built up. Obviously appropriate values of the dissimilarity thresholds $D_{min}$ and $D_{sim}$ will be problem dependent. It is clear that $D_{sim} < D_{min}$ and indeed $D_{sim}$ should probably be at least an order of magnitude smaller than $D_{min}$.
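The dissimilarity measure (1.1), the rescaling of control variables and the archiving logic built on the two thresholds could be sketched as below. This is a hedged illustration under stated assumptions, not the course's reference implementation: the function names, the list-of-pairs archive layout and the `capacity` parameter (standing for L) are hypothetical, minimization is assumed, and a candidate lying exactly at distance D_min from its nearest archived neighbour is treated as not sufficiently dissimilar, a boundary case the rules leave open.

```python
import math


def dissimilarity(xa, xb):
    """Euclidean distance between two solutions in control variable space (1.1)."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(xa, xb)))


def rescale(x, lower, upper):
    """Map each control variable from [lower_i, upper_i] onto (0, 1)."""
    return [(xi - lo) / (hi - lo) for xi, lo, hi in zip(x, lower, upper)]


def try_archive(archive, x_j, f_j, capacity, d_min, d_sim):
    """Offer candidate (x_j, f_j) to `archive`, a list of (x, f) pairs.

    Returns True if the candidate was archived (minimization assumed).
    """
    if not archive:  # nothing to be similar to, so the candidate is stored
        archive.append((x_j, f_j))
        return True

    distances = [dissimilarity(x_k, x_j) for x_k, _ in archive]
    e = min(range(len(archive)), key=distances.__getitem__)    # most similar, x_E
    g = max(range(len(archive)), key=lambda k: archive[k][1])  # worst archived, x_G

    if distances[e] > d_min:  # dissimilar to every archived solution
        if len(archive) < capacity:   # archive not full: store the candidate
            archive.append((x_j, f_j))
            return True
        if f_j < archive[g][1]:       # archive full: candidate replaces x_G
            archive[g] = (x_j, f_j)
            return True
        return False

    # Otherwise the candidate is within d_min of at least one archived solution.
    is_best_so_far = all(f_j < f_k for _, f_k in archive)
    improves_neighbour = f_j < archive[e][1] and distances[e] < d_sim  # as in (1.5)
    if is_best_so_far or improves_neighbour:
        archive[e] = (x_j, f_j)       # in both cases the candidate replaces x_E
        return True
    return False
```

Calling `rescale` on each solution before `try_archive` gives every control variable equal weight in the distance, so a single pair of thresholds can be used across variables with very different physical ranges.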
Figure 1.2: Dissimilarity Archiving, $D_{min}$ Threshold Met. Two panels plot $f(x)$ against $x$, each marking an interval of width $2D_{min}$. Archive Not Full: candidate solution archived. Archive Full: candidate solution archived, archive solution removed. Legend: solution in archive; candidate for archiving.