Extrema#

Optimization is about finding extrema, which, depending on the application, could be minima or maxima. When defining extrema, it is necessary to consider the set of inputs over which we’re optimizing.

Unconstrained vs. Constrained Optimization#

The set \(\mathcal{X} \subseteq \mathbb{R}^d\) of inputs over which we optimize is called the feasible set. If \(\mathcal{X}\) is the entire domain of the function being optimized (as it often will be for our purposes), we say that the problem is unconstrained. Otherwise the problem is constrained and may be much harder to solve, depending on the nature of the feasible set.

The following example contrasts minimizing the function \(f(x,y)=(x-1)^2+2(y-2)^2\) over all of \(\mathbb{R}^2\) with minimizing it over the subset \(\mathcal{X}=\{(x,y) \mid x^2+y^2\leq 4\}\). Because the unconstrained minimizer lies outside \(\mathcal{X}\), the constrained minimizer lies on the boundary of the feasible region.

../_images/baff56f540aa4faf5de8da584f9ab8f27c4662b35fde776b7067be137d4ddd21.png

The level curves (contours) of the cost function \(f(x,y)=(x-1)^2+2(y-2)^2\).
The unconstrained minimum (red) at \((1,2)\), which lies outside the feasible disk.
The feasible set (orange disk) defined by \(x^2+y^2 \le 2^2\).
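The constrained minimum (blue), which lies on the disk boundary at approximately \((0.83, 1.82)\). Because \(f\) weights the \(y\)-direction more heavily than the \(x\)-direction, this point is close to, but not identical with, the Euclidean projection of the unconstrained solution onto the disk, which is approximately \((0.89, 1.79)\).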
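The two problems in this example can also be compared numerically. The following is a minimal sketch, not the code used to produce the figure: it assumes plain gradient descent for the unconstrained problem and projected gradient descent (a gradient step followed by projection back onto the disk) for the constrained one. The function names, step size, iteration count, and starting point are all illustrative choices.

```python
import math

def f(x, y):
    """Cost function from the example."""
    return (x - 1) ** 2 + 2 * (y - 2) ** 2

def grad_f(x, y):
    """Gradient of f."""
    return 2 * (x - 1), 4 * (y - 2)

def project_onto_disk(x, y, radius=2.0):
    """Euclidean projection onto the feasible set x^2 + y^2 <= radius^2."""
    norm = math.hypot(x, y)
    if norm <= radius:
        return x, y
    return radius * x / norm, radius * y / norm

def minimize(constrained, step=0.1, iters=2000):
    """Gradient descent from the origin; projects after each step if constrained."""
    x, y = 0.0, 0.0
    for _ in range(iters):
        gx, gy = grad_f(x, y)
        x, y = x - step * gx, y - step * gy
        if constrained:
            x, y = project_onto_disk(x, y)
    return x, y

x_unc, y_unc = minimize(constrained=False)
x_con, y_con = minimize(constrained=True)

print("unconstrained minimizer:", (round(x_unc, 4), round(y_unc, 4)))
print("constrained minimizer:  ", (round(x_con, 4), round(y_con, 4)))
print("distance of constrained minimizer from origin:", round(math.hypot(x_con, y_con), 4))
```

The unconstrained iterates settle at \((1,2)\), while the constrained iterates settle on the circle of radius 2, where the cost is necessarily higher than at the unconstrained minimum.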

Local vs. Global Extrema#

Suppose \(f : \mathbb{R}^d \to \mathbb{R}\). A point \(\mathbf{x}\) is said to be a local minimum (resp. local maximum) of \(f\) in \(\mathcal{X}\) if \(f(\mathbf{x}) \leq f(\mathbf{y})\) (resp. \(f(\mathbf{x}) \geq f(\mathbf{y})\)) for all \(\mathbf{y}\) in some neighborhood \(N \subseteq \mathcal{X}\) about \(\mathbf{x}\).

Furthermore, if \(f(\mathbf{x}) \leq f(\mathbf{y})\) for all \(\mathbf{y} \in \mathcal{X}\), then \(\mathbf{x}\) is a global minimum of \(f\) in \(\mathcal{X}\) (similarly for global maximum). If the phrase “in \(\mathcal{X}\)” is unclear from context, assume we are optimizing over the whole domain of the function.

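The qualifier strict (as in, e.g., a strict local minimum) means that the inequality in the definition holds strictly for every \(\mathbf{y} \neq \mathbf{x}\): \(f(\mathbf{x}) < f(\mathbf{y})\) for a minimum, or \(f(\mathbf{x}) > f(\mathbf{y})\) for a maximum, with equality not allowed. A strict extremum is therefore unique within some neighborhood.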
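To make the strict versus non-strict distinction concrete, here is a small sketch that checks the defining inequality at sampled points around a candidate. The helper `is_local_min` and the two test functions are hypothetical names introduced for illustration. A sampling check like this can only suggest, not prove, that a point is a local minimum.

```python
def is_local_min(f, x, radius=0.5, samples=1000, strict=False):
    """Heuristic check of the local-minimum inequality on sampled neighbors of x."""
    fx = f(x)
    for k in range(1, samples + 1):
        offset = radius * k / samples
        for y in (x - offset, x + offset):  # y != x by construction
            if strict and not fx < f(y):
                return False
            if not strict and not fx <= f(y):
                return False
    return True

def bowl(x):
    """x^2: a single lowest point at 0."""
    return x * x

def flat_valley(x):
    """Zero on [-1, 1] and rising outside: a whole interval of minimizers."""
    return max(abs(x) - 1.0, 0.0)

print(is_local_min(bowl, 0.0), is_local_min(bowl, 0.0, strict=True))                # True True
print(is_local_min(flat_valley, 0.0), is_local_min(flat_valley, 0.0, strict=True))  # True False
```

The point 0 is a local minimum of both functions, but only for `bowl` is it strict: in `flat_valley`, nearby points attain the same value, so the minimizer is not unique in any neighborhood.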

Here’s an example visualization of the one‑dimensional function \(f(x)=0.1x^2+\sin(3x)\), which has several “valleys” and “peaks” that correspond to local minima and maxima:

../_images/ea148ec56783187c9d033ae7a3d8646d35b1e291df6e99ecb80f86e83cc7110a.png
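The valleys of this function can be located approximately with a simple grid scan. The sketch below assumes the interval \([-6, 6]\) and a fixed grid spacing, both illustrative choices rather than anything specified by the plot. It treats a grid point as an approximate local minimum when it is lower than both of its neighbors, then picks the lowest of these as the approximate global minimum on that interval.

```python
import math

def f(x):
    """The one-dimensional example function."""
    return 0.1 * x ** 2 + math.sin(3 * x)

# Uniform grid over [-6, 6]; interval and resolution are assumptions for illustration.
n = 12001
xs = [-6 + 12 * i / (n - 1) for i in range(n)]
ys = [f(x) for x in xs]

# Interior grid points that are lower than both neighbors approximate local minima.
local_minima = [
    (xs[i], ys[i])
    for i in range(1, n - 1)
    if ys[i] < ys[i - 1] and ys[i] < ys[i + 1]
]

# The lowest local minimum is the approximate global minimum on this interval.
global_min = min(local_minima, key=lambda p: p[1])

for x, y in local_minima:
    label = "global" if (x, y) == global_min else "local"
    print(f"{label:6s} minimum near x = {x:.3f}, f(x) = {y:.4f}")
```

Every valley in the plot shows up as a local minimum, but only the deepest one is the global minimum. The quadratic term \(0.1x^2\) raises the valleys farther from the origin, which is why the global minimum sits in the valley closest to \(x = 0\).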
