Saddle Points & Critical Points
Visualize and analyze local minima, maxima, and saddle points of multivariable functions.
Saddle Points & Critical Points
Concept Overview
In calculus, understanding the behavior of functions at specific points is essential for optimization and analysis. For a function of multiple variables, points where the gradient (the vector of first partial derivatives) is zero, or undefined, are known as critical points. At these points, the tangent plane to the surface is horizontal. While a critical point can be a local minimum (a "valley") or a local maximum (a "peak"), it can also be a saddle point—a point where the surface curves upwards in one direction and downwards in another, resembling a horse's saddle.
Mathematical Definition
For a scalar function of two variables, f(x, y), a point (a, b) is a critical point if both partial derivatives are zero:
∂f/∂y (a, b) = 0
or ∇f(a, b) = [0, 0]
To classify the critical point, we use the Second Derivative Test, which relies on the discriminant (or Hessian determinant), denoted as D:
The nature of the critical point depends on the value of D and the second partial derivative with respect to x, fxx:
Key Concepts
- Local Minimum: Occurs when D > 0 and fxx > 0. The surface curves upwards in all directions from the critical point.
- Local Maximum: Occurs when D > 0 and fxx < 0. The surface curves downwards in all directions from the critical point.
- Saddle Point: Occurs when D < 0. The point is a local maximum in one direction and a local minimum in another. The classic example is the hyperbolic paraboloid f(x, y) = x2 - y2.
- Inconclusive Test: If D = 0, the Second Derivative Test is inconclusive. Higher-order tests or alternative analysis methods are required to determine the point's nature. An example is the "monkey saddle" given by f(x, y) = x3 - 3xy2, which has three distinct "dips" for a monkey's legs and tail.
Historical Context
The formal study of critical points and extrema dates back to Pierre de Fermat's work on optimization in the 17th century. Fermat established the method of setting the derivative of a function to zero to find its maximum and minimum values, which laid the groundwork for the calculus of variations and optimization theory.
The extension of these concepts to multiple variables, including the classification using the Hessian matrix, was developed significantly by mathematicians such as Joseph-Louis Lagrange and Carl Gustav Jacob Jacobi in the 18th and 19th centuries. The Hessian matrix itself is named after the German mathematician Ludwig Otto Hesse, who formulated many of its properties in the mid-19th century.
Real-world Applications
- Machine Learning: Loss landscapes of complex neural networks are riddled with saddle points rather than local minima. Optimizing algorithms (like stochastic gradient descent) must be designed to effectively navigate and escape these saddle points to converge on a useful minimum.
- Physics & Mechanics: Saddle points represent states of unstable equilibrium. In dynamical systems, a marble balanced perfectly in a saddle-shaped bowl will easily roll off if perturbed slightly.
- Economics & Game Theory: In zero-sum games, a saddle point represents a minimax solution—an equilibrium point where neither player can improve their outcome by unilaterally changing their strategy.
- Chemistry: In transition state theory, the transition state of a chemical reaction is mathematically a saddle point on the potential energy surface. It represents the highest energy point along the reaction coordinate but a minimum relative to perpendicular coordinates.
Related Concepts
- Gradient and Contour Plots — Visualizing slopes and elevation lines of multivariable functions
- Lagrange Multipliers — Finding constrained maxima and minima
- Taylor Series — Multi-variable Taylor series approximations use gradients and Hessians
Experience it interactively
Adjust parameters, observe in real time, and build deep intuition with Riano’s interactive Saddle Points & Critical Points module.
Try Saddle Points & Critical Points on Riano →