Calculus of Variations: The Geometry of Functional Optimization

The Calculus of Variations (CoV) represents a transition from the optimization of finite-dimensional vectors to the optimization of infinite-dimensional objects—functions. While standard differential calculus identifies critical points $x \in R^{n}$ of a function $f (x)$ , CoV identifies “critical functions” $y (x)$ that extremize a functional $J [y]$ . This field is the mathematical foundation of Lagrangian mechanics, general relativity, and minimal surface theory.

1. Functionals and the Domain of Optimization

A functional $J$ is a mapping from a function space $Y$ (typically a Sobolev space or $C^{k}$ space) to the real numbers. The canonical form encountered in physics and geometry is the integral functional:

$J [y] = \int_{x_{1}}^{x_{2}} L (x, y (x), y^{'} (x)) d x$

Here, $L (x, y, y^{'})$ is the Lagrangian. The domain is usually restricted by Dirichlet boundary conditions: $y (x_{1}) = y_{1}$ and $y (x_{2}) = y_{2}$ . The goal is to find $y \in Y$ such that $J [y]$ is a local extremum.

2. The First Variation and Stationary Conditions

To find an extremum, we consider a variation $y (x) \to y (x) + ϵη (x)$ , where $ϵ$ is a small parameter and $η (x)$ is a smooth “test function” satisfying $η (x_{1}) = η (x_{2}) = 0$ . The variation of the functional is defined as the Gâteaux derivative:

$δ J [y; η] = \frac{d}{d ϵ} J [y + ϵη]_{ϵ = 0}$

For $y$ to be a stationary point, we require $δ J = 0$ for all admissible $η$ . Using the chain rule:

$δ J = \int_{x_{1}}^{x_{2}} [\frac{\partial L}{\partial y} η (x) + \frac{\partial L}{\partial y ^{'}} η^{'} (x)] d x = 0$

Applying integration by parts to the second term:

$\int_{x_{1}}^{x_{2}} \frac{\partial L}{\partial y ^{'}} η^{'} d x = [\frac{\partial L}{\partial y ^{'}} η]_{x_{1}}^{x_{2}} - \int_{x_{1}}^{x_{2}} \frac{d}{d x} (\frac{\partial L}{\partial y ^{'}}) η d x$

Since $η$ vanishes at the boundaries, the first term disappears. The condition $δ J = 0$ becomes:

$\int_{x_{1}}^{x_{2}} (\frac{\partial L}{\partial y} - \frac{d}{d x} \frac{\partial L}{\partial y ^{'}}) η (x) d x = 0$

3. The Euler-Lagrange Equation

By the Fundamental Lemma of the Calculus of Variations, if the integral of $f (x) η (x)$ is zero for every smooth $η$ with compact support, then $f (x)$ must be zero. This yields the Euler-Lagrange Equation:

$\frac{\partial L}{\partial y} - \frac{d}{d x} (\frac{\partial L}{\partial y ^{'}}) = 0$

This second-order differential equation is the necessary condition for $y (x)$ to be an extremizer.

The Beltrami Identity

When the Lagrangian $L$ has no explicit dependence on $x$ ( $\partial L / \partial x = 0$ ), the E-L equation admits a first integral:

$L - y^{'} \frac{\partial L}{\partial y ^{'}} = constant$

In physics, this constant often represents the conservation of energy.

4. Classical Variational Problems

4.1 Geodesics: Shortest Paths

A geodesic is a curve that extremizes the distance functional $S = \int d s$ . On a plane, $d s^{2} = d x^{2} + d y^{2}$ , so $L = 1 + (y^{'})^{2}$ . The E-L equation reduces to $\frac{d}{d x} (\frac{y ^{'}}{1 + y ^{'2}}) = 0$ , implying $y^{'}$ is constant—a straight line. On curved manifolds, geodesics are governed by the metric tensor $g_{ij}$ and the Christoffel symbols.

4.2 The Brachistochrone

The problem of finding the curve $y (x)$ that minimizes the time of travel for a mass sliding under gravity. Using conservation of energy $v = 2 g y$ , the time functional is:

$T [y] = \int \frac{1 + ( y ^{'} ) ^{2}}{2 g y} d x$

Applying the Beltrami Identity to $L = (1 + (y^{'})^{2}) / y$ leads to the differential equation for a cycloid.

5. Constraints and Lagrange Multipliers

In “Isoperimetric” problems, we extremize $J [y]$ subject to a constraint $G [y] = \int M (x, y, y^{'}) d x = C$ . We construct the augmented Lagrangian:

$\overset{ˉ}{L} = L + λ M$

where $λ$ is a constant Lagrange multiplier. An example is finding the shape of a hanging chain (the Catenary), which minimizes gravitational potential energy subject to a fixed length.

6. From Lagrangian to Hamiltonian Dynamics

The transition to Hamiltonian mechanics involves a Legendre transform. We define the generalized momentum:

$p = \frac{\partial L}{\partial y ^{'}}$

The Hamiltonian is defined as $H (x, y, p) = p y^{'} - L$ . This transforms the second-order E-L equation into a system of two first-order equations:

$\frac{d y}{d x} = \frac{\partial H}{\partial p}, \frac{d p}{d x} = - \frac{\partial H}{\partial y}$

This “canonical” form is central to quantum mechanics and statistical field theory.

7. Noether’s Theorem: Symmetry and Conservation

Noether’s Theorem states that for every continuous symmetry of the action $S = \int L d t$ , there is a corresponding conserved quantity.

Suppose $L$ is invariant under a transformation $y \to y + ϵ ψ$ . Then: $\frac{d L}{d ϵ} = \frac{\partial L}{\partial y} ψ + \frac{\partial L}{\partial y ^{'}} ψ^{'} = 0$ Substituting the E-L equation $\frac{\partial L}{\partial y} = \frac{d}{d x} \frac{\partial L}{\partial y ^{'}}$ : $\frac{d}{d x} (\frac{\partial L}{\partial y ^{'}}) ψ + \frac{\partial L}{\partial y ^{'}} ψ^{'} = \frac{d}{d x} (\frac{\partial L}{\partial y ^{'}} ψ) = 0$ Thus, $Q = \frac{\partial L}{\partial y ^{'}} ψ$ is a constant of motion.

8. The Second Variation and Legendre’s Condition

To distinguish between a minimum and a maximum, we look at $δ^{2} J$ . A necessary condition for a minimum is Legendre’s Condition:

$\frac{\partial ^{2} L}{\partial y ^{'2}} \geq 0$

If $L_{y^{'} y^{'}} < 0$ along the path, the stationary point is a local maximum.

9. Python Implementation: Numerical Shooting Method

Analytic solutions are rare. Here we solve the Brachistochrone ODE $y^{''} = - \frac{1 + ( y ^{'} ) ^{2}}{2 y}$ using a boundary value problem solver.

import numpy as np
from scipy.integrate import solve_bvp
import matplotlib.pyplot as plt

def brachistochrone_ode(x, y):
    # y[0] is position, y[1] is derivative y'
    # Adding a small epsilon to avoid division by zero at y=0
    return np.vstack((y[1], -(1 + y[1]**2) / (2 * (y[0] + 1e-6))))

def boundary_conditions(ya, yb):
    # Start at height 1.0, end at height 0.2
    return np.array([ya[0] - 1.0, yb[0] - 0.2])

x_nodes = np.linspace(0, 1, 100)
y_initial = np.linspace(1, 0.2, 100).reshape(1, -1)
yp_initial = np.zeros((1, 100))
y_guess = np.vstack((y_initial, yp_initial))

sol = solve_bvp(brachistochrone_ode, boundary_conditions, x_nodes, y_guess)

if sol.success:
    plt.plot(sol.x, sol.y[0], label='Numerical Brachistochrone')
    plt.gca().invert_yaxis()
    plt.legend()
    plt.show()

Conceptual Check

Under what specific condition is the Beltrami Identity (L - y'L_y' = C) a valid first integral?

Conceptual Check

Which symmetry of the Lagrangian corresponds to the conservation of linear momentum via Noether's Theorem?

Conceptual Check

Legendre's necessary condition for a functional to have a local minimum is given by:

Conceptual Check

Calculus of Variations

Calculus of Variations: The Geometry of Functional Optimization

1. Functionals and the Domain of Optimization

2. The First Variation and Stationary Conditions

3. The Euler-Lagrange Equation

The Beltrami Identity

4. Classical Variational Problems

4.1 Geodesics: Shortest Paths

4.2 The Brachistochrone

5. Constraints and Lagrange Multipliers

6. From Lagrangian to Hamiltonian Dynamics

7. Noether’s Theorem: Symmetry and Conservation

8. The Second Variation and Legendre’s Condition

9. Python Implementation: Numerical Shooting Method

Under what specific condition is the Beltrami Identity (L - y'L_y' = C) a valid first integral?

Which symmetry of the Lagrangian corresponds to the conservation of linear momentum via Noether's Theorem?

Legendre's necessary condition for a functional to have a local minimum is given by:

In the context of constraints, how is the updated Euler-Lagrange equation formed for a functional subject to G[y] = C?