A stochastic linear-quadratic (LQ) optimal control problem seeks to minimise a quadratic cost functional over adapted controls that drive a linear stochastic differential equation. The state equation takes the form with initial condition , and the cost functional is . The goal is to find an admissible control that minimises .

The stochastic LQ problem is fundamentally different from its deterministic counterpart. In the deterministic case, the control weighting matrix must be nonnegative definite for the problem to be well-posed. In the stochastic case, because the control can enter the diffusion term (through ), the “uncertainty cost” from the diffusion can compensate for negative control weighting. Concretely, the relevant quantity becomes , where solves the stochastic Riccati equation. This means stochastic LQ problems with indefinite — even negative definite — can be well-posed, provided the control’s influence on the noise creates sufficient implicit cost.

This observation has deep implications for financial applications. In the mean-variance portfolio selection problem, for instance, (no direct control cost) but the problem is well-posed because the portfolio’s influence on wealth volatility creates an implicit cost. In XVA hedging, the Riccati system for XVA hedging inherits this structure: the hedging rates emerge from a modified Riccati equation where the control simultaneously affects drift and diffusion of the XVA position.

The problem is solved in three equivalent ways: (1) the stochastic maximum principle, which leads to the linear Hamiltonian system (a coupled FBSDE); (2) dynamic programming via the HJB equation; (3) the completion of squares technique. All three approaches produce the stochastic Riccati equation, and the optimal control takes a state feedback form when the Riccati equation is solvable.

Key Details

  • State equation: (one-dimensional Brownian motion case for simplicity)
  • Cost functional: Quadratic in state and control, with possibly indefinite weighting matrices , ,
  • Standard case: , , — always uniquely solvable
  • Key difference from deterministic: need not be nonnegative; the condition replaces
  • Weak formulation: The probability space and Brownian motion are part of the control, i.e., the 5-tuple is optimised over
  • Connection to XVA: The HJB equation for XVA under P under CARA utility with quadratic friction reduces to a stochastic LQ-type problem, whose Riccati equation yields closed-form hedging rates

Textbook References

Stochastic Controls - Hamiltonian Systems and HJB Equations (Yong & Zhou, 1999)

  • Definition 3.1 (p. 301): Finiteness, solvability, and pathwise unique solvability of Problem (SLQ)
  • Theorem 4.2 (p. 308): Finiteness implies ; solvability equivalent to ; if the unique minimiser is
  • Examples 3.2—3.4 (pp. 302—304): Demonstrations that , , and can each lead to well-posed stochastic LQ problems (contrasting with deterministic impossibility)
  • Theorem 6.1 (p. 315): Solvability of the stochastic Riccati equation implies solvability of Problem (SLQ) with explicit state feedback control

concept