stochastic LQ optimal control

A stochastic linear-quadratic (LQ) optimal control problem seeks to minimise a quadratic cost functional over adapted controls that drive a linear stochastic differential equation. The state equation takes the form $d x = [A x + B u + b] d t + \sum_{j} [C_{j} x + D_{j} u + σ_{j}] d W_{j}$ with initial condition $x (s) = y$ , and the cost functional is $J (s, y; u) = E {\int_{s}^{T} [(Q x, x) + 2 (S x, u) + (R u, u)] d t + (G x (T), x (T))}$ . The goal is to find an admissible control $\overset{u}{ˉ} (\cdot)$ that minimises $J$ .

The stochastic LQ problem is fundamentally different from its deterministic counterpart. In the deterministic case, the control weighting matrix $R (t)$ must be nonnegative definite for the problem to be well-posed. In the stochastic case, because the control can enter the diffusion term (through $D_{j}$ ), the “uncertainty cost” from the diffusion can compensate for negative control weighting. Concretely, the relevant quantity becomes $R (t) + \sum_{j} D_{j} (t)^{T} P (t) D_{j} (t)$ , where $P (t)$ solves the stochastic Riccati equation. This means stochastic LQ problems with indefinite — even negative definite — $R$ can be well-posed, provided the control’s influence on the noise creates sufficient implicit cost.

This observation has deep implications for financial applications. In the mean-variance portfolio selection problem, for instance, $R = 0$ (no direct control cost) but the problem is well-posed because the portfolio’s influence on wealth volatility creates an implicit cost. In XVA hedging, the Riccati system for XVA hedging inherits this structure: the hedging rates emerge from a modified Riccati equation where the control simultaneously affects drift and diffusion of the XVA position.

The problem is solved in three equivalent ways: (1) the stochastic maximum principle, which leads to the linear Hamiltonian system (a coupled FBSDE); (2) dynamic programming via the HJB equation; (3) the completion of squares technique. All three approaches produce the stochastic Riccati equation, and the optimal control takes a state feedback form $\overset{u}{ˉ} (t) = - Ψ (t) x (t) - Θ (t)$ when the Riccati equation is solvable.

Key Details

State equation: $d x = [A x + B u + b] d t + [C x + D u + σ] d W$ (one-dimensional Brownian motion case for simplicity)
Cost functional: Quadratic in state and control, with possibly indefinite weighting matrices $Q$ , $R$ , $G$
Standard case: $R ≫ 0$ , $Q - S^{T} R^{- 1} S \geq 0$ , $G \geq 0$ — always uniquely solvable
Key difference from deterministic: $R$ need not be nonnegative; the condition $R (t) + D (t)^{T} P (t) D (t) \geq 0$ replaces $R (t) \geq 0$
Weak formulation: The probability space and Brownian motion are part of the control, i.e., the 5-tuple $(Ω, F, P, W (\cdot), u (\cdot))$ is optimised over
Connection to XVA: The HJB equation for XVA under P under CARA utility with quadratic friction reduces to a stochastic LQ-type problem, whose Riccati equation yields closed-form hedging rates

Textbook References

Stochastic Controls - Hamiltonian Systems and HJB Equations (Yong & Zhou, 1999)

Definition 3.1 (p. 301): Finiteness, solvability, and pathwise unique solvability of Problem (SLQ)
Theorem 4.2 (p. 308): Finiteness implies $N_{r} \geq 0$ ; solvability equivalent to $N_{s} \overset{u}{ˉ} + H_{s} (y) = 0$ ; if $N_{s} ≫ 0$ the unique minimiser is $\overset{u}{ˉ} = - N_{s}^{- 1} H_{s} (y)$
Examples 3.2—3.4 (pp. 302—304): Demonstrations that $R = 0$ , $R < 0$ , and $G < 0$ can each lead to well-posed stochastic LQ problems (contrasting with deterministic impossibility)
Theorem 6.1 (p. 315): Solvability of the stochastic Riccati equation implies solvability of Problem (SLQ) with explicit state feedback control

concept

Alethograph

Explorer

stochastic LQ optimal control

Key Details

Textbook References

Stochastic Controls - Hamiltonian Systems and HJB Equations (Yong & Zhou, 1999)

Graph View

Table of Contents

Backlinks