HJB equation for XVA under P

The Hamilton-Jacobi-Bellman equation for XVA under the real-world measure P arises from applying the dynamic programming principle to the desk’s CARA utility maximisation problem. Unlike the standard Q-world XVA formulation, where adjustments are conditional expectations under the risk-neutral measure, the P-world formulation embeds the credit-risk premium into the hedging decision, producing materially different credit-hedge targets.

The exact HJB for the total derivative MTM $v$ under P is (eq. 5 of BUETGOLFOUSE2026):

$0 = \partial_{t} v + L_{t}^{P} v + (δ_{t} - r_{t}^{repo}) S_{t} θ_{t} + (f_{t} - c_{t}) C_{t} - f_{t} v + φ_{mkt}^{*} (\partial_{θ} v) + φ_{cr}^{*} (\partial_{q} v) + φ_{C}^{*} (\partial_{C} v + 1) - \frac{γ}{2} ∥ Σ^{⊤} \nabla_{y} v + θ_{t} σ_{S, t} ∥^{2} + λ_{t}^{P} \frac{e ^{γ ζ_{C} (v - C_{t} - h_{C} q_{t})^{+}} - 1}{γ}$

The key structural features are: (i) the generator $L_{t}^{P}$ uses the real-world drift $μ_{S, t}^{P}$ , not the risk-neutral drift; (ii) the default jump involves $λ^{P}$ (actual default probability), not $λ^{Q}$ ; (iii) the discount is $- f_{t} v$ from book-funding, not the risk-free rate.

Decomposition into clean price and XVA adjustment

Writing $v = w_{0} + u$ where $w_{0}$ solves the risk-neutral PDE $\partial_{t} w_{0} + L_{t}^{Q} w_{0} - r_{0} w_{0} = 0$ , the HJB for $u$ (eq. 8) contains:

Book-funding discount: $- f_{t} u$
Carry terms: $- (f_{t} - r_{0}) (w_{0} - C_{t}) - (c_{t} - r_{0}) C_{t}$
Real-world drift correction: $β_{t} \partial_{S} (w_{0} + u)$
Diffusive variance penalty: $- \frac{γ}{2} ∥ Σ^{⊤} \nabla_{y} u + θ_{t} σ_{S, t} ∥^{2}$
$w_{0}$ -gradient cross-terms (vanish at leading order in perturbative expansion)
Exponential default jump (linearised to $λ^{P} ζ_{C} u$ at leading order)

Key Details

The equilibrium credit-hedge level $q^{tar}$ depends on $λ^{P}$ , not $λ^{Q}$ — this embeds the credit-risk premium into the hedging decision
The real-world drift $β_{t} = μ_{S, t}^{P} - (r_{t}^{repo} - δ_{t}) S_{t} = σ_{S, t}^{⊤} ϑ_{t}$ enters the delta target, generating carry absent from Q-world formulations
For investment-grade names with $λ^{Q} / λ^{P} \approx 2$ : the P-world credit-hedge target is approximately half the Q-world level
The self-consistency condition ( $v$ appears on both sides of eq. 3) is resolved by the HJB, which gives $v$ as a PDE solution (Remark 2)

Critical Notes

Empirical sensitivity

The quantitative significance of working under P vs Q depends on the ratio $λ^{Q} / λ^{P}$ , which requires decomposing credit spreads into default probability and risk premium. This decomposition is model-dependent (structural vs. reduced-form calibration, CDS-bond basis assumptions) and empirically contested. The paper’s illustrative ratio of 2-3 is plausible for IG names but not universal.

Relationship to existing work

The existing vault notes on xVA BSDE decomposition and GNOATTO2020 work entirely under Q. The P-world formulation here is genuinely different in structure (CARA utility, not replication), but the comparison with Q-world results needs care: the replication-based Q-world framework is model-free up to credit/funding assumptions, whereas the P-world framework requires specifying $λ^{P}$ , risk aversion $γ$ , and the utility function.

Textbook References

Continuous-Time Stochastic Control and Optimization with Financial Applications (Pham, 2009)

Section 6.6.1 (pp. 162-165): The standard CARA framework that the XVA paper extends. Pham derives the BSDE for exponential utility maximisation with option payoff $ξ$ in a complete market: generator $f (t, z) = - z \cdot b_{t} / σ_{t} - ∣ b_{t} / σ_{t} ∣^{2} / (2 η)$ , optimal control $\overset{α}{^}_{t} = (Z_{t} + b_{t} / (η σ_{t})) / σ_{t}$ (Theorem 6.6.10, p. 164). The XVA paper’s HJB (eq. 5) is the multi-dimensional, friction-laden, default-jump extension of this clean-market result
Theorem 6.4.5 (p. 148): BSDE-control duality via Fenchel-Legendre transform — the BSDE solution $Y$ equals the value function of a stochastic control problem over linear BSDEs. The XVA paper’s use of Legendre conjugates $φ_{∙}^{*}$ for the friction terms follows this pattern
Theorem 6.4.7 (p. 151): Recovery of the HJB equation from the BSDE and stochastic maximum principle. Establishes the HJB $\to$ BSDE $\to$ control chain that the XVA paper traverses in Sections 3-4

concept

Alethograph

Explorer

HJB equation for XVA under P

Decomposition into clean price and XVA adjustment

Key Details

Critical Notes

Textbook References

Continuous-Time Stochastic Control and Optimization with Financial Applications (Pham, 2009)

Graph View

Table of Contents

Backlinks