The Hamilton-Jacobi-Bellman equation for XVA under the real-world measure P arises from applying the dynamic programming principle to the desk’s CARA utility maximisation problem. Unlike the standard Q-world XVA formulation, where adjustments are conditional expectations under the risk-neutral measure, the P-world formulation embeds the credit-risk premium into the hedging decision, producing materially different credit-hedge targets.
The exact HJB for the total derivative MTM under P is (eq. 5 of BUETGOLFOUSE2026):
The key structural features are: (i) the generator uses the real-world drift , not the risk-neutral drift; (ii) the default jump involves (actual default probability), not ; (iii) the discount is from book-funding, not the risk-free rate.
Decomposition into clean price and XVA adjustment
Writing where solves the risk-neutral PDE , the HJB for (eq. 8) contains:
- Book-funding discount:
- Carry terms:
- Real-world drift correction:
- Diffusive variance penalty:
- -gradient cross-terms (vanish at leading order in perturbative expansion)
- Exponential default jump (linearised to at leading order)
Key Details
- The equilibrium credit-hedge level depends on , not — this embeds the credit-risk premium into the hedging decision
- The real-world drift enters the delta target, generating carry absent from Q-world formulations
- For investment-grade names with : the P-world credit-hedge target is approximately half the Q-world level
- The self-consistency condition ( appears on both sides of eq. 3) is resolved by the HJB, which gives as a PDE solution (Remark 2)
Critical Notes
Empirical sensitivity
The quantitative significance of working under P vs Q depends on the ratio , which requires decomposing credit spreads into default probability and risk premium. This decomposition is model-dependent (structural vs. reduced-form calibration, CDS-bond basis assumptions) and empirically contested. The paper’s illustrative ratio of 2-3 is plausible for IG names but not universal.
Relationship to existing work
The existing vault notes on xVA BSDE decomposition and GNOATTO2020 work entirely under Q. The P-world formulation here is genuinely different in structure (CARA utility, not replication), but the comparison with Q-world results needs care: the replication-based Q-world framework is model-free up to credit/funding assumptions, whereas the P-world framework requires specifying , risk aversion , and the utility function.
Textbook References
Continuous-Time Stochastic Control and Optimization with Financial Applications (Pham, 2009)
- Section 6.6.1 (pp. 162-165): The standard CARA framework that the XVA paper extends. Pham derives the BSDE for exponential utility maximisation with option payoff in a complete market: generator , optimal control (Theorem 6.6.10, p. 164). The XVA paper’s HJB (eq. 5) is the multi-dimensional, friction-laden, default-jump extension of this clean-market result
- Theorem 6.4.5 (p. 148): BSDE-control duality via Fenchel-Legendre transform — the BSDE solution equals the value function of a stochastic control problem over linear BSDEs. The XVA paper’s use of Legendre conjugates for the friction terms follows this pattern
- Theorem 6.4.7 (p. 151): Recovery of the HJB equation from the BSDE and stochastic maximum principle. Establishes the HJB BSDE control chain that the XVA paper traverses in Sections 3-4