AI RESEARCH
A Spectral Revisit of the Distributional Bellman Operator under the Cram\'er Metric
arXiv CS.LG
•
ArXi:2603.12576v1 Announce Type: new Distributional reinforcement learning (DRL) studies the evolution of full return distributions under Bellman updates rather than focusing on expected values. A classical result is that the distributional Bellman operator is contractive under the Cram\'er metric, which corresponds to an $L^2$ geometry on differences of cumulative distribution functions (CDFs