AI RESEARCH

A Spectral Revisit of the Distributional Bellman Operator under the Cram\'er Metric

arXiv CS.LG

ArXi:2603.12576v1 Announce Type: new Distributional reinforcement learning (DRL) studies the evolution of full return distributions under Bellman updates rather than focusing on expected values. A classical result is that the distributional Bellman operator is contractive under the Cram\'er metric, which corresponds to an $L^2$ geometry on differences of cumulative distribution functions (CDFs