AI RESEARCH
Information-Theoretic Generalization Bounds for Sequential Decision Making
arXiv CS.LG
•
ArXi:2605.12190v1 Announce Type: cross Information-theoretic generalization bounds based on the supersample construction are a central tool for algorithm-dependent generalization analysis in the batch i.i.d.~setting. However, existing supersample conditional mutual information (CMI) bounds do not directly apply to sequential decision-making problems such as online learning, streaming active learning, and bandits, where data are revealed adaptively and the learner evolves along a causal trajectory.