AI RESEARCH

The Bandit's Blind Spot: The Critical Role of User State Representation in Recommender Systems

arXiv CS.LG

ArXi:2604.26651v1 Announce Type: cross With the increasing availability of online information, recommender systems have become an important tool for many web-based systems. Due to the continuous aspect of recommendation environments, these systems increasingly rely on contextual multi-armed bandits (CMAB) to deliver personalized and real-time suggestions. A critical yet underexplored component in these systems is the representation of user state, which typically encapsulates the user's interaction history and is deeply correlated with the model's decisions and learning.