AI RESEARCH

Barriers to Counterfactual Credit Attribution for Autoregressive Models

arXiv CS.LG

ArXi:2605.01425v1 Announce Type: new Generative AI disrupts the practice of giving credit to work that came before. Ideally, a generative model would give credit to any work on which its output depends in a significant way. \emph{Counterfactual credit attribution} (CCA) is a technical condition formalizing this goal--a relaxation of differential privacy--recently We initiate the study of CCA generative models. Specifically, we consider autoregressive models giving credit to a deployment-time dataset (e.g., a RAG database.