AI RESEARCH
Beyond Softmax: A Natural Parameterization for Categorical Random Variables
arXiv CS.LG
•
ArXi:2509.24728v2 Announce Type: replace Latent categorical variables are frequently found in deep learning architectures. They can model actions in discrete reinforcement-learning environments, represent categories in latent-variable models, or express relations in graph neural networks. Despite their widespread use, their discrete nature poses significant challenges to gradient-descent learning algorithms. While a substantial body of work has offered improved gradient estimation techniques, we take a complementary approach.