AI RESEARCH
To Augment or Not to Augment? Diagnosing Distributional Symmetry Breaking
arXiv CS.LG
•
ArXi:2510.01349v2 Announce Type: replace Symmetry-aware methods for machine learning, such as data augmentation and equivariant architectures, encourage correct model behavior on all transformations (e.g. rotations or permutations) of the original dataset. These methods can improve generalization and sample efficiency, under the assumption that the transformed datapoints are highly probable, or "important", under the test distribution. In this work, we develop a method for critically evaluating this assumption.