AI RESEARCH

Dynamic Context Evolution for Scalable Synthetic Data Generation

arXiv CS.LG

ArXi:2604.07147v1 Announce Type: cross Large language models produce repetitive output when prompted independently across many batches, a phenomenon we term cross-batch mode collapse: the progressive loss of output diversity when a language model is prompted repeatedly without access to its prior generations. Practitioners have long mitigated this with ad hoc deduplication and seed rotation, but no principled framework exists. We