AI RESEARCH

Revisiting the Scaling Properties of Downstream Metrics in Large Language Model Training

Apple Machine Learning Research • March 26, 2026

While scaling laws for Large Language Models (LLMs) traditionally focus on proxy metrics like pre