DSGym: A holistic framework for evaluating and training data science agents

Together AI Blog
Machine Learning Generative AI Data Science AI Research

Introducing DSGym—a holisti evaluation and training framework for LLM-based data science agents. Features 90+ bioinformatics tasks, 92 Kaggle competitions, and synthetic trajectory generation. Our 4B model achieves state-of-the-art performance among open-source models through exe