DSGym: A holistic framework for evaluating and training data science agents
Together AI Blog
•
Machine Learning
Generative AI
Data Science
AI Research
Introducing DSGym—a holisti evaluation and training framework for LLM-based data science agents. Features 90+ bioinformatics tasks, 92 Kaggle competitions, and synthetic trajectory generation. Our 4B model achieves state-of-the-art performance among open-source models through exe