Anthropic researchers detail “model spec midtraining”, which adds a stage between pretraining and fine-tuning to improve generalization from alignment training

r/artificial
Machine Learning Generative AI AI Research

Submitted by /u/tekz [link] [comments]