LLM text data is drying up, but Meta points to unlabeled video as the next massive training frontier

The Decoder
Generative AI AI Research

A research team from Meta FAIR and New York University trained a multimodal AI model from scratch and found that several common assumptions about how these models should be built don't hold up. The article LLM text data is drying up, but Meta points to unlabeled video as the next massive