Meta is about to release a pixel space model (Tuna-2)

There's a catch, though, they break it on purpose and want you to fix it: "Due to organizational policy constraints, we are unable to release the full production-trained model weights. To the research community, we plan to release a foundation checkpoint with a small number of layers removed from both the LLM backbone and the diffusion head (flow head). The remaining layers and all other components (vision encoder, projections, embeddings, etc.) are fully preserved.