AllenAI has been iterating on their MolmoAct2 models for robotics
r/LocalLLaMA
•
Robotics
R/AllenAI is cooking with MolmoAct2, a 5B vision-language-action model for robot control. They keep releasing new fine-tunes on different kinds of robotics datasets, including (but not limited to, and they keep releasing new ones): - general robotics tasks - interactive robotics tasks - absolute joint-pose control - also absolute joint-pose control AllenAI has released these as fully open source models, publishing not only their weights but also their complete