Getting Started with Adversarial Attacks on VLMs/VLAs for Humanoid Robots (Master’s Thesis Advice Needed)

r/LocalLLaMA
Generative AI Robotics AI Safety AI Hardware

Hey everyone, I’m currently working on my master’s thesis on AI security for humanoid robots, with a focus on adversarial attacks for VLMs/VLAs. I’ve had some initial exposure to jailbreaking LLMs, but when it comes to VLMs and VLAs, I’m pretty new and honestly a bit unsure how to properly get started. Right now I have access to an NVIDIA Jetson Thor, and I was thinking about starting with an unaligned model for red teaming purposes, then later moving on to building defenses. I’m also considering using NVIDIA Cosmos Reason 2 as a starting point.