AI RESEARCH

AI Steerability 360: A Toolkit for Steering Large Language Models

arXiv CS.CL

ArXi:2603.07837v1 Announce Type: new The AI Steerability 360 toolkit is an extensible, open-source Python library for steering LLMs. Steering abstractions are designed around four model control surfaces: input (modification of the prompt), structural (modification of the model's weights or architecture), state (modification of the model's activations and attentions), and output (modification of the decoding or generation process