AI RESEARCH

Beyond Steering Vector: Flow-based Activation Steering for Inference-Time Intervention

arXiv CS.LG

ArXi:2605.05892v1 Announce Type: cross Activation steering has emerged as a promising alternative for controlling language-model behavior at inference time by modifying intermediate representations while keeping model parameters frozen. However, large-scale evaluations such as AxBench show that existing steering methods are often outperformed by simple in-context prompting and generalize poorly to unseen concepts.