AI RESEARCH
Beyond Steering Vector: Flow-based Activation Steering for Inference-Time Intervention
arXiv CS.LG
•
ArXi:2605.05892v1 Announce Type: cross Activation steering has emerged as a promising alternative for controlling language-model behavior at inference time by modifying intermediate representations while keeping model parameters frozen. However, large-scale evaluations such as AxBench show that existing steering methods are often outperformed by simple in-context prompting and generalize poorly to unseen concepts.