Tag: Activation Steering
All the articles with the tag "Activation Steering".
-
Author-Style Steering via Contrastive Activation Vectors
A complete, reproducible pipeline for author-style steering in language models using contrastive activation vectors across mid-level transformer layers.