Multi-property Steering of Large Language Models with Dynamic Activation Composition

Publication
Proceedings of the 7th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP
Daniel Scalena
Daniel Scalena
PhD Student
Gabriele Sarti
Gabriele Sarti
PhD Student