publications

publications by categories in reversed chronological order

2024

  1. A gentle push funziona benissimo: making instructed models in Italian via contrastive activation steering
    Daniel Scalena, Elisabetta Fersini, and Malvina Nissim
    2024
  2. duckSteering.jpeg
    Multi-property Steering of Large Language Models with Dynamic Activation Composition
    Daniel Scalena, Gabriele Sarti, and Malvina Nissim
    In Proceedings of the 7th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP, Nov 2024

2023

  1. LetTheModelRespond.png
    Let the Models Respond: Interpreting Language Model Detoxification Through the Lens of Prompt Dependence
    Daniel Scalena, Gabriele Sarti, Malvina Nissim, and 1 more author
    Nov 2023
  2. mind_logo.png
    MIND at SemEval-2023 Task 11: From Uncertain Predictions to Subjective Disagreement
    Giulia Rizzi, Alessandro Astorino, Daniel Scalena, and 2 more authors
    In Proceedings of the The 17th International Workshop on Semantic Evaluation (SemEval-2023), Jul 2023

2021

  1. Tecniche di Natural Language Processing per il riconoscimento dei discorsi d’odio sui social network
    Daniel Scalena
    Jul 2021