A Primer on the Inner Workings of Transformer-based Language Models

Publication
Arxiv Preprint
Gabriele Sarti
Gabriele Sarti
PhD Student
Arianna Bisazza
Arianna Bisazza
Associate Professor