heartcored98 / transformer_anatomy

Official PyTorch implementation of "Roles and Utilization of Attention Heads in Transformer-based Neural Language Models" (ACL 2020)

Related projects: