sail-sg / Attention-Sink

[ATTRIB @ NeurIPS 2024] When Attention Sink Emerges in Language Models: An Empirical View
29Updated last month

Related projects

Alternatives and complementary repositories for Attention-Sink