sail-sg / Attention-Sink

[ATTRIB @ NeurIPS 2024] When Attention Sink Emerges in Language Models: An Empirical View
27Updated 3 weeks ago

Related projects

Alternatives and complementary repositories for Attention-Sink