qiuzh20 / gated_attention
View external linksLinks

The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
842Dec 20, 2025Updated last month

Alternatives and similar repositories for gated_attention

Users that are interested in gated_attention are comparing it to the libraries listed below

Sorting:

Are these results useful?