qiuzh20 / gated_attentionLinks

The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
81Updated 2 weeks ago

Alternatives and similar repositories for gated_attention

Users that are interested in gated_attention are comparing it to the libraries listed below

Sorting: