qiuzh20 / gated_attention
The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
897 · Dec 20, 2025 · Updated 3 months ago

Alternatives and similar repositories for gated_attention

Users that are interested in gated_attention are comparing it to the libraries listed below.
