haonan3 / AnchorContextLinks
AnchorAttention: Improved attention for LLMs long-context training
☆213Updated 10 months ago
Alternatives and similar repositories for AnchorContext
Users that are interested in AnchorContext are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆142Updated 4 months ago
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"☆105Updated last month
- Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas