TianjinYellow / SPAM-Optimizer
☆34 · Updated 4 months ago
Alternatives and similar repositories for SPAM-Optimizer
Users who are interested in SPAM-Optimizer are comparing it to the libraries listed below.
- [ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate" ☆118 · Updated last month
- ☆13 · Updated 6 months ago
- ☆83 · Updated 11 months ago
- Remasking Discrete Diffusion Models with Inference-Time Scaling ☆36 · Updated 4 months ago
- Work in progress. ☆70 · Updated last month
- ☆81 · Updated last week
- [ICLR 2025] Official PyTorch implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia… ☆25 · Updated last week
- The evaluation framework for training-free sparse attention in LLMs ☆86 · Updated last month
- ☆23 · Updated last week
- Official PyTorch implementation of "The Curse of Depth in Large Language Models" by Wenfang Sun, Xinyuan Song, Pengxiang Li, Lu Yin, Yefen…