knotgrass / Griffin
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
☆12 · Updated 8 months ago
Alternatives and similar repositories for Griffin
Users who are interested in Griffin are comparing it to the libraries listed below.
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling ☆204 · Updated 3 weeks ago
- ☆128 · Updated 3 weeks ago
- Variations of Kolmogorov-Arnold Networks ☆115 · Updated last year
- ☆136 · Updated last year
- Convolutional layer for Kolmogorov-Arnold Network (KAN) ☆106 · Updated 5 months ago
- An easy to use PyTorch implementation of the Kolmogorov Arnold Network and a few novel variations ☆188 · Updated 9 months ago
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models" ☆56 · Updated 2 weeks ago
- my attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture ☆131 · Updated last year
- Training small GPT-2 style models using Kolmogorov-Arnold networks. ☆121 · Updated last year
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…