knotgrass / GriffinLinks
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
☆13 Updated last year
Alternatives and similar repositories for Griffin
Users who are interested in Griffin are comparing it to the repositories listed below.
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling ☆213 Updated last week
- ☆131 Updated 6 months ago
- Variations of Kolmogorov-Arnold Networks ☆116 Updated last year
- KAN for Vision Transformer ☆255 Updated last year
- An easy to use PyTorch implementation of the Kolmogorov Arnold Network and a few novel variations ☆189 Updated last year
- Training small GPT-2 style models using Kolmogorov-Arnold networks. ☆121 Updated last year
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in PyTorch and Ze… ☆120 Updated last week
- CUDA implementation of Extended Long Short Term Memory (xLSTM) with C++ and PyTorch ports ☆91 Updated last year
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines. ☆402 Updated last year
- Convolutional layer for Kolmogorov-Arnold Network (KAN) ☆115 Updated 10 months ago
- my attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture ☆134 Updated last year
- ☆140 Updated last year
- A repository to house some personal attempts to beat some state-of-the-art for medical datasets ☆101 Updated 2 years ago
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models" ☆56 Updated 3 months ago
- State Space Models