knotgrass / GriffinLinks
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
☆12Updated 11 months ago
Alternatives and similar repositories for Griffin
Users that are interested in Griffin are comparing it to the libraries listed below
Sorting:
- KAN for Vision Transformer☆253Updated last year
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling☆210Updated last month
- Variations of Kolmogorov-Arnold Networks☆115Updated last year
- An easy to use PyTorch implementation of the Kolmogorov Arnold Network and a few novel variations☆186Updated 11 months ago
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…☆115Updated last month
- ☆140Updated last year
- ☆129Updated 3 months ago
- Implementation of xLSTM in Pytorch from the paper: "xLSTM: Extended Long Short-Term Memory"☆118Updated 3 weeks ago
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.☆392Updated last year
- my attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture☆133Updated last year
- The best collection of AI tutorials to make you a boss of Data Science!☆106Updated 4 months ago
- Convolutional layer for Kolmogorov-Arnold Network (KAN)☆113Updated 7 months ago
- Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis☆259Updated 4 months ago
- xLSTM as Generic Vision Backbone☆488Updated last month
- Pytorch implementation of the xLSTM model by Beck et al. (2024)☆179Updated last year
- A modified CNN architecture using Kolmogorov-Arnold Networks☆84Updated last year
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆121Updated last year
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆195Updated 3 weeks ago
- Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)☆173Updated last year
- Boltzmann Attention Sampling for Image Analysis with Small Objects☆31Updated 3 months ago
- Minimal Mamba-2 implementation in PyTorch☆234Updated last year
- A repository to house some personal attempts to beat some state-of-the-art for medical datasets☆100Updated 2 years ago
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆229Updated last month
- A simple but robust PyTorch implementation of RetNet from "Retentive Network: A Successor to Transformer for Large Language Models" (http…☆106Updated last year
- Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).☆127Updated last year
- Code and Documentation for the first place solution in 2023 Abdominal Trauma Detection Competition hosted by RSNA on Kaggle.☆50Updated 2 years ago
- Kolmogorov–Arnold Networks with modified activation (using MLP to represent the activation)☆107Updated last month
- 2nd Place Solution for the RSNA 2023 Abdominal Trauma Detection Kaggle Competition☆40Updated last year
- Resources about xLSTM by Sepp Hochreiter☆317Updated last year
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆56Updated 3 weeks ago