peytontolbert / GriffinLinks
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
☆13Updated last year
Alternatives and similar repositories for Griffin
Users that are interested in Griffin are comparing it to the libraries listed below
Sorting:
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆55Updated 2 months ago
- HGRN2: Gated Linear RNNs with State Expansion☆54Updated 9 months ago
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…☆21Updated last week
- ☆31Updated 7 months ago
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆36Updated 8 months ago
- ☆23Updated 8 months ago
- Implemenation of the HIERarchical imagionation On Structured State Space Sequence Models (HIEROS) paper☆16Updated 10 months ago
- ☆26Updated 10 months ago
- This code implements a Radial Basis Function (RBF) based Kolmogorov-Arnold Network (KAN) for function approximation.☆28Updated 11 months ago
- Hierarchical State Space Models☆47Updated last year
- DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025)☆28Updated last month
- PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.☆82Updated last year
- Pytorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)☆74Updated last year
- ☆43Updated 4 months ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14Updated last week
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆33Updated 7 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆18Updated 3 months ago
- Self contained pytorch implementation of a sinkhorn based router, for mixture of experts or otherwise☆35Updated 9 months ago
- Official Code Repository for the paper "Key-value memory in the brain"☆26Updated 3 months ago
- Unofficial implementation of Linear Recurrent Units, by Deepmind, in Pytorch☆69Updated last month
- Explorations into improving ViTArc with Slot Attention☆41Updated 7 months ago
- Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces☆40Updated last year
- ☆29Updated 6 months ago
- C++ and Cuda ops for fused FourierKAN☆78Updated last year
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient"☆29Updated 4 years ago
- Griffin MQA + Hawk Linear RNN Hybrid☆87Updated last year
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆62Updated 11 months ago
- Implementation of xLSTM in Pytorch from the paper: "xLSTM: Extended Long Short-Term Memory"☆119Updated 2 months ago
- State Space Models☆67Updated last year
- Non-official implementation of "Attention as an RNN" from https://arxiv.org/pdf/2405.13956, efficient associative parallel prefix scan an…☆26Updated 10 months ago