dtunai/Griffin-Jax

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dtunai/Griffin-Jax)

dtunai / Griffin-Jax

Jax implementation of "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"

☆16

Alternatives and similar repositories for Griffin-Jax

Users that are interested in Griffin-Jax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

alkaidpku / DQ-ToolQA
View on GitHub
☆10Nov 15, 2023Updated 2 years ago
kyegomez / Griffin
View on GitHub
Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"
☆58Oct 27, 2025Updated 9 months ago
OpenNLPLab / HGRN
View on GitHub
[NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…
☆68Apr 24, 2024Updated 2 years ago
sustcsonglin / mamba-triton
View on GitHub
☆52Jan 28, 2024Updated 2 years ago
EleutherAI / rnngineering
View on GitHub
Engineering the state of RNN language models (Mamba, RWKV, etc.)
☆33May 25, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
berlino / gated_linear_attention
View on GitHub
☆107Mar 9, 2024Updated 2 years ago
catid / spectral_ssm
View on GitHub
Implementation of Spectral State Space Models
☆16Feb 23, 2024Updated 2 years ago
yale-nlp / InstruSum
View on GitHub
☆23Feb 26, 2024Updated 2 years ago
tobiaskatsch / GatedLinearRNN
View on GitHub
☆30Feb 27, 2024Updated 2 years ago
srush / mamba-scans
View on GitHub
Blog post
☆17Feb 16, 2024Updated 2 years ago
johanwind / wind_rwkv
View on GitHub
☆27Feb 26, 2026Updated 5 months ago
OpenNLPLab / HGRN2
View on GitHub
HGRN2: Gated Linear RNNs with State Expansion
☆58Aug 20, 2024Updated last year
BlinkDL / LinearAttentionArena
View on GitHub
Here we will test various linear attention designs.
☆62Apr 25, 2024Updated 2 years ago
Benjamin-Walker / selective-ssms-and-linear-cdes
View on GitHub
Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)
☆17Jan 7, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
jxiw / MambaByte
View on GitHub
[CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model
☆27Oct 12, 2024Updated last year
Leooyii / LCEG
View on GitHub
[COLM'25] A Controlled Study on Long Context Extension and Generalization in LLMs
☆65Mar 9, 2026Updated 4 months ago
albertfgu / awesome-ssm-ml
View on GitHub
☆14May 30, 2024Updated 2 years ago
glassroom / heinsen_attention
View on GitHub
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
☆25Jun 6, 2024Updated 2 years ago
Doraemonzzz / hgru2-pytorch
View on GitHub
☆24Sep 25, 2024Updated last year
FieteLab / MESH
View on GitHub
☆15Jul 12, 2022Updated 4 years ago
rdyro / tune-jax
View on GitHub
Microbenchmarking hyperparameter tuning for JAX functions.
☆22Jul 7, 2026Updated 3 weeks ago
kazuki-irie / kv-memory-brain
View on GitHub
Official Code Repository for the paper "Key-value memory in the brain"
☆32Feb 25, 2025Updated last year
expz / annotated-hyena
View on GitHub
An annotated implementation of the Hyena Hierarchy paper
☆34May 28, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
niklaswretblad / the-effects-of-noise-in-text-to-SQL
View on GitHub
Code for the paper "Understanding the Effects of Noise in Text-to-SQL: An Examination of the BIRD-Bench Benchmark".
☆19May 14, 2026Updated 2 months ago
google-deepmind / spectral_ssm
View on GitHub
☆35Apr 12, 2024Updated 2 years ago
sjelassi / transformers_ssm_copy
View on GitHub
☆40Feb 26, 2024Updated 2 years ago
NicolasZucchet / minimal-LRU
View on GitHub
Non official implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023)
☆63Sep 3, 2025Updated 10 months ago
proger / hippogriff
View on GitHub
Griffin MQA + Hawk Linear RNN Hybrid
☆89Apr 13, 2026Updated 3 months ago
Tangshengku / Bi-Mamba
View on GitHub
The official implementation of Bi-Mamba
☆17Oct 22, 2025Updated 9 months ago
catid / dataloader
View on GitHub
High-performance tokenized language data-loader for Python C++ extension
☆15Jul 22, 2024Updated 2 years ago
uthree / fastersvc
View on GitHub
☆27Mar 20, 2024Updated 2 years ago
lucidrains / hyena-dna
View on GitHub
Fork of HyenaDNA, a long-range genomic foundation model built with Hyena
☆10Aug 14, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
google-deepmind / xtr
View on GitHub
XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval
☆64Jun 20, 2024Updated 2 years ago
Eliyas0007 / Pytorch-Intention
View on GitHub
Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention
☆12May 24, 2023Updated 3 years ago
radarFudan / mamba-minimal-jax
View on GitHub
☆36Nov 22, 2024Updated last year
jeffeuxMartin / meta-learning-hlp
View on GitHub
A publishing website of a table collecting meta-learning-related papers in the area of human language processing.
☆17Aug 2, 2021Updated 4 years ago
luojie1024 / MossQA-mnbvc
View on GitHub
本项目主要对开源的MOSS SFT数据进行整理，转换成mnbvc多轮对话格式。MOSS-003涵盖用性、忠实性、无害性三个层面，共353w样本，MOSS-003 包含更细粒度的有用性类别标记、更广泛的无害性数据和更长对话轮数，共630w样本，
☆13Dec 3, 2023Updated 2 years ago
Princeton-SysML / kNNLM_privacy
View on GitHub
Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888
☆37Jun 10, 2024Updated 2 years ago
akbir / deq-jax
View on GitHub
[NeurIPS'19] Deep Equilibrium Models Jax Implementation
☆43Oct 26, 2020Updated 5 years ago