HazyResearch/prefix-linear-attention

☆62

Alternatives and similar repositories for prefix-linear-attention

Users who are interested in prefix-linear-attention are comparing it to the libraries listed below.

HazyResearch / based
Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"
☆256 · Jun 6, 2025 · Updated last year
iesl / s-diora
☆12 · Jan 29, 2021 · Updated 5 years ago
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆278 · Mar 22, 2026 · Updated 4 months ago
OpenNLPLab / HGRN
[NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…
☆68 · Apr 24, 2024 · Updated 2 years ago
simran-arora / focus
This repo contains code for the paper: "Can Foundation Models Help Us Achieve Perfect Secrecy?"
☆24 · Feb 9, 2023 · Updated 3 years ago
berlino / seq_icl
☆54 · May 20, 2024 · Updated 2 years ago
proger / nanokitchen
Parallel Associative Scan for Language Models
☆18 · Jan 8, 2024 · Updated 2 years ago
sjelassi / transformers_ssm_copy
☆40 · Feb 26, 2024 · Updated 2 years ago
BlinkDL / LinearAttentionArena
Here we will test various linear attention designs.
☆62 · Apr 25, 2024 · Updated 2 years ago
OpenSparseLLMs / MoM
☆139 · Feb 4, 2026 · Updated 5 months ago
fla-org / hybrid-distillation
☆34 · Dec 31, 2025 · Updated 6 months ago
emalach / LinearLM
Code for the paper: https://arxiv.org/pdf/2309.06979.pdf
☆21 · Jul 29, 2024 · Updated last year
HazyResearch / train-tk
train with kittens!
☆67 · Oct 25, 2024 · Updated last year
dame-cell / Triformer
Transformers components but in Triton
☆34 · May 9, 2025 · Updated last year
timvieira / vocrf
Variable-order CRFs with structure learning
☆17 · Aug 1, 2024 · Updated last year
RL10x / RetNet
an implementation of paper "Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf
☆11 · Jul 25, 2023 · Updated 3 years ago
Zyphra / Zamba2
PyTorch implementation of models from the Zamba2 series.
☆193 · Jan 23, 2025 · Updated last year
declare-lab / EFLA
Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics
☆76 · Mar 26, 2026 · Updated 4 months ago
goombalab / hydra
Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"
☆175 · Jan 30, 2025 · Updated last year
OpenNLPLab / HGRN2
HGRN2: Gated Linear RNNs with State Expansion
☆58 · Aug 20, 2024 · Updated last year
stanfordnlp / multi-distribution-retrieval
Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval
☆17 · Jan 16, 2024 · Updated 2 years ago
siyuanseever / llama2Rnn.c
☆13 · Apr 15, 2024 · Updated 2 years ago
EleutherAI / rnngineering
Engineering the state of RNN language models (Mamba, RWKV, etc.)
☆33 · May 25, 2024 · Updated 2 years ago
test-time-training / ttt-tk
☆45 · Nov 1, 2025 · Updated 8 months ago
IBM / selective-dense-state-space-model
Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on …
☆16 · Sep 18, 2025 · Updated 10 months ago
ethansmith2000 / TransformerExperiments
☆19 · Dec 4, 2025 · Updated 7 months ago
Hprairie / Bi-Mamba2
A Triton Kernel for incorporating Bi-Directionality in Mamba2
☆83 · Dec 18, 2024 · Updated last year
facebookresearch / concurrentqa
This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems."
☆47 · Jul 18, 2024 · Updated 2 years ago
lucidrains / gateloop-transformer
Implementation of GateLoop Transformer in Pytorch and Jax
☆93 · Jun 18, 2024 · Updated 2 years ago
Jellyfish042 / RWKV_Othello
A specialized RWKV-7 model for Othello (a.k.a. Reversi) that predicts legal moves, evaluates positions, and performs in-context search. It…
☆44 · Jan 25, 2025 · Updated last year
Leooyii / LCEG
[COLM'25] A Controlled Study on Long Context Extension and Generalization in LLMs
☆65 · Mar 9, 2026 · Updated 4 months ago
howard-hou / RWKV-X
RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…
☆59 · Mar 31, 2026 · Updated 3 months ago
fla-org / flame
🔥 A minimal training framework for scaling FLA models
☆403 · Apr 22, 2026 · Updated 3 months ago
OpenMOSE / RWKV-Infer
A large-scale RWKV v7 (World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states (Pseudo MoE). Easy to deploy…
☆51 · Oct 21, 2025 · Updated 9 months ago
maximzubkov / fft-scan
Efficient PScan implementation in PyTorch
☆17 · Jan 2, 2024 · Updated 2 years ago
nanowell / Q-Sparse-LLM
My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated
☆37 · Aug 14, 2024 · Updated last year
Cranial-XIX / longhorn
Official PyTorch Implementation of the Longhorn Deep State Space Model
☆57 · Dec 4, 2024 · Updated last year
Lyun0912-wu / LongAttn
LongAttn: Selecting Long-context Training Data via Token-level Attention
☆15 · Jul 16, 2025 · Updated last year
OpenSparseLLMs / Linearization
☆71 · Jul 8, 2025 · Updated last year