eamartin/parallelizing_linear_rnns

eamartin / parallelizing_linear_rnns

☆45

Alternatives and similar repositories for parallelizing_linear_rnns

Users who are interested in parallelizing_linear_rnns compare it to the repositories listed below.

proger / nanokitchen
Parallel Associative Scan for Language Models
☆18 · Jan 8, 2024 · Updated 2 years ago
maximzubkov / fft-scan
Efficient PScan implementation in PyTorch
☆17 · Jan 2, 2024 · Updated 2 years ago
subho406 / agalite
AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)
☆23 · Oct 15, 2024 · Updated last year
ag1988 / dlr
The accompanying code for "Simplifying and Understanding State Space Models with Diagonal Linear RNNs" (Ankit Gupta, Harsh Mehta, Jonatha…
☆23 · Dec 30, 2022 · Updated 3 years ago
google-deepmind / spectral_ssm
☆35 · Apr 12, 2024 · Updated last year
irhum / hyena
JAX/Flax implementation of the Hyena Hierarchy
☆34 · Apr 27, 2023 · Updated 2 years ago
Doraemonzzz / hgru-pytorch
☆29 · Jul 9, 2024 · Updated last year
jungokasai / T2R
☆14 · Nov 20, 2022 · Updated 3 years ago
glassroom / heinsen_sequence
Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023)
☆98 · Dec 5, 2024 · Updated last year
lxxue / prefix_sum
A PyTorch wrapper of parallel exclusive scan in CUDA
☆12 · May 25, 2023 · Updated 2 years ago
HazyResearch / embroid
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification
☆11 · Aug 12, 2023 · Updated 2 years ago
rycolab / aflt-f2023
Advanced Formal Language Theory (263-5352-00L; Spring 2023)
☆10 · Feb 21, 2023 · Updated 3 years ago
Noahs-ARK / PaLM
PyTorch implementation for PaLM: A Hybrid Parser and Language Model.
☆10 · Jan 7, 2020 · Updated 6 years ago
OpenNLPLab / HGRN
[NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…
☆67 · Apr 24, 2024 · Updated last year
amirzandieh / HyperAttention
Triton Implementation of HyperAttention Algorithm
☆48 · Dec 11, 2023 · Updated 2 years ago
bdusell / stack-attention
Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"
☆18 · Mar 15, 2024 · Updated last year
machine-discovery / deer
Parallelizing non-linear sequential models over the sequence length
☆56 · Jun 23, 2025 · Updated 8 months ago
sustcsonglin / flash-linear-rnn
Implementations of various linear RNN layers using PyTorch and Triton
☆54 · Aug 4, 2023 · Updated 2 years ago
radarFudan / mamba-minimal-jax
☆35 · Nov 22, 2024 · Updated last year
habanero-lab / APPy
APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…
☆30 · Jan 28, 2026 · Updated last month
dangxingyu / rnn-icrag
Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"
☆27 · Apr 17, 2024 · Updated last year
srush / tangent
Source-to-Source Debuggable Derivatives in Pure Python
☆15 · Jan 23, 2024 · Updated 2 years ago
GindaChen / FlexFlashAttention3
FlexAttention w/ FlashAttention3 Support
☆27 · Oct 5, 2024 · Updated last year
kazuki-irie / kv-memory-brain
Official Code Repository for the paper "Key-value memory in the brain"
☆31 · Feb 25, 2025 · Updated last year
zhangjiong724 / spectral-RNN
Stabilizing Gradients for Deep Neural Networks via Efficient SVD Parameterization
☆16 · Jun 5, 2018 · Updated 7 years ago
radarFudan / Curse-of-memory
Curse-of-memory phenomenon of RNNs in sequence modelling
☆19 · May 8, 2025 · Updated 9 months ago
sjelassi / transformers_ssm_copy
☆36 · Feb 26, 2024 · Updated 2 years ago
proger / accelerated-scan
Accelerated First Order Parallel Associative Scan
☆194 · Jan 7, 2026 · Updated last month
yikangshen / megablocks
☆20 · May 30, 2024 · Updated last year
machelreid / editpro
Learning to Model Editing Processes
☆26 · Aug 3, 2025 · Updated 6 months ago
kotoba-tech / kotomamba
Mamba training library developed by kotoba technologies
☆71 · Feb 11, 2024 · Updated 2 years ago
renll / SeqBoat
[NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling
☆40 · Dec 2, 2023 · Updated 2 years ago
acosharma / elita-transformer
Official Repository for Efficient Linear-Time Attention Transformers.
☆18 · Jun 2, 2024 · Updated last year
ethansmith2000 / TransformerExperiments
☆19 · Dec 4, 2025 · Updated 2 months ago
fla-org / fla-zoo
Flash-Linear-Attention models beyond language
☆21 · Aug 28, 2025 · Updated 6 months ago
cyk1337 / Highway-Transformer
[ACL'20] Highway Transformer: A Gated Transformer.
☆33 · Dec 5, 2021 · Updated 4 years ago
HazyResearch / flash-fft-conv
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
☆343 · Dec 28, 2024 · Updated last year
Doraemonzzz / xmixers
Xmixers: A collection of SOTA efficient token/channel mixers
☆28 · Sep 4, 2025 · Updated 5 months ago
jlin816 / homegrid
A minimal home grid world environment to evaluate language understanding in interactive agents.
☆24 · Sep 6, 2023 · Updated 2 years ago