ag1988/dlr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ag1988/dlr)

ag1988 / dlr

The accompanying code for "Simplifying and Understanding State Space Models with Diagonal Linear RNNs" (Ankit Gupta, Harsh Mehta, Jonathan Berant).

☆23

Alternatives and similar repositories for dlr

Users that are interested in dlr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

eamartin / parallelizing_linear_rnns
View on GitHub
☆45Apr 30, 2018Updated 8 years ago
emalach / LinearLM
View on GitHub
Code for the paper: https://arxiv.org/pdf/2309.06979.pdf
☆21Jul 29, 2024Updated last year
Doraemonzzz / nanoTransNormer
View on GitHub
☆11Oct 11, 2023Updated 2 years ago
ag1988 / dss
View on GitHub
Sequence Modeling with Structured State Spaces
☆69Aug 2, 2022Updated 3 years ago
IdoAmos / not-from-scratch
View on GitHub
☆33Oct 22, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Eliyas0007 / Pytorch-Intention
View on GitHub
Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention
☆12May 24, 2023Updated 3 years ago
lindermanlab / S5
View on GitHub
☆324Jan 8, 2025Updated last year
FarnoushRJ / MambaLRP
View on GitHub
[NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models" 🐍
☆47Nov 6, 2024Updated last year
samblouir / birdie
View on GitHub
☆15Jun 8, 2026Updated last month
Doraemonzzz / hgru-pytorch
View on GitHub
☆29Jul 9, 2024Updated 2 years ago
assafbk / OPRM
View on GitHub
Overflow Prevention Enhances Long-Context Recurrent LLMs (COLM 2025)
☆18Jul 8, 2025Updated last year
catid / spectral_ssm
View on GitHub
Implementation of Spectral State Space Models
☆16Feb 23, 2024Updated 2 years ago
IBM / selective-dense-state-space-model
View on GitHub
Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on …
☆16Sep 18, 2025Updated 10 months ago
ShaharLutatiPersonal / OCD
View on GitHub
Official PyTorch Implementation
☆17Dec 3, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
NicolasZucchet / minimal-LRU
View on GitHub
Non official implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023)
☆62Sep 3, 2025Updated 10 months ago
SwiftieH / SpGAT
View on GitHub
Spectral Graph Attention Network with Fast Eigen-approximation
☆11Dec 24, 2021Updated 4 years ago
automl / DeltaProduct
View on GitHub
DeltaProduct is a new linear recurrent neural network architecture that uses products of generalized Householder matrices as state-transi…
☆15Oct 13, 2025Updated 9 months ago
ctlllll / SGConv
View on GitHub
☆165Jan 24, 2023Updated 3 years ago
ethanbar11 / ssm_2d
View on GitHub
More dimensions = More fun
☆26Jul 27, 2024Updated last year
jemisjoky / umps_code
View on GitHub
u-MPS implementation and experimentation code used in the paper Tensor Networks for Probabilistic Sequence Modeling (https://arxiv.org/ab…
☆19Jul 2, 2020Updated 6 years ago
goombalab / hydra
View on GitHub
Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"
☆175Jan 30, 2025Updated last year
AKiessner / TUHAbnormal-Expansion-dataset
View on GitHub
☆14Sep 5, 2023Updated 2 years ago
swiseman / neighbor-splicing
View on GitHub
☆11Jan 2, 2022Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
jungokasai / T2R
View on GitHub
☆14Nov 20, 2022Updated 3 years ago
OpenMOSE / RWKV-Infer
View on GitHub
A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…
☆51Oct 21, 2025Updated 9 months ago
tatsu-lab / mlm_inductive_bias
View on GitHub
Code Release for "On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies"
☆16Apr 13, 2021Updated 5 years ago
OpenNLPLab / HGRN2
View on GitHub
HGRN2: Gated Linear RNNs with State Expansion
☆58Aug 20, 2024Updated last year
BlinkDL / LinearAttentionArena
View on GitHub
Here we will test various linear attention designs.
☆62Apr 25, 2024Updated 2 years ago
GabMartino / TransformerForDummies
View on GitHub
Annotated implementation of vanilla Transformers to guide through all the ambiguities.
☆10Jun 20, 2025Updated last year
deep-spin / sparse_continuous_distributions
View on GitHub
This repository provides open-source code for sparse continuous distributions and corresponding Fenchel-Young losses.
☆15May 10, 2023Updated 3 years ago
ben-hayes / neural-field-synth
View on GitHub
NASH 2021 project... this may or may not end up working 🤷‍♂️
☆12Dec 19, 2021Updated 4 years ago
Benjamin-Walker / selective-ssms-and-linear-cdes
View on GitHub
Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)
☆17Jan 7, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
FranxYao / RDP
View on GitHub
Implementation of ICML 22 Paper: Scaling Structured Inference with Randomization
☆13Jul 24, 2022Updated 3 years ago
dangxingyu / rnn-icrag
View on GitHub
Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"
☆27Apr 17, 2024Updated 2 years ago
glassroom / heinsen_attention
View on GitHub
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
☆25Jun 6, 2024Updated 2 years ago
a-carson / ddsp-phaser
View on GitHub
☆18Apr 8, 2024Updated 2 years ago
christhetree / mod_discovery
View on GitHub
Source code for "Modulation Discovery with Differentiable Digital Signal Processing".
☆15Mar 25, 2026Updated 3 months ago
XuezheMax / fairseq-apollo
View on GitHub
FairSeq repo with Apollo optimizer
☆113Dec 20, 2023Updated 2 years ago
ansonb / FeTA_TMLR
View on GitHub
This repository reproduces the results in the paper "How expressive are transformers in spectral domain for graphs?"(published in TMLR)
☆11Jul 10, 2022Updated 4 years ago