srush / annotated-s4
Implementation of "The Annotated S4" (https://srush.github.io/annotated-s4), a step-by-step, executable walkthrough of the S4 state space model.
☆487 · Updated 2 years ago
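For orientation, the core recipe the tutorial builds up can be sketched in a few lines of JAX: discretize a diagonal state space model with zero-order hold, then run it as a linear recurrence over the input. This is a minimal sketch, not the repository's code; the function names, toy shapes, and step size are all illustrative.

```python
import jax
import jax.numpy as jnp

def discretize(A, B, step):
    # Zero-order-hold discretization of a *diagonal* continuous SSM:
    #   A_bar = exp(step * A),  B_bar = (A_bar - 1) / A * B
    A_bar = jnp.exp(step * A)
    B_bar = (A_bar - 1.0) / A * B
    return A_bar, B_bar

def ssm_scan(A_bar, B_bar, C, u):
    # Linear recurrence: x_k = A_bar * x_{k-1} + B_bar * u_k,  y_k = <C, x_k>
    def body(x, u_k):
        x = A_bar * x + B_bar * u_k
        return x, jnp.dot(C, x)
    _, ys = jax.lax.scan(body, jnp.zeros_like(A_bar), u)
    return ys

# Toy usage: state size 4, scalar input sequence of length 16.
kA, kC, ku = jax.random.split(jax.random.PRNGKey(0), 3)
A = -jnp.abs(jax.random.normal(kA, (4,)))  # stable (negative real) dynamics
B = jnp.ones((4,))
C = jax.random.normal(kC, (4,))
y = ssm_scan(*discretize(A, B, 0.1), C, jax.random.normal(ku, (16,)))
print(y.shape)  # (16,)
```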
Alternatives and similar repositories for annotated-s4:
Users interested in annotated-s4 are comparing it to the libraries listed below.
- ☆287 · Updated 3 months ago
- Annotated version of the Mamba paper ☆478 · Updated last year
- ☆175 · Updated 10 months ago
- Sequence modeling with Mega. ☆295 · Updated 2 years ago
- Implementation of Rotary Embeddings, from the RoFormer paper, in PyTorch ☆661 · Updated 4 months ago
- For optimization algorithm research and development. ☆504 · Updated this week
- Structured state space sequence models ☆2,601 · Updated 8 months ago
- Language Modeling with the H3 State Space Model ☆519 · Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and JAX ☆564 · Updated this week
- Accelerated First Order Parallel Associative Scan; a minimal sketch of the parallel-scan trick follows this list ☆180 · Updated 7 months ago
- Implementation of Block Recurrent Transformer in PyTorch ☆218 · Updated 7 months ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena ☆203 · Updated last year
- Simple, minimal implementation of the Mamba SSM in one PyTorch file, using logcumsumexp (the "Heisen sequence" trick); see the scan sketch after this list ☆112 · Updated 5 months ago
- Long Range Arena for Benchmarking Efficient Transformers ☆749 · Updated last year
- ☆344 · Updated 11 months ago
- Reading list for research topics in state-space models ☆274 · Updated 2 months ago
- ☆164 · Updated 2 years ago
- ☆172 · Updated 4 months ago
- ☆255 · Updated 2 years ago
- Unofficial JAX implementations of deep learning research papers ☆154 · Updated 2 years ago
- Implementation of ST-MoE, the latest incarnation of MoE after years of research at Brain, in PyTorch ☆325 · Updated 9 months ago
- Code for our NeurIPS 2022 paper ☆367 · Updated 2 years ago
- ☆165 · Updated last year
- Implementation of a memory-efficient multi-head attention, as proposed in the paper "Self-attention Does Not Need O(n²) Memory"; a chunked single-head sketch follows this list ☆374 · Updated last year
- ☆215 · Updated 8 months ago
- Helpful tools and examples for working with flex-attention ☆707 · Updated this week
- Named tensors with first-class dimensions for PyTorch ☆319 · Updated last year
- VQ-VAEs, Gumbel-Softmaxes, and friends ☆559 · Updated 3 years ago
- A curated list of awesome discrete diffusion model resources ☆285 · Updated 2 months ago
- Neural Networks and the Chomsky Hierarchy ☆205 · Updated 11 months ago
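Two of the entries above, the accelerated associative scan and the one-file Mamba built on logcumsumexp, rest on the same observation: the first-order recurrence x_k = a_k * x_{k-1} + b_k can be computed in logarithmic parallel depth, because composing the affine maps x -> a*x + b is associative. A minimal JAX sketch of that trick; the names are illustrative, not any listed repository's API.

```python
import jax
import jax.numpy as jnp

def combine(left, right):
    # Compose two affine maps x -> a*x + b. Composition is associative,
    # which is what lets the scan run in O(log n) parallel depth.
    a_l, b_l = left
    a_r, b_r = right
    return a_l * a_r, a_r * b_l + b_r

def linear_recurrence(a, b):
    # Returns x with x[k] = a[k] * x[k-1] + b[k] and x[-1] = 0.
    _, x = jax.lax.associative_scan(combine, (a, b))
    return x

a = jnp.full((8,), 0.5)
b = jnp.ones((8,))
print(linear_recurrence(a, b))  # matches the sequential loop: 1, 1.5, 1.75, ...
```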
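Similarly, the memory-efficient attention entry refers to the streaming-softmax idea from "Self-attention Does Not Need O(n²) Memory": process keys and values in chunks while carrying a running max, numerator, and denominator, so the full n x n score matrix is never materialized. A single-head, unmasked sketch with illustrative names and chunk size:

```python
import jax
import jax.numpy as jnp

def chunked_attention(q, k, v, chunk=128):
    n, d = k.shape
    m = jnp.full((q.shape[0],), -jnp.inf)      # running max of scores
    num = jnp.zeros((q.shape[0], v.shape[1]))  # running softmax numerator
    den = jnp.zeros((q.shape[0],))             # running softmax denominator
    for start in range(0, n, chunk):           # plain Python loop for clarity
        s = q @ k[start:start + chunk].T / jnp.sqrt(d)
        m_new = jnp.maximum(m, s.max(axis=1))
        scale = jnp.exp(m - m_new)             # rescale old partial sums
        p = jnp.exp(s - m_new[:, None])
        num = num * scale[:, None] + p @ v[start:start + chunk]
        den = den * scale + p.sum(axis=1)
        m = m_new
    return num / den[:, None]

# Sanity check against the quadratic reference implementation.
q = k = v = jnp.eye(4)
full = jax.nn.softmax(q @ k.T / jnp.sqrt(4.0)) @ v
assert jnp.allclose(chunked_attention(q, k, v, chunk=2), full, atol=1e-5)
```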