srush / annotated-s4
Implementation of https://srush.github.io/annotated-s4
☆479 Updated last year
Alternatives and similar repositories for annotated-s4:
Users who are interested in annotated-s4 are comparing it to the libraries listed below:
- ☆278 Updated 3 weeks ago
- Structured state space sequence models ☆2,538 Updated 6 months ago
- Annotated version of the Mamba paper ☆470 Updated 11 months ago
- Sequence modeling with Mega. ☆297 Updated 2 years ago
- ☆172 Updated 8 months ago
- Long Range Arena for Benchmarking Efficient Transformers ☆740 Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆536 Updated this week
- For optimization algorithm research and development. ☆486 Updated last week
- Code for our NeurIPS 2022 paper ☆366 Updated 2 years ago
- Helpful tools and examples for working with flex-attention ☆603 Updated this week
- Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch ☆614 Updated 2 months ago
- Accelerated First Order Parallel Associative Scan ☆170 Updated 5 months ago
- ☆203 Updated 6 months ago
- ☆253 Updated 2 years ago
- Language Modeling with the H3 State Space Model ☆515 Updated last year
- Named tensors with first-class dimensions for PyTorch ☆321 Updated last year
- ☆163 Updated 2 years ago
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions ☆257 Updated last year
- Understand and test language model architectures on synthetic tasks. ☆177 Updated last week
- Reading list for research topics in state-space models ☆257 Updated last week
- ☆149 Updated last month
- Implementation of Block Recurrent Transformer - Pytorch ☆217 Updated 5 months ago
- ☆336 Updated 9 months ago
- Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch ☆299 Updated 7 months ago
- ☆164 Updated last year
- ☆217 Updated 9 months ago
- Neural Networks and the Chomsky Hierarchy ☆195 Updated 9 months ago
- Library for reading and processing ML training data. ☆366 Updated this week
- A repository for log-time feedforward networks ☆217 Updated 9 months ago
- Puzzles for exploring transformers ☆331 Updated last year