srush / annotated-s4Links
Implementation of https://srush.github.io/annotated-s4
☆506Updated 5 months ago
Alternatives and similar repositories for annotated-s4
Users that are interested in annotated-s4 are comparing it to the libraries listed below
Sorting:
- ☆310Updated 10 months ago
- Annotated version of the Mamba paper☆491Updated last year
- ☆363Updated last year
- Long Range Arena for Benchmarking Efficient Transformers☆768Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆685Updated last week
- For optimization algorithm research and development.☆547Updated 2 weeks ago
- ☆285Updated last year
- Language Modeling with the H3 State Space Model☆519Updated 2 years ago
- ☆259Updated 6 months ago
- Neural Networks and the Chomsky Hierarchy☆211Updated last year
- Named tensors with first-class dimensions for PyTorch☆332Updated 2 years ago
- ☆190Updated last year
- Code for our NeurIPS 2022 paper☆370Updated 2 years ago
- Sequence modeling with Mega.☆301Updated 2 years ago
- Betty: an automatic differentiation library for generalized meta-learning and multilevel optimization☆344Updated last year
- Accelerated First Order Parallel Associative Scan☆192Updated last year
- ☆164Updated 2 years ago
- Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch☆780Updated 4 months ago
- Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch☆373Updated last year
- Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).☆127Updated last year
- Load tensorboard event logs as pandas DataFrames for scientific plotting; Supports both PyTorch and TensorFlow☆206Updated last year
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…☆401Updated this week
- ☆460Updated last year
- Puzzles for exploring transformers☆380Updated 2 years ago
- ☆225Updated last year
- maximal update parametrization (µP)☆1,636Updated last year
- TensorDict is a pytorch dedicated tensor container.☆988Updated last week
- JAX Synergistic Memory Inspector☆182Updated last year
- An interpreter for RASP as described in the ICML 2021 paper "Thinking Like Transformers"☆322Updated last year
- An implementation of local windowed attention for language modeling☆488Updated 4 months ago