srush / annotated-s4
Implementation of https://srush.github.io/annotated-s4
☆469 · Updated last year
Related projects
Alternatives and complementary repositories for annotated-s4
- ☆261 · Updated 3 months ago
- Annotated version of the Mamba paper ☆457 · Updated 8 months ago
- Sequence modeling with Mega. ☆298 · Updated last year
- Long Range Arena for Benchmarking Efficient Transformers ☆729 · Updated 11 months ago
- ☆167 · Updated 5 months ago
- For optimization algorithm research and development. ☆449 · Updated this week
- Implementation of Rotary Embeddings, from the Roformer paper, in PyTorch ☆571 · Updated last week
- ☆334 · Updated 7 months ago
- Named tensors with first-class dimensions for PyTorch ☆322 · Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and JAX ☆516 · Updated this week
- Structured state space sequence models ☆2,470 · Updated 4 months ago
- ☆164 · Updated last year
- A PyTorch implementation of Perceiver, Perceiver IO and Perceiver AR with PyTorch Lightning scripts for distributed training ☆437 · Updated 10 months ago
- ☆303 · Updated this week
- TensorDict is a PyTorch-dedicated tensor container. ☆840 · Updated this week
- ☆197 · Updated 4 months ago
- Code for our NeurIPS 2022 paper ☆363 · Updated last year
- ☆207 · Updated 6 months ago
- ☆251 · Updated 2 years ago
- Accelerated First Order Parallel Associative Scan ☆163 · Updated 3 months ago
- Neural Networks and the Chomsky Hierarchy ☆187 · Updated 7 months ago
- CLU lets you write beautiful training loops in JAX. ☆321 · Updated this week
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena ☆203 · Updated last year
- ☆161 · Updated last year
- An implementation of local windowed attention for language modeling ☆384 · Updated 2 months ago
- Unofficial JAX implementations of deep learning research papers ☆151 · Updated 2 years ago
- Language Modeling with the H3 State Space Model ☆513 · Updated last year
- Implementation of ST-MoE, the latest incarnation of MoE after years of research at Brain, in PyTorch ☆293 · Updated 5 months ago
- Code for the ALiBi method for transformer language models (ICLR 2022) ☆507 · Updated last year
- A library to inspect and extract intermediate layers of PyTorch models. ☆470 · Updated 2 years ago