state-spaces / s4
Structured state space sequence models
โ2,455Updated 3 months ago
Related projects โ
Alternatives and complementary repositories for s4
- Pytorch library for fast transformer implementationsโ1,642Updated last year
- ๐ฆ Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorchโ2,028Updated 4 months ago
- Implementation of https://srush.github.io/annotated-s4โ468Updated last year
- Vector (and Scalar) Quantization, in Pytorchโ2,594Updated this week
- Transformer based on a variant of attention that is linear complexity in respect to sequence lengthโ695Updated 6 months ago
- maximal update parametrization (ยตP)โ1,398Updated 3 months ago
- Long Range Arena for Benchmarking Efficient Transformersโ727Updated 10 months ago
- A simple and efficient Mamba implementation in pure PyTorch and MLX.โ1,004Updated 2 months ago
- A concise but complete full-attention transformer with a set of promising experimental features from various papersโ4,768Updated this week
- Implementation of Rotary Embeddings, from the Roformer paper, in Pytorchโ565Updated last month
- PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. โก๐ฅโกโ4,236Updated 2 months ago
- An implementation of Performer, a linear attention-based transformer, in Pytorchโ1,093Updated 2 years ago
- Machine learning metrics for distributed, scalable PyTorch applications.โ2,133Updated this week
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.โ2,614Updated 8 months ago
- PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538โ974Updated 6 months ago
- Implementation of Perceiver, General Perception with Iterative Attention, in Pytorchโ1,092Updated last year
- โ749Updated last month
- Tensors, for human consumptionโ1,111Updated last week
- Schedule-Free Optimization in PyTorchโ1,880Updated this week
- TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.โ1,460Updated this week
- TorchCFM: a Conditional Flow Matching libraryโ1,198Updated last month
- Mamba SSM architectureโ13,130Updated this week
- Foundation Architecture for (M)LLMsโ3,025Updated 6 months ago
- An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"โ1,163Updated last year
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.โ2,303Updated last month
- PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)โ1,736Updated 3 months ago
- Collection of papers on state-space modelsโ549Updated this week
- Efficient implementations of state-of-the-art linear attention models in Pytorch and Tritonโ1,320Updated this week
- Reformer, the efficient Transformer, in Pytorchโ2,116Updated last year
- Unifying Variational Autoencoder (VAE) implementations in Pytorch (NeurIPS 2022)โ1,816Updated 3 months ago