Sid3503 / NoPropLinks
PyTorch implementation of the groundbreaking paper "NoProp: Training Neural Networks Without Backpropagation or Forward Propagation".
☆59Updated 2 months ago
Alternatives and similar repositories for NoProp
Users that are interested in NoProp are comparing it to the libraries listed below
Sorting:
- The official implementation of TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)☆376Updated 2 weeks ago
- A More Fair and Comprehensive Comparison between KAN and MLP☆171Updated 11 months ago
- [ICLR 2025 Spotlight] Official Implementation for ToST (Token Statistics Transformer)☆110Updated 4 months ago
- tinybig for deep function learning☆61Updated last month
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆224Updated last year
- When it comes to optimizers, it's always better to be safe than sorry☆302Updated 3 months ago
- Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States☆416Updated 11 months ago
- [ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule☆185Updated 4 months ago
- ☆136Updated last year
- State Space Models☆68Updated last year
- ☆69Updated 5 months ago
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…☆107Updated 3 months ago
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models☆176Updated 2 weeks ago
- This repository includes the official implementation our paper "Scaling White-Box Transformers for Vision"☆48Updated last year
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.☆38Updated 3 months ago
- Simba☆209Updated last year
- Minimal Mamba-2 implementation in PyTorch☆207Updated last year
- A Triton Kernel for incorporating Bi-Directionality in Mamba2☆71Updated 6 months ago
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆173Updated 3 months ago
- ☆43Updated 5 months ago
- xLSTM as Generic Vision Backbone☆480Updated 8 months ago
- Inference Speed Benchmark for Learning to (Learn at Test Time): RNNs with Expressive Hidden States☆71Updated last year
- ☆66Updated 8 months ago
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆144Updated 5 months ago
- ☆123Updated last month
- The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch☆65Updated last month
- Exploring Diffusion Transformer Designs via Grafting☆45Updated last month
- Implementation of ViTaR: ViTAR: Vision Transformer with Any Resolution in PyTorch☆37Updated 8 months ago
- Awesome list of papers that extend Mamba to various applications.☆134Updated last month
- This repo contains the code for the paper "Intuitive physics understanding emerges fromself-supervised pretraining on natural videos"☆169Updated 5 months ago