yhgon / NoPropLinks
implement of NoProp-CT
☆25Updated 7 months ago
Alternatives and similar repositories for NoProp
Users that are interested in NoProp are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of the groundbreaking paper "NoProp: Training Neural Networks Without Backpropagation or Forward Propagation".☆65Updated 7 months ago
- ☆76Updated 10 months ago
- ☆140Updated last year
- Benchmarking and Testing FastKAN☆88Updated last year
- An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivatio…☆99Updated last month
- Implementation of the proposed DeepCrossAttention by Heddes et al at Google research, in Pytorch☆95Updated 9 months ago
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆198Updated last month
- ☆43Updated last year
- Oscillatory State-Space Models☆111Updated last month
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆231Updated last month
- ☆129Updated 4 months ago
- ☆69Updated last year
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.☆42Updated 8 months ago
- Implementation of the proposed minGRU in Pytorch☆311Updated 8 months ago
- A Triton Kernel for incorporating Bi-Directionality in Mamba2☆75Updated 11 months ago
- KAN for Vision Transformer☆253Updated last year
- Kolmogorov–Arnold Networks with modified activation (using MLP to represent the activation)☆107Updated 2 months ago
- Awesome list of papers that extend Mamba to various applications.☆139Updated 6 months ago
- When it comes to optimizers, it's always better to be safe than sorry☆389Updated 2 months ago
- Minimal Mamba-2 implementation in PyTorch☆236Updated last year
- Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis☆260Updated 4 months ago
- [ICLR 2025 Spotlight] Official Implementation for ToST (Token Statistics Transformer)☆127Updated 9 months ago
- Benchmark for efficiency in memory and time of different KAN implementations.☆136Updated last year
- [IEEE Trans. AI 2024] Spiking Diffusion Models☆49Updated 7 months ago
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch☆120Updated 3 months ago
- Reading list for research topics in state-space models☆335Updated 6 months ago
- Implementation of the transformer proposed in "Building Blocks for a Complex-Valued Transformer Architecture"☆86Updated 2 years ago
- PyTorch implementation of Titans.☆30Updated 10 months ago
- [NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)☆431Updated last month
- FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN)☆455Updated last year