Event-AHU / Awesome_Modern_Hopfield_Networks
Paper list for Modern Hopfield Networks
☆12Updated last month
Alternatives and similar repositories for Awesome_Modern_Hopfield_Networks
Users that are interested in Awesome_Modern_Hopfield_Networks are comparing it to the libraries listed below
Sorting:
- HGRN2: Gated Linear RNNs with State Expansion☆54Updated 8 months ago
- Offical implementation of "MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map" (NeurIPS2024 Oral)☆23Updated 4 months ago
- Implementations of various linear RNN layers using pytorch and triton☆51Updated last year
- DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025)☆27Updated last month
- [ICLR 2024] Hebbian Learning based Orthogonal Projection for Continual Learning of Spiking Neural Networks☆36Updated last year
- [ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"☆99Updated this week
- Pytorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)☆76Updated last year
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…☆64Updated last year
- Parallelizing non-linear sequential models over the sequence length☆51Updated 4 months ago
- ☆23Updated 7 months ago
- An official pytorch implementation of EACL2024 short paper "Flow Matching for Conditional Text Generation in a Few Sampling Steps"☆16Updated 11 months ago
- A repository for DenseSSMs☆87Updated last year
- MambaFormer in-context learning experiments and implementation for https://arxiv.org/abs/2402.04248☆53Updated 11 months ago
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆53Updated last month
- Triton implement of bi-directional (non-causal) linear attention☆47Updated 3 months ago
- The official GitHub page for the survey paper "A Survey of RWKV".☆26Updated 4 months ago
- ☆103Updated last year
- This project contains code for the paper titled "SpikingBERT: Distilling BERT to Train Spiking Language Models Using Implicit Differentia…☆20Updated last year
- Here we will test various linear attention designs.☆60Updated last year
- "Towards Scaling Difference Target Propagation by Learning Backprop Targets" (ICML 2022)☆12Updated 2 years ago
- State Space Models☆67Updated last year
- [ICLR 2023] "Dilated convolution with learnable spacings" Ismail Khalfaoui Hassani, Thomas Pellegrini and Timothée Masquelier☆66Updated last year
- Official Code Repository for the paper "Key-value memory in the brain"☆25Updated 2 months ago
- Pytorch implementation of Hebbian learning algorithms to train deep convolutional neural networks.☆27Updated 10 months ago
- Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Mode…☆106Updated 8 months ago
- User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice rou…☆13Updated 2 weeks ago
- ☆48Updated last year
- ☆21Updated 2 years ago
- [ICLR 2024] Improving Convergence and Generalization Using Parameter Symmetries☆29Updated 11 months ago
- ☆47Updated last year