ml-research / self-expanding-neural-networks
Self-Expanding Neural Networks
☆39 Updated last year
Alternatives and similar repositories for self-expanding-neural-networks
Users who are interested in self-expanding-neural-networks are comparing it to the repositories listed below.
- Official code for the paper "Attention as a Hypernetwork" ☆36 Updated 11 months ago
- HGRN2: Gated Linear RNNs with State Expansion ☆54 Updated 9 months ago
- ☆53 Updated 8 months ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023) ☆124 Updated last year
- "Towards Scaling Difference Target Propagation by Learning Backprop Targets" (ICML 2022) ☆12 Updated 2 years ago
- ☆51 Updated 11 months ago
- A State-Space Model with Rational Transfer Function Representation. ☆78 Updated last year
- Deep Networks Grok All the Time and Here is Why ☆36 Updated last year
- Fork of Flame repo for training of some new stuff in development ☆13 Updated this week
- ☆31 Updated last year
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation" ☆14 Updated last week
- Parallelizing non-linear sequential models over the sequence length ☆51 Updated 4 months ago
- Code for ☆27 Updated 5 months ago
- ☆32 Updated last year
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025] ☆19 Updated 2 weeks ago
- ☆26 Updated 2 years ago
- ☆23 Updated 8 months ago
- Adaptation of titans-pytorch to llama models on HF ☆16 Updated 3 months ago
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024) ☆12 Updated 4 months ago
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction. ☆36 Updated 2 months ago
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior. ☆40 Updated last year
- Induce brain-like topographic structure in your neural networks ☆62 Updated 2 weeks ago
- Official Code Repository for the paper "Key-value memory in the brain" ☆26 Updated 3 months ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model ☆50 Updated 6 months ago
- Hrrformer: A Neuro-symbolic Self-attention Model (ICML23) ☆55 Updated 2 years ago
- Python implementation of the methods in Meulemans et al. 2020 - A Theoretical Framework For Target Propagation ☆32 Updated 7 months ago
- Here we will test various linear attention designs. ☆58 Updated last year
- ☆13 Updated 2 years ago
- ☆65 Updated 7 months ago
- Official code for our NeurIPS 2024 paper "einspace: Searching for Neural Architectures from Fundamental Operations" ☆28 Updated 7 months ago