ml-research / self-expanding-neural-networks
Self-Expanding Neural Networks
☆39Updated last year
Alternatives and similar repositories for self-expanding-neural-networks:
Users that are interested in self-expanding-neural-networks are comparing it to the libraries listed below
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆123Updated last year
- ☆52Updated 6 months ago
- HGRN2: Gated Linear RNNs with State Expansion☆54Updated 8 months ago
- Official code for our NeurIPS 2024 paper "einspace: Searching for Neural Architectures from Fundamental Operations"☆28Updated 6 months ago
- C++ and Cuda ops for fused FourierKAN☆77Updated 11 months ago
- Implementation/simulation of the predictive forward-forward credit assignment algorithm for training neurobiologically-plausible recurren…☆56Updated 2 years ago
- "Towards Scaling Difference Target Propagation by Learning Backprop Targets" (ICML 2022)☆12Updated 2 years ago
- ☆88Updated 10 months ago
- Official code for the paper "Attention as a Hypernetwork"☆28Updated 10 months ago
- Unofficial Implementation of Selective Attention Transformer☆16Updated 5 months ago
- Omnigrok: Grokking Beyond Algorithmic Data☆55Updated 2 years ago
- Deep Networks Grok All the Time and Here is Why☆34Updated 11 months ago
- ☆64Updated 6 months ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆18Updated last month
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.☆36Updated 2 weeks ago
- Code for☆27Updated 4 months ago
- DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025)☆26Updated 2 weeks ago
- Source code for the paper "Positional Attention: Out-of-Distribution Generalization and Expressivity for Neural Algorithmic Reasoning"☆14Updated 2 months ago
- ☆31Updated last year
- A State-Space Model with Rational Transfer Function Representation.☆78Updated 11 months ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆42Updated 5 months ago
- High order and sparse layers in pytorch. Lagrange Polynomial, Piecewise Lagrange Polynomial, Piecewise Discontinuous Lagrange Polynomial…☆44Updated 10 months ago
- ☆14Updated 3 years ago
- Inference Speed Benchmark for Learning to (Learn at Test Time): RNNs with Expressive Hidden States☆66Updated 9 months ago
- Code for experiments on transformers using Markovian data.☆11Updated 5 months ago
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆40Updated last year
- ☆49Updated last year
- ☆30Updated 5 months ago
- Parallelizing non-linear sequential models over the sequence length☆51Updated 3 months ago
- ☆54Updated 8 months ago