ml-research / self-expanding-neural-networksLinks
Self-Expanding Neural Networks
☆39Updated last year
Alternatives and similar repositories for self-expanding-neural-networks
Users that are interested in self-expanding-neural-networks are comparing it to the libraries listed below
Sorting:
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆127Updated 2 years ago
- C++ and Cuda ops for fused FourierKAN☆80Updated last year
- PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.☆87Updated last year
- Deep Networks Grok All the Time and Here is Why☆37Updated last year
- Modern Fixed Point Systems using Pytorch☆118Updated last year
- ☆96Updated last year
- A State-Space Model with Rational Transfer Function Representation.☆82Updated last year
- Implementation of the transformer proposed in "Building Blocks for a Complex-Valued Transformer Architecture"☆84Updated 2 years ago
- HGRN2: Gated Linear RNNs with State Expansion☆54Updated last year
- Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023]☆89Updated 2 years ago
- Official source code for "Graph Neural Networks for Learning Equivariant Representations of Neural Networks". In ICLR 2024 (oral).☆82Updated last year
- ☆67Updated 11 months ago
- Parallelizing non-linear sequential models over the sequence length☆54Updated 3 months ago
- Official implementation of "Fourier Head: Helping Large Language Models Learn Complex Probability Distributions" (ICLR 2025)☆66Updated 6 months ago
- Kolmogorov–Arnold Networks with modified activation (using MLP to represent the activation)☆105Updated 2 weeks ago
- ☆306Updated 9 months ago
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.☆39Updated 6 months ago
- ☆127Updated 2 months ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆40Updated 2 years ago
- High order and sparse layers in pytorch. Lagrange Polynomial, Piecewise Lagrange Polynomial, Piecewise Discontinuous Lagrange Polynomial…☆45Updated last year
- Implementation of "Gradients without backpropagation" paper (https://arxiv.org/abs/2202.08587) using functorch☆113Updated 2 years ago
- ☆58Updated last year
- Package for working with hypernetworks in PyTorch.☆131Updated 2 years ago
- ☆217Updated 10 months ago
- Rational Activation Functions - Replacing Padé Activation Units☆100Updated 7 months ago
- Omnigrok: Grokking Beyond Algorithmic Data☆62Updated 2 years ago
- Free-form flows are a generative model training a pair of neural networks via maximum likelihood☆49Updated 3 months ago
- Implementation of GateLoop Transformer in Pytorch and Jax☆90Updated last year
- ☆292Updated 10 months ago
- ☆15Updated 3 years ago