ml-research / self-expanding-neural-networksLinks

Self-Expanding Neural Networks

☆39

Alternatives and similar repositories for self-expanding-neural-networks

Users that are interested in self-expanding-neural-networks are comparing it to the libraries listed below

Sorting:

thjashin / multires-conv
Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)
☆127Updated 2 years ago
GistNoesis / FusedFourierKAN
C++ and Cuda ops for fused FourierKAN
☆80Updated last year
TariqAHassan / S4Torch
PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.
☆87Updated last year
AhmedImtiazPrio / grok-adversarial
Deep Networks Grok All the Time and Here is Why
☆37Updated last year
locuslab / torchdeq
Modern Fixed Point Systems using Pytorch
☆118Updated last year
quiqi / relu_kan
☆96Updated last year
ruke1ire / RTF
A State-Space Model with Rational Transfer Function Representation.
☆82Updated last year
lucidrains / complex-valued-transformer
Implementation of the transformer proposed in "Building Blocks for a Complex-Valued Transformer Architecture"
☆84Updated 2 years ago
OpenNLPLab / HGRN2
HGRN2: Gated Linear RNNs with State Expansion
☆54Updated last year
AvivNavon / DWSNets
Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023]
☆89Updated 2 years ago
mkofinas / neural-graphs
Official source code for "Graph Neural Networks for Learning Equivariant Representations of Neural Networks". In ICLR 2024 (oral).
☆82Updated last year
NVlabs / ConvSSM
☆67Updated 11 months ago
machine-discovery / deer
Parallelizing non-linear sequential models over the sequence length
☆54Updated 3 months ago
nate-gillman / fourier-head
Official implementation of "Fourier Head: Helping Large Language Models Learn Complex Probability Distributions" (ICLR 2025)
☆66Updated 6 months ago
Zhangyanbo / MLP-KAN
Kolmogorov–Arnold Networks with modified activation (using MLP to represent the activation)
☆105Updated 2 weeks ago
lindermanlab / S5
☆306Updated 9 months ago
hyperevolnet / Terminator
The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.
☆39Updated 6 months ago
jacobfa / fft
☆127Updated 2 months ago
Ping-C / optimizer
This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…
☆40Updated 2 years ago
jloveric / high-order-layers-torch
High order and sparse layers in pytorch. Lagrange Polynomial, Piecewise Lagrange Polynomial, Piecewise Discontinuous Lagrange Polynomial…
☆45Updated last year
orobix / fwdgrad
Implementation of "Gradients without backpropagation" paper (https://arxiv.org/abs/2202.08587) using functorch
☆113Updated 2 years ago
shikaiqiu / compute-better-spent
☆58Updated last year
chrhenning / hypnettorch
Package for working with hypernetworks in PyTorch.
☆131Updated 2 years ago
nikhilvyas / SOAP
☆217Updated 10 months ago
ml-research / rational_activations
Rational Activation Functions - Replacing Padé Activation Units
☆100Updated 7 months ago
KindXiaoming / Omnigrok
Omnigrok: Grokking Beyond Algorithmic Data
☆62Updated 2 years ago
vislearn / FFF
Free-form flows are a generative model training a pair of neural networks via maximum likelihood
☆49Updated 3 months ago
lucidrains / gateloop-transformer
Implementation of GateLoop Transformer in Pytorch and Jax
☆90Updated last year
bobby-he / simplified_transformers
☆292Updated 10 months ago
mkhodak / relax
☆15Updated 3 years ago