thomasahle / kanmlps

KANs and MLPs

☆11

Alternatives and similar repositories for kanmlps

Users who are interested in kanmlps are comparing it to the libraries listed below.

modula-systems / modula
🧱 Modula software package
☆204 Updated 3 months ago
LucasPrietoAl / grokking-at-the-edge-of-numerical-stability
☆98 Updated 5 months ago
johnma2006 / candle
Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.
☆50 Updated last year
nikhilvyas / SOAP
☆197 Updated 7 months ago
proger / accelerated-scan
Accelerated First Order Parallel Associative Scan
☆182 Updated 10 months ago
LIONS-EPFL / scion
☆26 Updated 2 weeks ago
vvvm23 / mamba-jax
Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX
☆84 Updated last year
EleutherAI / nanoGPT-mup
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆147 Updated 2 weeks ago
alxndrTL / othello_mamba
Evaluating the Mamba architecture on the Othello game
☆47 Updated last year
johnryan465 / pscan
☆40 Updated last year
shikaiqiu / compute-better-spent
☆53 Updated 9 months ago
GallagherCommaJack / modulax
☆17 Updated 10 months ago
lixilinx / psgd_torch
PyTorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…
☆179 Updated last month
bloc97 / DeMo
DeMo: Decoupled Momentum Optimization
☆189 Updated 7 months ago
proger / hippogriff
Griffin MQA + Hawk Linear RNN Hybrid
☆87 Updated last year
ethansmith2000 / fsdp_optimizers
Supporting PyTorch FSDP for optimizers
☆82 Updated 7 months ago
opooladz / Preconditioned-Stochastic-Gradient-Descent
A repo based on XiLin Li's PSGD repo that extends some of the experiments.
☆14 Updated 9 months ago
Silent-Zebra / twisted-smc-lm
☆29 Updated 3 months ago
AndPotap / einsum-search
☆32 Updated 9 months ago
lucidrains / transformer-directed-evolution
Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster
☆70 Updated last month
dvruette / barrel-rec-pytorch
☆53 Updated last year
athms / mad-lab
A MAD laboratory to improve AI architecture designs 🧪
☆123 Updated 7 months ago
srush / triton-autodiff
Experiment in using Tangent to autodiff Triton
☆79 Updated last year
berlino / seq_icl
☆53 Updated last year
NVlabs / Forecasting-Model-Search
A system for automating selection and optimization of pre-trained models from the TAO Model Zoo
☆25 Updated last year
machine-discovery / deer
Parallelizing non-linear sequential models over the sequence length
☆52 Updated 3 weeks ago
glassroom / heinsen_sequence
Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023)
☆94 Updated 7 months ago
NX-AI / xlstm-jax
Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.…
☆97 Updated 6 months ago
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆219 Updated last month
NVIDIA / ngpt
Normalized Transformer (nGPT)
☆184 Updated 7 months ago