lucidrains / minGRU-pytorchLinks

Implementation of the proposed minGRU in Pytorch

☆300

Alternatives and similar repositories for minGRU-pytorch

Users that are interested in minGRU-pytorch are comparing it to the libraries listed below

Sorting:

lucidrains / adam-atan2-pytorch
Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch
☆115Updated 8 months ago
lucidrains / hyper-connections
Attempt to make multiple residual streams from Bytedance's Hyper-Connections paper accessible to the public
☆88Updated last month
myscience / x-lstm
Pytorch implementation of the xLSTM model by Beck et al. (2024)
☆169Updated 11 months ago
i404788 / s5-pytorch
Pytorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)
☆77Updated last year
nanowell / AdEMAMix-Optimizer-Pytorch
The AdEMAMix Optimizer: Better, Faster, Older.
☆184Updated 10 months ago
lucidrains / gradnorm-pytorch
A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch
☆100Updated last year
TariqAHassan / S4Torch
PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.
☆83Updated last year
ruke1ire / RTF
A State-Space Model with Rational Transfer Function Representation.
☆79Updated last year
lindermanlab / S5
☆298Updated 7 months ago
lucidrains / nGPT-pytorch
Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI
☆289Updated 2 months ago
PeaBrane / mamba-tiny
Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).
☆120Updated 9 months ago
lucidrains / gateloop-transformer
Implementation of GateLoop Transformer in Pytorch and Jax
☆89Updated last year
kyleliang919 / C-Optim
When it comes to optimizers, it's always better to be safe than sorry
☆352Updated this week
nikhilvyas / SOAP
☆206Updated 8 months ago
lucidrains / transformer-directed-evolution
Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster
☆70Updated 2 months ago
kyegomez / Griffin
Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"
☆56Updated 2 weeks ago
kyegomez / LiqudNet
Implementation of Liquid Nets in Pytorch
☆67Updated 2 weeks ago
lucidrains / agent-attention-pytorch
Implementation of Agent Attention in Pytorch
☆91Updated last year
lucidrains / block-recurrent-transformer-pytorch
Implementation of Block Recurrent Transformer - Pytorch
☆221Updated 11 months ago
lucidrains / pytorch-custom-utils
Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…
☆124Updated last year
tk-rusch / linoss
Oscillatory State-Space Models
☆96Updated 4 months ago
SynodicMonth / ChebyKAN
Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.
☆383Updated last year
ctlllll / SGConv
☆163Updated 2 years ago
myscience / mamba
Pytorch (Lightning) implementation of the Mamba model
☆29Updated 3 months ago
tommyip / mamba2-minimal
Minimal Mamba-2 implementation in PyTorch
☆212Updated last year
lucidrains / grokfast-pytorch
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"
☆101Updated 7 months ago
TorchJD / torchjd
Library for Jacobian descent with PyTorch. It enables the optimization of neural networks with multiple losses (e.g. multi-task learning)…
☆264Updated this week
fkodom / yet-another-retnet
A simple but robust PyTorch implementation of RetNet from "Retentive Network: A Successor to Transformer for Large Language Models" (http…
☆106Updated last year
AvivBick / awesome-ssm-ml
Reading list for research topics in state-space models
☆315Updated last month
lucidrains / ttt-rl
Reinforcement Learning example in Nim, playing tic tac toe. Based off original C version from the great Antirez
☆14Updated 4 months ago