cheind / mingru
Torch MinGRU implementation based on "Were RNNs All We Needed?"
☆20 · Updated last year
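For context, here is a minimal sequential sketch of the minGRU recurrence from the paper: both the update gate and the candidate state depend only on the current input (not on the previous hidden state), which is what lets the recurrence be computed with a parallel prefix scan. The class and variable names below are illustrative, not the repository's API, and the step-by-step loop stands in for the log-space parallel-scan form that practical implementations typically use.

```python
import torch
from torch import nn


class MinGRUSketch(nn.Module):
    """Sequential minGRU recurrence: h_t = (1 - z_t) * h_{t-1} + z_t * h~_t,
    where z_t = sigmoid(W_z x_t) and h~_t = W_h x_t depend only on x_t."""

    def __init__(self, input_dim: int, hidden_dim: int):
        super().__init__()
        self.to_gate = nn.Linear(input_dim, hidden_dim)        # produces z_t
        self.to_candidate = nn.Linear(input_dim, hidden_dim)   # produces h~_t

    def forward(self, x, h=None):
        # x: (batch, seq_len, input_dim)
        batch, seq_len, _ = x.shape
        if h is None:
            h = x.new_zeros(batch, self.to_gate.out_features)
        outputs = []
        for t in range(seq_len):
            z = torch.sigmoid(self.to_gate(x[:, t]))    # update gate from x_t only
            h_tilde = self.to_candidate(x[:, t])        # candidate state from x_t only
            h = (1 - z) * h + z * h_tilde               # convex blend with previous state
            outputs.append(h)
        return torch.stack(outputs, dim=1)              # (batch, seq_len, hidden_dim)


# Usage (shapes only; hyperparameters are arbitrary):
rnn = MinGRUSketch(input_dim=16, hidden_dim=32)
out = rnn(torch.randn(2, 10, 16))
print(out.shape)  # torch.Size([2, 10, 32])
```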
Alternatives and similar repositories for mingru
Users interested in mingru are comparing it to the libraries listed below.
- Implementation of the proposed Adam-atan2 from Google DeepMind in Pytorch ☆135 · Updated 3 months ago
- An implementation of FAdam (Fisher Adam) in PyTorch ☆50 · Updated 7 months ago
- The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch ☆73 · Updated 2 months ago
- Implementation of a Light Recurrent Unit in Pytorch ☆49 · Updated last year
- Implementation of the proposed DeepCrossAttention by Heddes et al. at Google Research, in Pytorch ☆96 · Updated 11 months ago
- Implementation of the transformer proposed in "Building Blocks for a Complex-Valued Transformer Architecture" ☆88 · Updated 2 years ago
- Attempt to make multiple residual streams from ByteDance's Hyper-Connections paper accessible to the public ☆168 · Updated 3 weeks ago
- Implementation of Agent Attention in Pytorch ☆93 · Updated last year
- A PyTorch wrapper of parallel exclusive scan in CUDA ☆12 · Updated 2 years ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models' ☆20 · Updated last year
- Implementation of the proposed minGRU in Pytorch ☆319 · Updated last month
- Implementations of various linear RNN layers using pytorch and triton ☆54 · Updated 2 years ago
- Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels. ☆84 · Updated 2 months ago
- FlashRNN - Fast RNN Kernels with I/O Awareness ☆174 · Updated 3 months ago
- Triton implementation of bi-directional (non-causal) linear attention ☆65 · Updated this week
- [ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning ☆137 · Updated last month
- ☆166 · Updated 3 months ago
- A simple implementation of [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752) ☆22 · Updated 2 years ago
- Here we will test various linear attention designs. ☆62 · Updated last year
- [ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition ☆34 · Updated 4 years ago
- Explorations into the recently proposed Taylor Series Linear Attention ☆100 · Updated last year
- A State-Space Model with Rational Transfer Function Representation. ☆83 · Updated last year
- Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make it practical in Fast and Simplex, Ro… ☆46 · Updated 5 months ago
- A Triton Kernel for incorporating Bi-Directionality in Mamba2 ☆78 · Updated last year
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation ☆90 · Updated last year
- Implementation of GateLoop Transformer in Pytorch and Jax ☆92 · Updated last year
- Pytorch Implementation of the sparse attention from the paper: "Generating Long Sequences with Sparse Transformers" ☆93 · Updated last week
- Unofficial implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023) ☆61 · Updated 5 months ago
- PyTorch implementation of the Flash Spectral Transform Unit. ☆21 · Updated last year
- [ICLR 2025 Spotlight] Official Implementation for ToST (Token Statistics Transformer) ☆130 · Updated 11 months ago