cheind / mingru
Torch MinGRU implementation based on "Were RNNs All We Needed?"
☆14Updated 4 months ago
Alternatives and similar repositories for mingru:
Users that are interested in mingru are comparing it to the libraries listed below
- Implementation of the proposed minGRU in Pytorch☆285Updated last month
- an implementation of FAdam (Fisher Adam) in PyTorch☆43Updated 10 months ago
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch☆91Updated last year
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆103Updated 4 months ago
- The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch☆58Updated 2 months ago
- Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.☆9Updated last year
- PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.☆79Updated last year
- Implementation of the transformer proposed in "Building Blocks for a Complex-Valued Transformer Architecture"☆73Updated last year
- A 1D implementation of a deformable convolutional layer in PyTorch with a few tricks.☆40Updated last year
- The official repository for the paper "Optimal Flow Matching: Learning Straight Trajectories in Just One Step" (NeurIPS 2024)☆72Updated 4 months ago
- Attempt to make multiple residual streams from Bytedance's Hyper-Connections paper accessible to the public☆82Updated 2 months ago
- Implementation of the proposed DeepCrossAttention by Heddes et al at Google research, in Pytorch☆82Updated last month
- Implicit^2: Implicit model for implicit neural representations☆28Updated 3 years ago
- Cubic spline interpolation on multidimensional grids in PyTorch☆26Updated this week
- Official PyTorch implementation of the paper: Flow Matching in Latent Space☆266Updated 2 months ago
- Inspired by "Neural Networks Fail to Learn Periodic Functions and How to Fix It"☆65Updated 11 months ago
- PyTorch Implementation of DSB for Score Based Generative Modeling. Experiments managed using Hydra.☆164Updated 3 years ago
- [ICLR 2024] Official implementation of Bellman Optimal Stepsize Straightening of Flow-Matching Models☆35Updated last year
- Implementation of "Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains" by Tancik et al.☆91Updated 2 years ago
- A parallel ODE solver for PyTorch☆254Updated 6 months ago
- Official PyTorch implementation for the paper Minimizing Trajectory Curvature of ODE-based Generative Models, ICML 2023☆82Updated 2 months ago
- The official codebase for Reflected Flow Matching (ICML 2024)☆16Updated 10 months ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆123Updated last year
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆52Updated 2 weeks ago
- Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".☆61Updated 8 months ago
- C++ and Cuda ops for fused FourierKAN☆77Updated 11 months ago
- Implementation of a U-net complete with efficient attention as well as the latest research findings☆277Updated 11 months ago
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆36Updated last year
- A Triton Kernel for incorporating Bi-Directionality in Mamba2☆64Updated 4 months ago
- KAN meets Gram Polynomials☆16Updated 8 months ago