flukeskywalker / nanoDD
Simple Scalable Discrete Diffusion for text in PyTorch
☆37 · Updated last year
Alternatives and similar repositories for nanoDD
Users who are interested in nanoDD are comparing it to the libraries listed below.
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior. ☆41 · Updated last year
- ☆53 · Updated 3 weeks ago
- Code for paper "Compositional Sculpting of Iterative Generative Processes" ☆25 · Updated 2 years ago
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023 ☆20 · Updated 2 years ago
- ☆31 · Updated last year
- ☆36 · Updated 9 months ago
- The Superposition of Diffusion Models Using the Itô Density Estimator ☆50 · Updated 9 months ago
- Transformers with doubly stochastic attention ☆51 · Updated 3 years ago
- ☆42 · Updated 3 years ago
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling" ☆87 · Updated 2 years ago
- Improved sampling via learned diffusions (ICLR 2024) and an optimal control perspective on diffusion-based generative modeling (TMLR 2024) ☆71 · Updated 10 months ago
- ☆36 · Updated this week
- Official JAX implementation of MD4 Masked Diffusion Models ☆151 · Updated 10 months ago
- PyTorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network ☆51 · Updated last year
- Code for NeurIPS 2024 paper: "Noether's razor: Learning Conserved Quantities" by Tycho F. A. van der Ouderaa, Mark van der Wilk, Pim de H… ☆11 · Updated last year
- Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold ☆69 · Updated 10 months ago
- Official implementation of Fisher-Flow Matching (NeurIPS 2024). ☆35 · Updated last year
- Implementation of Action Matching for the Schrödinger equation ☆25 · Updated 2 years ago
- ☆18 · Updated last year
- Scalable and Stable Parallelization of Nonlinear RNNs ☆28 · Updated 2 months ago
- The Energy Transformer block, in JAX ☆63 · Updated 2 years ago
- Code for "Adjoint Sampling: Highly Scalable Diffusion Samplers via Adjoint Matching" ☆128 · Updated 5 months ago
- Perceiver (transformer variant) implemented in JAX and Flax ☆13 · Updated 4 years ago
- Your favourite classical machine learning algos on the GPU/TPU ☆20 · Updated last month
- Meta-learning inductive biases in the form of useful conserved quantities. ☆39 · Updated 3 years ago
- ☆62 · Updated last year
- An annotated implementation of the Hyena Hierarchy paper ☆34 · Updated 2 years ago
- Official code for paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025] ☆84 · Updated 8 months ago
- Official release of code for "Oops I Took A Gradient: Scalable Sampling for Discrete Distributions" ☆58 · Updated 2 years ago
- Maximal Update Parametrization (μP) with Flax & Optax. ☆16 · Updated 2 years ago