flukeskywalker / nanoDDLinks
Simple Scalable Discrete Diffusion for text in PyTorch
☆37Updated last year
Alternatives and similar repositories for nanoDD
Users that are interested in nanoDD are comparing it to the libraries listed below
Sorting:
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆41Updated last year
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling"☆85Updated 2 years ago
- Code for paper "Compositional Sculpting of Iterative Generative Processes"☆25Updated 2 years ago
- ☆35Updated 8 months ago
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20Updated 2 years ago
- Transformers with doubly stochastic attention☆50Updated 3 years ago
- ☆31Updated last year
- Official Jax Implementation of MD4 Masked Diffusion Models☆146Updated 9 months ago
- Official release of code for "Oops I Took A Gradient: Scalable Sampling for Discrete Distributions"☆57Updated 2 years ago
- ☆25Updated last year
- The Superposition of Diffusion Models Using the Itô Density Estimator☆51Updated 8 months ago
- Improved sampling via learned diffusions (ICLR2024) and an optimal control perspective on diffusion-based generative modeling (TMLR2024)☆70Updated 8 months ago
- Code for NeurIPS 2024 paper: "Noether's razor: Learning Conserved Quantities" by Tycho F. A. van der Ouderaa, Mark van der Wilk, Pim de H…☆11Updated last year
- ☆61Updated last year
- ☆34Updated 3 months ago
- ☆41Updated 3 years ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14Updated 6 months ago
- The Energy Transformer block, in JAX☆62Updated last year
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated 10 months ago
- code for "Adjoint Sampling: Highly Scalable Diffusion Samplers via Adjoint Matching"☆128Updated 4 months ago
- ☆27Updated 2 years ago
- Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold☆69Updated 9 months ago
- Implementation of Action Matching☆48Updated 2 years ago
- Euclidean Wasserstein-2 optimal transportation☆47Updated 2 years ago
- Implicit Convolutional Kernels for Steerable CNNs [NeurIPS'23]☆29Updated 9 months ago
- Code release for "Stochastic Optimal Control Matching"☆39Updated last year
- Scalable and Stable Parallelization of Nonlinear RNNS☆26Updated last month
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆51Updated last year
- Learning to Split for Automatic Bias Detection☆48Updated 2 years ago
- Concept Learning Dynamics☆16Updated last year