flukeskywalker / nanoDDLinks
Simple Scalable Discrete Diffusion for text in PyTorch
☆36Updated 11 months ago
Alternatives and similar repositories for nanoDD
Users that are interested in nanoDD are comparing it to the libraries listed below
Sorting:
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆41Updated last year
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20Updated 2 years ago
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling"☆84Updated 2 years ago
- Code for paper "Compositional Sculpting of Iterative Generative Processes"☆24Updated last year
- ☆34Updated 5 months ago
- ☆31Updated last year
- ☆38Updated 3 years ago
- Transformers with doubly stochastic attention☆47Updated 3 years ago
- Official Jax Implementation of MD4 Masked Diffusion Models☆126Updated 6 months ago
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆50Updated last year
- ☆58Updated 11 months ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated 8 months ago
- Improved sampling via learned diffusions (ICLR2024) and an optimal control perspective on diffusion-based generative modeling (TMLR2024)☆67Updated 6 months ago
- Code for NeurIPS 2024 paper: "Noether's razor: Learning Conserved Quantities" by Tycho F. A. van der Ouderaa, Mark van der Wilk, Pim de H…☆10Updated 11 months ago
- Official release of code for "Oops I Took A Gradient: Scalable Sampling for Discrete Distributions"☆56Updated 2 years ago
- The Superposition of Diffusion Models Using the Itô Density Estimator☆51Updated 6 months ago
- Learning to Split for Automatic Bias Detection☆47Updated 2 years ago
- Implementation of Action Matching☆46Updated 2 years ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Updated last year
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023)☆12Updated 2 years ago
- ☆14Updated 3 years ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆127Updated last year
- Implementation of GateLoop Transformer in Pytorch and Jax☆90Updated last year
- Implicit Convolutional Kernels for Steerable CNNs [NeurIPS'23]☆29Updated 7 months ago
- Scalable and Stable Parallelization of Nonlinear RNNS☆22Updated 3 weeks ago
- ☆33Updated 11 months ago
- code for "Adjoint Sampling: Highly Scalable Diffusion Samplers via Adjoint Matching"☆121Updated 2 months ago
- Perceiver (transformer variant) implemented in JAX and Flax☆12Updated 4 years ago
- ☆20Updated 3 years ago