flukeskywalker / nanoDD
Simple Scalable Discrete Diffusion for text in PyTorch
☆37 · Updated last year
Alternatives and similar repositories for nanoDD
Users who are interested in nanoDD are comparing it to the libraries listed below.
- ☆35 · Updated 8 months ago
- Code for paper "Compositional Sculpting of Iterative Generative Processes" ☆25 · Updated 2 years ago
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior. ☆41 · Updated last year
- The Superposition of Diffusion Models Using the Itô Density Estimator ☆50 · Updated 9 months ago
- ☆47 · Updated this week
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling" ☆86 · Updated 2 years ago
- Transformers with doubly stochastic attention ☆51 · Updated 3 years ago
- ☆31 · Updated last year
- Official Jax Implementation of MD4 Masked Diffusion Models ☆149 · Updated 9 months ago
- ☆62 · Updated last year
- Code for minimum-entropy coupling. ☆32 · Updated last month
- ☆41 · Updated 3 years ago
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023 ☆20 · Updated 2 years ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation" ☆14 · Updated 7 months ago
- Implementation of Action Matching ☆49 · Updated 2 years ago
- Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold ☆69 · Updated 9 months ago
- code for "Adjoint Sampling: Highly Scalable Diffusion Samplers via Adjoint Matching" ☆130 · Updated 5 months ago
- ☆35 · Updated last week
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network ☆51 · Updated last year
- Concept Learning Dynamics ☆16 · Updated last year
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023) ☆13 · Updated 2 years ago
- The Energy Transformer block, in JAX ☆63 · Updated 2 years ago
- Improved sampling via learned diffusions (ICLR2024) and an optimal control perspective on diffusion-based generative modeling (TMLR2024) ☆70 · Updated 9 months ago
- Official release of code for "Oops I Took A Gradient: Scalable Sampling for Discrete Distributions" ☆57 · Updated 2 years ago
- Code release for "Stochastic Optimal Control Matching" ☆39 · Updated last year
- Maximal Update Parametrization (μP) with Flax & Optax. ☆16 · Updated last year
- Implicit Convolutional Kernels for Steerable CNNs [NeurIPS'23] ☆30 · Updated 10 months ago
- Meta-learning inductive biases in the form of useful conserved quantities. ☆38 · Updated 3 years ago
- [ICML'21] Improved Contrastive Divergence Training of Energy Based Models ☆69 · Updated 3 years ago
- simple bibtex generator for any text with \cite{} ☆31 · Updated last year