flukeskywalker / nanoDDLinks
Simple Scalable Discrete Diffusion for text in PyTorch
☆33Updated 9 months ago
Alternatives and similar repositories for nanoDD
Users that are interested in nanoDD are comparing it to the libraries listed below
Sorting:
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20Updated 2 years ago
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆40Updated last year
- Code for NeurIPS 2024 paper: "Noether's razor: Learning Conserved Quantities" by Tycho F. A. van der Ouderaa, Mark van der Wilk, Pim de H…☆10Updated 8 months ago
- Transformers with doubly stochastic attention☆46Updated 2 years ago
- Official Jax Implementation of MD4 Masked Diffusion Models☆106Updated 4 months ago
- ☆34Updated 2 months ago
- The Superposition of Diffusion Models Using the Itô Density Estimator☆45Updated 3 months ago
- ☆32Updated 8 months ago
- simple bibtex generator for any text with \cite{}☆31Updated 11 months ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Updated last year
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated 5 months ago
- Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold☆55Updated 3 months ago
- Code for minimum-entropy coupling.☆32Updated last year
- Code for paper "Compositional Sculpting of Iterative Generative Processes"☆22Updated last year
- The Energy Transformer block, in JAX☆58Updated last year
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆50Updated 11 months ago
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- Perceiver (transformer variant) implemented in JAX and Flax☆12Updated 4 years ago
- ☆21Updated 2 months ago
- ☆53Updated 8 months ago
- Latent Diffusion Language Models☆68Updated last year
- ☆99Updated 2 years ago
- ☆115Updated last year
- code for "Adjoint Sampling: Highly Scalable Diffusion Samplers via Adjoint Matching"☆99Updated last month
- Code for the paper https://arxiv.org/abs/2402.04997☆78Updated last year
- ☆32Updated last year
- Implementation of Action Matching for the Schrödinger equation☆24Updated 2 years ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆36Updated 2 years ago
- Official implementation of Fisher-Flow Matching (NeurIPS 2024).☆23Updated 8 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆74Updated 7 months ago