kuleshov-group / remdm
Remasking Discrete Diffusion Models with Inference-Time Scaling
☆19Updated 2 months ago
Alternatives and similar repositories for remdm
Users that are interested in remdm are comparing it to the libraries listed below
Sorting:
- Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods"☆20Updated 2 months ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated 3 months ago
- Official Code Repository for the paper "Continuous Diffusion Model for Language Modeling".☆26Updated 2 months ago
- [ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"☆99Updated this week
- Explorations into adversarial losses on top of autoregressive loss for language modeling☆36Updated 2 months ago
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆50Updated 2 months ago
- Reward fine-tuning for Stable Diffusion models based on stochastic optimal control☆26Updated 2 weeks ago
- Official Jax Implementation of MD4 Masked Diffusion Models☆79Updated 2 months ago
- ☆45Updated last year
- Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025]☆58Updated 3 weeks ago
- The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch☆59Updated 2 weeks ago
- Implementation of a multimodal diffusion transformer in Pytorch☆102Updated 10 months ago
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024)☆49Updated 6 months ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆66Updated 7 months ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆56Updated last year
- Minimal Implementation of Visual Autoregressive Modelling (VAR)☆33Updated last month
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆35Updated 10 months ago
- Official Implementation of the paper: A Complete Recipe for Diffusion Generative Models☆30Updated 6 months ago
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆141Updated 3 months ago
- Generative Equilibrium Transformer☆18Updated last year
- ☆31Updated 4 months ago
- JAX Scalify: end-to-end scaled arithmetics☆16Updated 6 months ago
- ☆52Updated 7 months ago
- Simple Guidance Mechanisms for Discrete Diffusion Models☆37Updated 5 months ago
- HGRN2: Gated Linear RNNs with State Expansion☆54Updated 8 months ago
- ☆33Updated 8 months ago
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.☆18Updated 6 months ago
- RS-IMLE☆38Updated 5 months ago
- Code accompanying the paper "Generalized Interpolating Discrete Diffusion"☆78Updated last week
- [ICML 2024]: Official implementation for the paper: "Consistent Diffusion Meets Tweedie"☆53Updated last year