kuleshov-group / remdm
Remasking Discrete Diffusion Models with Inference-Time Scaling
☆15Updated 2 weeks ago
Alternatives and similar repositories for remdm:
Users that are interested in remdm are comparing it to the libraries listed below
- [ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"☆76Updated last week
- Official Code Repository for the paper "Continuous Diffusion Model for Language Modeling".☆20Updated last week
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024)☆45Updated 4 months ago
- ☆45Updated 10 months ago
- Official code for the paper "Attention as a Hypernetwork"☆25Updated 9 months ago
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆44Updated 2 weeks ago
- Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025]☆54Updated this week
- ☆17Updated 2 months ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆50Updated 3 months ago
- Generative Equilibrium Transformer☆17Updated last year
- Official Implementation of the paper: A Complete Recipe for Diffusion Generative Models☆30Updated 4 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated 9 months ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆23Updated 2 months ago
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024☆103Updated 5 months ago
- The official repo of continuous speculative decoding☆25Updated 4 months ago
- Code for the paper "Function-Space Learning Rates"☆17Updated last month
- Official code for the paper "Image generation with shortest path diffusion" accepted at ICML 2023.☆23Updated last year
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆36Updated 6 months ago
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆112Updated last month
- RS-IMLE☆38Updated 3 months ago
- Official Jax Implementation of MD4 Masked Diffusion Models☆69Updated last month
- HGRN2: Gated Linear RNNs with State Expansion☆53Updated 7 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆26Updated 11 months ago
- [NeurIPS 2024, spotlight] Multivariate Learned Adaptive Noise for Diffusion Models☆18Updated 3 months ago
- Minimal Implementation of Visual Autoregressive Modelling (VAR)☆28Updated this week
- JAX Scalify: end-to-end scaled arithmetics☆15Updated 4 months ago
- Official PyTorch implementation of "Denoising MCMC for Accelerating Diffusion-Based Generative Models", ICML 2023 Oral Paper☆29Updated last year
- [ICML 2024]: Official implementation for the paper: "Consistent Diffusion Meets Tweedie"☆52Updated 11 months ago
- ☆32Updated 4 months ago
- ☆33Updated 6 months ago