ML-GSAI / Scaling-Diffusion-Transformers-muPLinks
Official implementation for our paper "Scaling Diffusion Transformers Efficiently via μP".
☆68Updated last month
Alternatives and similar repositories for Scaling-Diffusion-Transformers-muP
Users that are interested in Scaling-Diffusion-Transformers-muP are comparing it to the libraries listed below
Sorting:
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR…☆49Updated 3 weeks ago
- ☆152Updated last week
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024☆114Updated 8 months ago
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆155Updated last week
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis☆109Updated last month
- Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods"☆22Updated last month
- [Preprint] UCGM: Unified Continuous Generative Models☆152Updated 3 weeks ago
- Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025]☆67Updated 2 months ago
- ☆60Updated 2 weeks ago
- [ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis☆33Updated 4 months ago
- A Collection of Papers on Diffusion Language Models☆81Updated last week
- Official Jax Implementation of MD4 Masked Diffusion Models☆106Updated 3 months ago
- The official implementation of OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows☆74Updated last week
- Remasking Discrete Diffusion Models with Inference-Time Scaling☆26Updated 3 months ago
- ☆37Updated last month
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆37Updated 3 months ago
- [Preprint] Efficient Generative Model Training via Embedded Representation Warmup☆30Updated 2 months ago
- ☆30Updated 2 months ago
- [ICML 2025] Gaussian Mixture Flow Matching Models (GMFlow)☆105Updated 3 weeks ago
- Code for the paper DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents, ICML 2024☆86Updated last year
- Official Code Repository for the paper "Continuous Diffusion Model for Language Modeling".☆31Updated 3 months ago
- Reward fine-tuning for Stable Diffusion models based on stochastic optimal control, including Adjoint Matching☆36Updated 3 weeks ago
- Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency"☆221Updated 5 months ago
- [ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"☆108Updated last month
- Code for TFG: Unified Training-Free Guidance for Diffusion Models☆59Updated last month
- Code for paper "Principal Components" Enable A New Language of Images☆44Updated 2 weeks ago
- Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model.☆59Updated 3 weeks ago
- PyTorch code and model checkpoints for Score identity Distillation (SiD) and its adversarial version (SiDA)☆124Updated 2 months ago
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google☆56Updated 10 months ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆76Updated 6 months ago