EnVision-Research / DDSM
Denoising Diffusion Step-aware Models (ICLR2024)
☆60 Updated last year
Alternatives and similar repositories for DDSM:
Users who are interested in DDSM are comparing it to the repositories listed below
- Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation (ICCV 2023) ☆64 Updated last year
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations" ☆32 Updated last month
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an… ☆107 Updated this week
- Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation ☆77 Updated 3 weeks ago
- [CVPR 2025 (Oral)] Open implementation of "RandAR" ☆129 Updated last month
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens" ☆67 Updated 3 weeks ago
- [ICLR 2024] Official pytorch implementation of "Denoising Task Routing for Diffusion Models" ☆21 Updated last year
- Frequency Autoregressive Image Generation with Continuous Tokens ☆59 Updated last month
- Official implementation of LaVin-DiT ☆32 Updated 3 months ago
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral). ☆69 Updated 2 months ago
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT ☆76 Updated 2 weeks ago
- ICCV2023-Diffusion-Papers ☆108 Updated last year
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment" ☆31 Updated 2 months ago
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient ☆97 Updated last month
- ☆73 Updated last month
- [CVPR 2025] DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention ☆163 Updated 2 months ago
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23. ☆75 Updated last year
- "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu ☆49 Updated 5 months ago
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis" ☆44 Updated 10 months ago
- FQGAN: Factorized Visual Tokenization and Generation ☆50 Updated last month
- ☆39 Updated last year
- This is the official implementation for ControlVAR. ☆104 Updated 4 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression" ☆86 Updated 6 months ago
- A collection of vision foundation models unifying understanding and generation. ☆55 Updated 4 months ago
- [ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts" ☆41 Updated 10 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics ☆91 Updated last month
- Official implementation of "Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive" (ICLR 2024) ☆54 Updated 8 months ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation ☆69 Updated 2 months ago
- Autoregressive Image Generation with Randomized Parallel Decoding ☆53 Updated last month
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective ☆68 Updated 6 months ago