forever208 / DCTdiff
Official code for the paper 'DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space'
☆25Updated 4 months ago
Alternatives and similar repositories for DCTdiff:
Users that are interested in DCTdiff are comparing it to the libraries listed below
- EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.☆97Updated 2 months ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆66Updated 5 months ago
- [ICLR 2024] Code for our paper: GNRI: Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion Models☆46Updated last month
- Official Implementation for Diffusion Models Without Classifier-free Guidance☆112Updated 2 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆57Updated 2 months ago
- ☆30Updated last month
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆70Updated this week
- "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu☆48Updated 5 months ago
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025)☆46Updated 3 months ago
- [NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis☆65Updated 2 months ago
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024☆108Updated 6 months ago
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"☆65Updated 2 weeks ago
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models☆37Updated 6 months ago
- ☆70Updated 5 months ago
- ☆45Updated last month
- Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing☆35Updated this week
- Training-Free Text-Guided Image Editing Using Visual Autoregressive Model☆21Updated last week
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers☆45Updated 3 weeks ago
- [CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo…☆164Updated last month
- LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models☆56Updated 8 months ago
- Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers☆143Updated last week
- Score identity Distillation with Long and Short Guidance for One-Step Text-to-Image Generation☆65Updated last month
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆57Updated 2 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆81Updated 5 months ago
- Official Github Repo for Neurips 2024 Paper Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment☆47Updated last month
- Official PyTorch implementation of "Generalized Consistency Trajectory Models for Image Manipulation"☆37Updated last year
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆46Updated 4 months ago
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model☆47Updated 7 months ago
- Official Implementations "Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models" (ICLR2024)☆49Updated 4 months ago
- Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025)☆71Updated last month