wtybest / EnMMDiT
☆11 Updated last month
Alternatives and similar repositories for EnMMDiT
Users that are interested in EnMMDiT are comparing it to the libraries listed below
- LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models ☆66 Updated last year
- [ICCV 2025] Diffusion Curriculum (DisCL) ☆15 Updated 3 months ago
- ☆15 Updated last year
- Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models" ☆19 Updated last year
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection ☆54 Updated 4 months ago
- [ICCV 2025] Official Implementation of Steering Rectified Flow Models in the Vector Field for Controlled Image Generation ☆39 Updated 6 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method ☆27 Updated 3 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model ☆13 Updated last year
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation ☆44 Updated 6 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening ☆69 Updated 7 months ago
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers ☆64 Updated last year
- ☆13 Updated 11 months ago
- [NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing ☆70 Updated 2 months ago
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025) ☆60 Updated 11 months ago
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark ☆27 Updated 8 months ago
- Vico: Compositional Video Generation as Flow Equalization ☆58 Updated last year
- Official Implementations "Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models" (ICLR2024) ☆57 Updated last year
- [ICML2025] LoRA fine-tune directly on the quantized models. ☆39 Updated last year
- EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing ☆90 Updated last month
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing" ☆28 Updated 8 months ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement" ☆51 Updated last year
- (CVPR 2025) Scaling Down Text Encoders of Text-to-Image Diffusion Models ☆50 Updated 3 months ago
- Distilling Diversity and Control in Diffusion Models ☆50 Updated 8 months ago
- MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024) ☆97 Updated 11 months ago
- Video Diffusion Transformers are In-Context Learners ☆36 Updated last year
- [ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis ☆39 Updated 10 months ago
- [AAAI 2025] LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation ☆41 Updated last year
- ☆62 Updated 2 weeks ago
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model ☆48 Updated last year
- [ICLR2025] ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance" ☆45 Updated 9 months ago