SNU-VGILab/exploring-mmdit

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SNU-VGILab/exploring-mmdit)

SNU-VGILab / exploring-mmdit

Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing

☆26

Alternatives and similar repositories for exploring-mmdit

Users that are interested in exploring-mmdit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SNU-VGILab / e-latentlpips
View on GitHub
Unofficial implementation of E-LatentLPIPS in Diffusion2GAN
☆20Sep 5, 2024Updated last year
SNU-VGILab / improving-editability
View on GitHub
[Official Implementation] Improving Editability in Image Generation with Layer-wise Memory, CVPR 2025
☆38Mar 2, 2026Updated 4 months ago
wlfeng0509 / Awesome-Diffusion-Quantization
View on GitHub
A list of papers, docs, codes about diffusion quantization.This repo collects various quantization methods for the Diffusion Models. Welc…
☆21Feb 2, 2026Updated 5 months ago
SNU-VGILab / InstaOrder
View on GitHub
Official repository for the paper "Instance-Wise Holistic Order Prediction in Natural Scenes".
☆29Jan 11, 2024Updated 2 years ago
RahulSajnani / GeoDiffuser
View on GitHub
[WACV 2025, Best Student Paper, Oral] GeoDiffuser: Geometry-Based Image Editing with Diffusion Models
☆22Mar 22, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
fkyyyy / DiT4Edit
View on GitHub
☆35Nov 5, 2024Updated last year
SNU-VGILab / Liv3Stroke
View on GitHub
Official Repository of Recovering Dynamic 3D Sketches from Videos (CVPR 2025)
☆15Mar 2, 2026Updated 4 months ago
snuvclab / pegasus
View on GitHub
[CVPR 2024] PEGASUS: Personalized Generative 3D Avatars with Composable Attributes
☆60Dec 30, 2024Updated last year
yunpeng1998 / GeoVideo
View on GitHub
code for "GeoVideo: Introducing Geometric Regularization into Video Generation Models"
☆18Jan 8, 2026Updated 6 months ago
TrustAIRLab / ZeroFake
View on GitHub
☆11Oct 30, 2024Updated last year
mingukkang / FlashDecoder
View on GitHub
Official FlashDecoder Github
☆17Apr 4, 2026Updated 3 months ago
furiosa-ai / eta-inversion
View on GitHub
[ECCV 2024] Official Pytorch Implementation for "Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing"
☆34Jun 16, 2025Updated last year
LifuWang-66 / DistillT5
View on GitHub
(CVPR 2025) Scailing Down Text Encoders of Text-to-Image Diffusion Models
☆53Sep 10, 2025Updated 10 months ago
jslee525 / PIC
View on GitHub
the official code of "Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation" (ECCV2024)
☆13Jan 14, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
wookiekim / CorrespondentDream
View on GitHub
Official PyTorch implementation of CorrespondentDream: Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences (CVPR 2024 Po…
☆19Apr 29, 2024Updated 2 years ago
wtybest / FreeFlux
View on GitHub
[ICCV 2025] FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing
☆77Mar 7, 2026Updated 4 months ago
nstar1125 / ShowMak3r
View on GitHub
Code release for the paper "ShowMak3r: Compositional TV Show Reconstruction" (CVPR 2025)
☆89May 29, 2026Updated last month
OurBluePrint / easy_video
View on GitHub
☆20Mar 3, 2025Updated last year
carpedkm / disentangled-subject-to-vid
View on GitHub
Learning Zero-Shot Subject-Driven Video Generation Using 1% Compute
☆59Jul 9, 2026Updated 2 weeks ago
cvlab-kaist / CAMEO
View on GitHub
Official implementation of "CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models"
☆57May 26, 2026Updated 2 months ago
cvlab-kaist / GSD
View on GitHub
Geometry-Aware Score Distillation via 3D Consistent Noising and Gradient Consistency Modeling
☆29Sep 17, 2024Updated last year
XinR-Tang / TGN
View on GitHub
This is the official implementation for our TGRS 2024 paper "Text-Guided Diverse Image Synthesis for Long-Tailed Remote Sensing Object Cl…
☆18Jul 3, 2024Updated 2 years ago
SNU-VGILab / InstantDrag
View on GitHub
InstantDrag: Improving Interactivity in Drag-based Image Editing
☆237May 28, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
cvlab-kaist / UFC
View on GitHub
☆12Mar 17, 2024Updated 2 years ago
cvlab-kaist / Seg4Diff
View on GitHub
Official implementation of "Seg4Diff: Unveiling Open-Vocabulary Segmentation in Text-to-Image Diffusion Transformers" (NeurIPS 2025)
☆79Sep 23, 2025Updated 10 months ago
HolmesShuan / FireFlow-Fast-Inversion-of-Rectified-Flow-for-Image-Semantic-Editing
View on GitHub
[ICML2025] An 8-step inversion and 8-step editing process works effectively with the FLUX-dev model. (3x speedup with results that are co…
☆295May 1, 2025Updated last year
wyf0912 / AREdit
View on GitHub
Training-Free Text-Guided Image Editing Using Visual Autoregressive Model
☆76Apr 15, 2025Updated last year
vTAD2025-Challenge / vTAD
View on GitHub
☆17Oct 24, 2025Updated 9 months ago
Xiaohui9607 / LLM_layout_generator
View on GitHub
LLM as Layout generator designed for improving compositional ability of stable diffusion models
☆17Dec 4, 2023Updated 2 years ago
yanghan-yh / MCA-Ctrl
View on GitHub
CVPR2025-Multi-party Collaborative Attention Control for Image Customization
☆17May 14, 2025Updated last year
ethanhe42 / dds
View on GitHub
DDS: Delta Denoising Score PyTorch implementation
☆19Sep 2, 2023Updated 2 years ago
YuchuanTian / U-REPA
View on GitHub
[NeurIPS 2025] U-REPA: Aligning Diffusion U-Nets to ViTs
☆38Dec 15, 2025Updated 7 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
GunwooHan / E-LatentLPIPS
View on GitHub
Unofficial Implementation of E-LatentLPIPS(Ensembled-LatentLPIPS) of Diffusion2GAN
☆42Jul 11, 2024Updated 2 years ago
wlaud1001 / ReFlex
View on GitHub
[ICCV 2025, Highlight] Official Pytorch implementation of the paper: "ReFlex: Text-Guided Editing of Real Images in Rectified Flow via Mi…
☆39Aug 1, 2025Updated 11 months ago
alex4727 / MotionStream
View on GitHub
MotionStream: Real-Time Video Generation with Interactive Motion Controls
☆574Mar 1, 2026Updated 4 months ago
SNU-VGILab / TLC-Calib
View on GitHub
[RA-L'26] Official implementation of Targetless LiDAR-Camera Calibration with Neural Gaussian Splatting
☆25May 30, 2026Updated last month
martian422 / MaskGRPO
View on GitHub
The official implementation of MaskGRPO: Consolidating Reinforcement Learning for Multimodal Discrete Diffusion Models. (ICLR 2026, arxiv…
☆19Jan 27, 2026Updated 5 months ago
carpedkm / CustoMDiT
View on GitHub
PexelsCustom-1M: A Comprehensive Ecosystem for Open-Domain Customized Video Generation
☆19Jun 30, 2026Updated 3 weeks ago
YRIKKA / ComfyUI-InferenceTimeScaling
View on GitHub
This extension provides inference-time optimization techniques to enhance diffusion-based image generation quality through random search …
☆23Feb 27, 2025Updated last year