instantX-research / InstantUnifyLinks
InstantUnify: Integrates Multimodal LLM into Diffusion Models π₯
β40Updated last year
Alternatives and similar repositories for InstantUnify
Users that are interested in InstantUnify are comparing it to the libraries listed below
Sorting:
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)β127Updated last year
- [ICCV 2025] MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformersβ127Updated 5 months ago
- Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations"β86Updated 6 months ago
- Conceptrol: Concept Control of Zero-shot Personalized Image Generationβ44Updated 8 months ago
- Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR2025 (Oral)β48Updated 7 months ago
- β55Updated last year
- an unofficial implementation of dreamtunerβ24Updated last year
- [ICCV 2025] Edicho: Consistent Image Editing in the Wildβ122Updated last month
- Consistency Distillation with Target Timestep Selection and Decoupled Guidanceβ100Updated 11 months ago
- Subjects200K datasetβ123Updated 10 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesisβ64Updated 7 months ago
- [ECCV 2024] HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidanceβ51Updated last year
- Blending Custom Photos with Video Diffusion Transformersβ48Updated 10 months ago
- β93Updated 4 months ago
- β52Updated 11 months ago
- MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024)β97Updated 10 months ago
- β91Updated last year
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editingβ163Updated 5 months ago
- Official Repo for Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generationβ30Updated last year
- β32Updated last year
- This respository contains the code for the NeurIPS 2024 paper SF-V: Single Forward Video Generation Model.β99Updated last year
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Modelsβ133Updated 9 months ago
- [ICCV 2025] FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editingβ69Updated 3 months ago
- MasterWeaver: Taming Editability and Face Identity for Personalized Text-to-Image Generation (ECCV 2024)β134Updated last year
- [ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease".β115Updated 7 months ago
- Official PyTorch Implementation of "Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generatiβ¦β27Updated last year
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformersβ64Updated last year
- RepText: Rendering Visual Text via Replicating π₯β139Updated 6 months ago
- β106Updated last year
- [ICLR 2024] Code for FreeNoise based on AnimateDiffβ108Updated last year