instantX-research / InstantUnify
InstantUnify: Integrates Multimodal LLM into Diffusion Models 🔥
☆39Updated 9 months ago
Alternatives and similar repositories for InstantUnify
Users that are interested in InstantUnify are comparing it to the libraries listed below
Sorting:
- [ECCV 2024] HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance☆46Updated 7 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆59Updated 2 months ago
- RepText: Rendering Visual Text via Replicating 🔥☆68Updated 2 weeks ago
- Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR2025 (Oral)☆42Updated last month
- ☆95Updated 8 months ago
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)☆114Updated 9 months ago
- ☆53Updated last year
- Consistency Distillation with Target Timestep Selection and Decoupled Guidance☆78Updated 4 months ago
- [ICLR 2024] Code for FreeNoise based on AnimateDiff☆107Updated last year
- an unofficial implementation of dreamtuner☆24Updated last year
- Conceptrol: Concept Control of Zero-shot Personalized Image Generation☆38Updated last month
- ☆67Updated last week
- Blending Custom Photos with Video Diffusion Transformers☆46Updated 3 months ago
- ☆34Updated 3 months ago
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation☆57Updated 3 weeks ago
- 🔥 [CVPR 2024] The official repo for Zero-Painter!☆67Updated 11 months ago
- Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations"☆76Updated last month
- ☆19Updated 3 weeks ago
- More suitable IP-Adapter for the DiT architecture☆29Updated 10 months ago
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Models☆121Updated 2 months ago
- PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation☆32Updated 6 months ago
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers☆116Updated 4 months ago
- Official code of "LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer"☆49Updated last month
- AAAI 2025: Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation☆38Updated 11 months ago
- ☆83Updated 8 months ago
- [ARXIV'24] StyleMaster: Stylize Your Video with Artistic Generation and Translation☆107Updated last month
- ☆25Updated 10 months ago
- ☆86Updated 7 months ago
- Subjects200K dataset☆110Updated 3 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆41Updated last week