instantX-research / InstantUnify
InstantUnify: Integrates Multimodal LLM into Diffusion Models 🔥
☆39 · Updated 8 months ago
Alternatives and similar repositories for InstantUnify:
Users who are interested in InstantUnify are comparing it to the repositories listed below.
- Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR2025 (Oral) ☆42 · Updated 2 weeks ago
- [ECCV 2024] HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance ☆46 · Updated 6 months ago
- Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations" ☆73 · Updated last week
- ☆53 · Updated 11 months ago
- ☆85 · Updated 7 months ago
- [ICLR 2024] Code for FreeNoise based on AnimateDiff ☆107 · Updated last year
- Blending Custom Photos with Video Diffusion Transformers ☆46 · Updated 3 months ago
- Official code of "LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer" ☆46 · Updated 3 weeks ago
- ☆38 · Updated 10 months ago
- ☆60 · Updated 9 months ago
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation" (CVPR 2024) ☆114 · Updated 9 months ago
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers ☆51 · Updated 6 months ago
- Official PyTorch Implementation of "Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generati… ☆26 · Updated last year
- Consistency Distillation with Target Timestep Selection and Decoupled Guidance ☆77 · Updated 3 months ago
- [ACM MM24] Official implementation of ACM MM 2024 paper: "ZePo: Zero-Shot Portrait Stylization with Faster Sampling" ☆40 · Updated 8 months ago
- Official implementation for "pOps: Photo-Inspired Diffusion Operators" ☆80 · Updated 9 months ago
- an unofficial implementation of dreamtuner ☆24 · Updated last year
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Models ☆120 · Updated last month
- MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024) ☆86 · Updated 3 months ago
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers ☆115 · Updated 3 months ago
- Official Implementation of PairCustomization SIGGRAPH Asia 2024 ☆97 · Updated 2 months ago
- ☆95 · Updated 7 months ago
- More suitable IP-Adapter for the DiT architecture ☆29 · Updated 9 months ago
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation ☆38 · Updated last year
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing ☆55 · Updated last month
- ☆24 · Updated 10 months ago
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation ☆50 · Updated this week
- ☆62 · Updated 10 months ago
- A retrain of AnimateDiff to be conditional on an init image ☆34 · Updated last year
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis ☆36 · Updated last month