EchoPluto / MagicID
☆32Updated 7 months ago
Alternatives and similar repositories for MagicID
Users that are interested in MagicID are comparing it to the libraries listed below
- MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance☆26Updated 10 months ago
- ☆50Updated 3 weeks ago
- Blending Custom Photos with Video Diffusion Transformers☆48Updated 9 months ago
- ☆20Updated last year
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Model (Arxiv 2025)☆34Updated 3 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Updated 2 weeks ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆69Updated 3 months ago
- Official pytorch implementation for SingleInsert☆27Updated last year
- [ECCV 2024] HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance☆51Updated last year
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024]☆21Updated last year
- AAAI 2025: Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation☆44Updated last year
- Code for full fine-tuning of the Mochi model with FSDP (and CP)☆31Updated 3 months ago
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆25Updated 6 months ago
- ☆29Updated 7 months ago
- [ACM MM24] Official implementation of ACM MM 2024 paper: "ZePo: Zero-Shot Portrait Stylization with Faster Sampling"☆42Updated last year
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'☆33Updated last year
- Video Diffusion Transformers are In-Context Learners☆33Updated 9 months ago
- [NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation"☆20Updated 4 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆62Updated 5 months ago
- [NeurIPS 2024 D&B Track] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆72Updated 9 months ago
- This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation☆45Updated 6 months ago
- [ECCV 2024] IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation☆55Updated last year
- [NeurIPS 2025] VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping☆49Updated last week
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆66Updated 5 months ago
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.☆15Updated last year
- ☆51Updated 10 months ago
- [ICCV2025] Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".☆60Updated 3 months ago
- ☆66Updated last year
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆84Updated last year
- InstantUnify: Integrates Multimodal LLM into Diffusion Models 🔥☆40Updated last year