Blending Custom Photos with Video Diffusion Transformers
☆48Jan 21, 2025Updated last year
Alternatives and similar repositories for Ingredients
Users that are interested in Ingredients are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers☆128Jun 26, 2025Updated 8 months ago
- Video Diffusion Transformers are In-Context Learners☆35Jan 6, 2025Updated last year
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation☆78Aug 20, 2025Updated 6 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆66May 7, 2025Updated 9 months ago
- [CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition☆828Aug 30, 2025Updated 6 months ago
- ☆34Mar 18, 2025Updated 11 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Oct 9, 2025Updated 4 months ago
- ☆21Feb 2, 2026Updated last month
- ☆22Mar 7, 2025Updated 11 months ago
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"☆31Feb 22, 2026Updated last week
- A repo for generating random NFTs with metadata 100% on chain!☆37Mar 8, 2024Updated last year
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM☆72Jul 16, 2025Updated 7 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆72Jul 13, 2025Updated 7 months ago
- Towards Photorealistic 4D Scene Generation via Video Diffusion Models☆20Jun 12, 2024Updated last year
- ☆13Feb 2, 2025Updated last year
- [NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark☆281Nov 5, 2025Updated 3 months ago
- ☆41Jan 10, 2025Updated last year
- Create your own 3D scene with words anywhere.☆29Updated this week
- ☆11Nov 30, 2025Updated 3 months ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement (ICLR2026)☆287Updated this week
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- [ICCV 2025] CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation☆125Aug 6, 2025Updated 6 months ago
- ☆53Dec 20, 2024Updated last year
- Official implementation of "EG4D: Explicit Generation of 4D Object without Score Distillation" (ICLR 2025)☆36Feb 14, 2025Updated last year
- [ICLR26] GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning☆103Jan 27, 2026Updated last month
- ☆56Apr 30, 2024Updated last year
- [SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios"☆344Oct 30, 2025Updated 4 months ago
- [CVPR 2025 Highlight] X-Dyna: Expressive Dynamic Human Image Animation☆261Jan 30, 2025Updated last year
- ☆18Jun 14, 2025Updated 8 months ago
- ☆13Jul 10, 2024Updated last year
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long…☆321Mar 30, 2025Updated 11 months ago
- The official implementation of ”RepVideo: Rethinking Cross-Layer Representation for Video Generation“☆124Jan 25, 2025Updated last year
- [ICCV 2025] FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing☆75Sep 3, 2025Updated 5 months ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆85May 4, 2025Updated 9 months ago
- [ICCV2025] UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization☆277May 1, 2025Updated 10 months ago
- ☆16Dec 30, 2022Updated 3 years ago
- MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance☆26Dec 12, 2024Updated last year
- [ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP☆16Apr 17, 2025Updated 10 months ago
- ☆52Jan 6, 2026Updated last month