Blending Custom Photos with Video Diffusion Transformers
☆48 · Jan 21, 2025 · Updated last year
Alternatives and similar repositories for Ingredients
Users who are interested in Ingredients are comparing it to the libraries listed below.
- Video Diffusion Transformers are In-Context Learners ☆35 · Jan 6, 2025 · Updated last year
- [ICCV 2025] MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers ☆128 · Jun 26, 2025 · Updated 8 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis ☆66 · May 7, 2025 · Updated 10 months ago
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation ☆78 · Aug 20, 2025 · Updated 7 months ago
- [CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition ☆835 · Mar 8, 2026 · Updated 2 weeks ago
- ☆22 · Mar 7, 2025 · Updated last year
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method ☆27 · Oct 9, 2025 · Updated 5 months ago
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM ☆71 · Jul 16, 2025 · Updated 8 months ago
- [ICCV2025] UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization ☆276 · May 1, 2025 · Updated 10 months ago
- ☆32 · Feb 19, 2025 · Updated last year
- ☆34 · Mar 18, 2025 · Updated last year
- [ICCV 2025] CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation ☆125 · Aug 6, 2025 · Updated 7 months ago
- [NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark ☆292 · Nov 5, 2025 · Updated 4 months ago
- Improving Motion in Image-to-Video Models via Adaptive Low-Pass Guidance (CVPR 2026) ☆55 · Feb 23, 2026 · Updated 3 weeks ago
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long… ☆322 · Mar 30, 2025 · Updated 11 months ago
- Video Diffusion State Space Models ☆19 · Mar 27, 2024 · Updated last year
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing ☆72 · Jul 13, 2025 · Updated 8 months ago
- ☆41 · Jan 10, 2025 · Updated last year
- [CVPR 2025 Highlight] X-Dyna: Expressive Dynamic Human Image Animation ☆264 · Jan 30, 2025 · Updated last year
- [SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios" ☆344 · Oct 30, 2025 · Updated 4 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening ☆70 · May 18, 2025 · Updated 10 months ago
- Official code of "LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer" ☆90 · Apr 1, 2025 · Updated 11 months ago
- Towards Photorealistic 4D Scene Generation via Video Diffusion Models ☆20 · Jun 12, 2024 · Updated last year
- [ICLR26] GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning ☆105 · Jan 27, 2026 · Updated last month
- ☆52 · Jan 6, 2026 · Updated 2 months ago
- SkyReels-A2: Compose anything in video diffusion transformers ☆706 · Jun 3, 2025 · Updated 9 months ago
- ☆57 · Apr 30, 2024 · Updated last year
- Scalable and memory-optimized training of diffusion models ☆1,344 · Jun 4, 2025 · Updated 9 months ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement (ICLR2026) ☆291 · Mar 12, 2026 · Updated last week
- The official implementation of "RepVideo: Rethinking Cross-Layer Representation for Video Generation" ☆124 · Jan 25, 2025 · Updated last year
- [AAAI 2026] Personalize Anything for Free with Diffusion Transformer ☆357 · Mar 20, 2025 · Updated last year
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers ☆584 · Jun 5, 2025 · Updated 9 months ago
- ACM TOG 2026🎉 Official repository for "HeadRouter: A Training-free Image Editing Framework for MM-DiTs by Adaptively Routing Attention He… ☆53 · Mar 4, 2026 · Updated 2 weeks ago
- Create your own 3D scene with words anywhere. ☆34 · Updated this week
- [ICLR 2025] Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation ☆514 · Jun 17, 2025 · Updated 9 months ago
- ☆52 · Dec 20, 2024 · Updated last year
- ☆16 · Dec 30, 2022 · Updated 3 years ago
- Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024) ☆482 · Oct 18, 2024 · Updated last year
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model. ☆768 · Dec 5, 2024 · Updated last year