Official PyTorch implementation of paper MAVIN: Multi-Action Video Generation with Diffusion Models via Transition Video Infilling
☆13Oct 5, 2024Updated last year
Alternatives and similar repositories for MAVIN
Users that are interested in MAVIN are comparing it to the libraries listed below
Sorting:
- Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation".☆18Jan 30, 2024Updated 2 years ago
- ☆66Jun 4, 2024Updated last year
- [NeurIPS 2024] Official Implementation of GrounDiT☆59Dec 12, 2024Updated last year
- One-Shot Learning for Pose-Guided Person Image Synthesis in the Wild☆21Apr 6, 2025Updated 11 months ago
- Public code release for the paper "ProCreate, Don’t Reproduce! Propulsive Energy Diffusion for Creative Generation"☆41Nov 30, 2025Updated 3 months ago
- Mobius: Text to Seamless Looping Video Generation via Latent Shift☆174May 8, 2025Updated 9 months ago
- [WWW 2025] Official PyTorch Code for "CTR-Driven Advertising Image Generation with Multimodal Large Language Models"☆62Aug 3, 2025Updated 7 months ago
- [MM'22 Oral] AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation☆11Apr 3, 2023Updated 2 years ago
- [arXiv'25] AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidance☆41Feb 19, 2025Updated last year
- [AAAI'25] Official implementation of Image Conductor: Precision Control for Interactive Video Synthesis☆101Jul 18, 2024Updated last year
- Codebase for the paper HawkI: HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View☆13Jun 5, 2024Updated last year
- ☆11Jul 19, 2022Updated 3 years ago
- ☆11Nov 30, 2025Updated 3 months ago
- [AAAI 2025] CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities☆52Jan 12, 2025Updated last year
- Blending Custom Photos with Video Diffusion Transformers☆48Jan 21, 2025Updated last year
- ☆11Sep 13, 2024Updated last year
- ☆11Aug 27, 2024Updated last year
- Corresponding code to "FACESEC: A Fine-grained Robustness Evaluation Framework for Face Recognition Systems" @ CVPR 2021☆13Jun 22, 2021Updated 4 years ago
- ☆14Updated this week
- Code, Resources - Personal project - Llama Paper Summary - October 14, 2024.☆11Oct 15, 2024Updated last year
- ☆13Nov 14, 2023Updated 2 years ago
- ☆11Nov 9, 2023Updated 2 years ago
- Official implementation for the AAAI2025 paper "PIXELS - Progressive Image Xemplar-based Editing with Latent Surgery"☆11Dec 17, 2024Updated last year
- ☆15Sep 23, 2024Updated last year
- Official repository for Polarity Sampling, CVPR 2022 ORAL☆13Jul 25, 2022Updated 3 years ago
- Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion☆47Feb 21, 2025Updated last year
- ☆24Dec 13, 2025Updated 2 months ago
- ☆12Jan 25, 2024Updated 2 years ago
- ☆12May 31, 2024Updated last year
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark.☆16Feb 15, 2025Updated last year
- Official implementation for "Inversion-by-Inversion: Exemplar-based Sketch-to-Photo Synthesis via Stochastic Differential Equations witho…☆12Aug 19, 2023Updated 2 years ago
- ☆16Dec 23, 2023Updated 2 years ago
- Awesome Controllable Video Generation with Diffusion Models☆59Jul 22, 2025Updated 7 months ago
- ☆12Mar 15, 2019Updated 6 years ago
- ☆14Jun 2, 2023Updated 2 years ago
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder☆57Jan 24, 2025Updated last year
- [ICCV 2025] MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers☆128Jun 26, 2025Updated 8 months ago
- [Unofficial Implementation] Subject-driven Video Generation via Disentangled Identity and Motion☆58Jan 5, 2026Updated 2 months ago
- I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models☆215Dec 30, 2023Updated 2 years ago