[Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
☆570 · Oct 29, 2025 · Updated 4 months ago
Alternatives and similar repositories for Ditto
Users who are interested in Ditto compare it to the repositories listed below.
- Official Implementation of ReCo: Region-Constraint In-Context Generation for Instructional Video Editing · ☆144 · Feb 26, 2026 · Updated last week
- [AAAI 2026] UltraGen · ☆77 · Feb 1, 2026 · Updated last month
- [CVPR 2026] Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives · ☆632 · Nov 26, 2025 · Updated 3 months ago
- [CVPR 2026] Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation · ☆284 · Dec 15, 2025 · Updated 2 months ago
- DreamStyle: A Unified Framework for Video Stylization · ☆109 · Jan 7, 2026 · Updated last month
- [CVPR 2026] 👋 Dataset and Benchmark code for EgoEdit · ☆107 · Feb 21, 2026 · Updated last week
- [ECCV 2024] 3DPE: Real-time 3D-aware Portrait Editing from a Single Image · ☆22 · Sep 15, 2025 · Updated 5 months ago
- Official Implementation of DRA-Ctrl (Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis) · ☆118 · Aug 15, 2025 · Updated 6 months ago
- Krea Realtime 14B. An open-source realtime AI video model. · ☆497 · Nov 13, 2025 · Updated 3 months ago
- [CVPR'25 Highlight] Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis · ☆158 · Apr 15, 2025 · Updated 10 months ago
- [ICCV 2025] Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping · ☆90 · Nov 30, 2025 · Updated 3 months ago
- Animate Any Character in Any World · ☆90 · Jan 9, 2026 · Updated last month
- ☆130 · Dec 19, 2025 · Updated 2 months ago
- [ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing · ☆3,656 · Oct 17, 2025 · Updated 4 months ago
- A unified inference and post-training framework for accelerated video generation. · ☆3,111 · Updated this week
- Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO · ☆92 · Dec 1, 2025 · Updated 3 months ago
- ☆721 · Nov 7, 2025 · Updated 3 months ago
- MoCha: End-to-End Video Character Replacement without Structural Guidance · ☆649 · Jan 14, 2026 · Updated last month
- A Unified Visual Generator with Interleaved OmniModal Context · ☆192 · Feb 10, 2026 · Updated 3 weeks ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation · ☆30 · Dec 22, 2025 · Updated 2 months ago
- MUG-V 10B: High-efficiency Training Pipeline for Large Video Generation Models · ☆93 · Dec 8, 2025 · Updated 2 months ago
- [ICLR 2026] Official Repo For "BindWeave: Subject-Consistent Video Generation via Cross-Modal Integration" · ☆363 · Jan 28, 2026 · Updated last month
- [CVPR 2026] Code Release of MVInverse: Feedforward Multi-view Inverse Rendering in Seconds · ☆137 · Jan 22, 2026 · Updated last month
- [NeurIPS 2025 Oral] Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation · ☆727 · Nov 27, 2025 · Updated 3 months ago
- Official code for StoryMem: Multi-shot Long Video Storytelling with Memory · ☆655 · Jan 22, 2026 · Updated last month
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing · ☆38 · Jan 9, 2026 · Updated last month
- PICABench: How Far Are We from Physically Realistic Image Editing? · ☆36 · Nov 5, 2025 · Updated 4 months ago
- DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer · ☆546 · Jan 13, 2026 · Updated last month
- 📹 A more flexible framework that can generate videos at any resolution and creates videos from images. · ☆1,929 · Updated this week
- The official code of Yume · ☆621 · Jan 14, 2026 · Updated last month
- [ICLR'25] Official PyTorch implementation of "Framer: Interactive Frame Interpolation". · ☆502 · Jan 9, 2025 · Updated last year
- Official implementation for "DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion". · ☆340 · Feb 24, 2026 · Updated last week
- ☆14 · Jul 5, 2024 · Updated last year
- ☆56 · Dec 8, 2025 · Updated 2 months ago
- [SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control · ☆807 · Jun 9, 2025 · Updated 8 months ago
- SkyReels-A2: Compose anything in video diffusion transformers · ☆704 · Jun 3, 2025 · Updated 9 months ago
- [ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos · ☆438 · Feb 11, 2026 · Updated 3 weeks ago
- ☆85 · Oct 10, 2025 · Updated 4 months ago
- Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward · ☆60 · Nov 27, 2025 · Updated 3 months ago