EzioBy / Ditto
[Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
☆566Updated 3 months ago
Alternatives and similar repositories for Ditto
Users who are interested in Ditto are comparing it to the repositories listed below.
- We achieve high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additi…☆322Updated 5 months ago
- Pusa: Thousands Timesteps Video Diffusion Model☆672Updated this week
- [ICLR 2026] ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation☆668Updated 2 months ago
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆271Updated 8 months ago
- Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives☆627Updated 2 months ago
- Official implementation for "DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion".☆337Updated 2 months ago
- SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation☆574Updated last month
- iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation☆185Updated 2 months ago
- Calligrapher: Freestyle Text Image Customization☆295Updated 5 months ago
- [SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios"☆344Updated 3 months ago
- Community trainer for Lightricks' LTX Video model 🎬 ⚡️☆401Updated last month
- OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer☆212Updated 2 weeks ago
- Mobius: Text to Seamless Looping Video Generation via Latent Shift☆172Updated 9 months ago
- We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while a…☆434Updated last month
- [SIGGRAPH Asia 25] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off☆333Updated 3 months ago
- The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data."☆425Updated 8 months ago
- Official Implementation of DRA-Ctrl (Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis)☆118Updated 5 months ago
- [SIGGRAPH 2025] Official code of the paper "Cobra: Efficient Line Art COlorization with BRoAder References".☆242Updated 2 months ago
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models☆153Updated 4 months ago
- Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.☆725Updated last month
- High-Quality Text-to-Video Generation with Alpha Channel☆329Updated last month
- ☆227Updated 6 months ago
- MotionStream: Real-Time Video Generation with Interactive Motion Controls☆497Updated this week
- ObjectClear: Complete Object Removal via Object-Effect Attention☆532Updated 2 months ago
- Lynx: Towards High-Fidelity Personalized Video Generation☆308Updated 4 months ago
- The official code of Yume☆607Updated 3 weeks ago
- ☆328Updated 4 months ago
- Official implementation of ATI: Any Trajectory Instruction for Controllable Video Generation. https://arxiv.org/pdf/2505.22944☆336Updated 6 months ago
- SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization (ICCV 2025)☆152Updated 3 months ago
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆672Updated 3 months ago