Collection of scripts to build small-scale datasets for fine-tuning video generation models.
☆78Mar 17, 2025Updated 11 months ago
Alternatives and similar repositories for video-dataset-scripts
Users who are interested in video-dataset-scripts are comparing it to the repositories listed below.
- ☆81Mar 2, 2025Updated 11 months ago
- [NeurIPS 2025] The official repository of "Sekai: A Video Dataset towards World Exploration"☆261Dec 31, 2025Updated 2 months ago
- [NOTE] I do not have enough resources to maintain VMS; please use Ostris's AI-Toolkit instead☆43Oct 3, 2025Updated 4 months ago
- Advanced CLI diffusion inference/training suite based on Musubi Tuner☆40Updated this week
- SkyReels-V2 with batch mode, video input (extend existing videos), and multiple prompts.☆17May 5, 2025Updated 9 months ago
- Unofficial extension implementation of CausVid☆74Apr 28, 2025Updated 10 months ago
- LoRA training script for Lightricks LTX-video☆70Feb 12, 2025Updated last year
- Scalable and memory-optimized training of diffusion models☆1,338Jun 4, 2025Updated 8 months ago
- Gradio UI for training video models using finetrainers☆33Apr 18, 2025Updated 10 months ago
- Fine-tune of Florence-2 for shot categorization.☆26Mar 6, 2025Updated 11 months ago
- A minimalistic, hackable code base to finetune Wan video generation model☆51Feb 22, 2026Updated last week
- Make self forcing endless. Add cache purging. Add prompt controllability.☆69Sep 9, 2025Updated 5 months ago
- Simple Controlnet module for CogvideoX model.☆182Jan 12, 2025Updated last year
- Official Code of "Distribution Matching Distillation Meets Reinforcement Learning"☆181Feb 1, 2026Updated last month
- Code for full fine-tuning of the Mochi model with FSDP (and CP)☆30Jul 15, 2025Updated 7 months ago
- [ICLR 2026] Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.☆152Jan 27, 2026Updated last month
- [NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory☆338Feb 21, 2026Updated last week
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆183Jul 21, 2025Updated 7 months ago
- 📹 A more flexible framework that can generate videos at any resolution and creates videos from images.☆1,912Updated this week
- Official code of Efficient Depth-Guided Urban View Synthesis☆14Dec 24, 2024Updated last year
- Official implementation of "Towards One-Step Causal Video Generation via Adversarial Self-Distillation" (arXiv 2025). A novel framework f…☆25Nov 4, 2025Updated 3 months ago
- PaperBot: Learning to Design Real-World Tools Using Paper☆13Mar 15, 2024Updated last year
- ☆30Aug 21, 2024Updated last year
- Unofficial Implementation of "Stable Video Diffusion Multi-View"☆79Apr 15, 2024Updated last year
- [CVPR 2026] Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation☆281Dec 15, 2025Updated 2 months ago
- Python package for rendering 3D scenes and animations using blender.☆211Aug 24, 2025Updated 6 months ago
- Official code for the paper: Can3Tok (ICCV2025)☆39Aug 23, 2025Updated 6 months ago
- ObjCtrl-2.5D☆58Apr 2, 2025Updated 10 months ago
- [ICCV 2025] Pytorch implementation of "VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Pr…☆49Jul 28, 2025Updated 7 months ago
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆284Jun 10, 2025Updated 8 months ago
- Controlnet module for Wan2.1☆30Aug 4, 2025Updated 6 months ago
- Code for the ICCV2021 paper "Personalized Image Semantic Segmentation"☆15Sep 9, 2021Updated 4 years ago
- SketchColour receives colored first frame and entire scene in sketch format, then colors each frame based on the reference. Evaluated on …☆31Jul 9, 2025Updated 7 months ago
- [SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control☆807Jun 9, 2025Updated 8 months ago
- ☆60Oct 3, 2025Updated 4 months ago
- A pipeline parallel training script for diffusion models.☆1,861Feb 8, 2026Updated 3 weeks ago
- Keyframe Interpolation with CogvideoX☆139Oct 31, 2024Updated last year
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆198Jan 7, 2026Updated last month
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆1,485Updated this week