Collection of scripts to build small-scale datasets for fine-tuning video generation models.
☆80Mar 17, 2025Updated last year
Alternatives and similar repositories for video-dataset-scripts
Users that are interested in video-dataset-scripts are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] The official repository of "Sekai: A Video Dataset towards World Exploration"☆270Dec 31, 2025Updated 2 months ago
- ☆81Mar 2, 2025Updated last year
- [NOTE] I do not have enough ressources to maintain VMS, please use Ostris's AI-Tookit instead☆43Oct 3, 2025Updated 5 months ago
- Make self forcing endless. Add cache purging. Add prompt controllability.☆70Sep 9, 2025Updated 6 months ago
- Advanced CLI diffusion inference/training suite based on Musubi Tuner☆40Mar 4, 2026Updated 2 weeks ago
- Scalable and memory-optimized training of diffusion models☆1,344Jun 4, 2025Updated 9 months ago
- SkyReels-V2 with batch mode, video input (extend existing videos), and multiple prompts.☆17May 5, 2025Updated 10 months ago
- Unofficial extension implementation of CausVid☆75Apr 28, 2025Updated 10 months ago
- Official implementation of "Towards One-Step Causal Video Generation via Adversarial Self-Distillation" (arXiv 2025). A novel framework f…☆25Nov 4, 2025Updated 4 months ago
- Fine-tune of Florence-2 for shot categorization.☆26Mar 6, 2025Updated last year
- SketchColour receives colored first frame and entire scene in sketch format, then colors each frame based on the reference. Evaluated on …☆31Jul 9, 2025Updated 8 months ago
- 📹 A more flexible framework that can generate videos at any resolution and creates videos from images.☆1,963Updated this week
- official code of Efficient Depth-Guided Urban View Synthesis☆14Dec 24, 2024Updated last year
- [NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory☆344Feb 21, 2026Updated last month
- Gradio UI for training video models using finetrainers☆33Apr 18, 2025Updated 11 months ago
- Unofficial Implementation of "Stable Video Diffusion Multi-View"☆79Apr 15, 2024Updated last year
- Lora traing script for Lightricks LTX-video☆70Feb 12, 2025Updated last year
- Code for the ICCV2021 paper "Personalized Image Semantic Segmentation"☆15Sep 9, 2021Updated 4 years ago
- ☆18Oct 24, 2024Updated last year
- [CVPR 2026] Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation☆294Dec 15, 2025Updated 3 months ago
- ☆30Aug 21, 2024Updated last year
- [ICLR 2026] Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.☆154Updated this week
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆184Jul 21, 2025Updated 8 months ago
- PaperBot: Learning to Design Real-World Tools Using Paper☆13Mar 15, 2024Updated 2 years ago
- This repository shows how to use Q8 kernels with `diffusers` to optimize inference of LTX-Video on ADA GPUs.☆25Jan 7, 2025Updated last year
- ObjCtrl-2.5D☆58Apr 2, 2025Updated 11 months ago
- ☆14Jun 25, 2025Updated 8 months ago
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆284Jun 10, 2025Updated 9 months ago
- A minimalistic, hackable code base to finetune Wan video generation model☆51Feb 22, 2026Updated last month
- Simple Controlnet module for CogvideoX model.☆181Jan 12, 2025Updated last year
- ☆19Jun 17, 2025Updated 9 months ago
- Code for full fintuing Mochi model with FSDP (and CP)☆30Jul 15, 2025Updated 8 months ago
- [SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control☆812Jun 9, 2025Updated 9 months ago
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆1,532Mar 13, 2026Updated last week
- Official Code of "Distribution Matching Distillation Meets Reinforcement Learning"☆199Feb 1, 2026Updated last month
- A pipeline parallel training script for diffusion models.☆1,889Feb 8, 2026Updated last month
- Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset☆106Feb 25, 2026Updated 3 weeks ago
- Official code for the paper: Can3Tok (ICCV2025)☆39Aug 23, 2025Updated 6 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆44Aug 9, 2025Updated 7 months ago