Code for full fintuing Mochi model with FSDP (and CP)
☆30Jul 15, 2025Updated 7 months ago
Alternatives and similar repositories for Mochi-Full-Finetuner
Users that are interested in Mochi-Full-Finetuner are comparing it to the libraries listed below
Sorting:
- Pusa: Thousands Timesteps Video Diffusion Model☆672Feb 13, 2026Updated 3 weeks ago
- Unofficial implementation for Sigmoid Loss for Language Image Pre-Training☆11Sep 26, 2023Updated 2 years ago
- Unofficial extension implementation of CausVid☆74Apr 28, 2025Updated 10 months ago
- Formal implementation of Robust Domain Misinformation Detection via Multi-modal Feature Alignment☆12Dec 8, 2023Updated 2 years ago
- ☆14Feb 20, 2024Updated 2 years ago
- A minimalistic, hackable code base to finetune Wan video generation model☆51Feb 22, 2026Updated 2 weeks ago
- ☆89Mar 11, 2025Updated 11 months ago
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'☆35Jan 2, 2026Updated 2 months ago
- Official Pytorch implementation of the paper Learning Input-agnostic Manipulation Directions in StyleGAN with Text Guidance (accepted to …☆28May 13, 2023Updated 2 years ago
- [ICLR 2025, AAAI 2026] official implementation of "Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generati…☆34Jan 26, 2026Updated last month
- Collection of scripts to build small-scale datasets for fine-tuning video generation models.☆80Mar 17, 2025Updated 11 months ago
- DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder☆179Oct 5, 2025Updated 5 months ago
- Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance☆47Jun 2, 2025Updated 9 months ago
- This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation.☆116Nov 26, 2024Updated last year
- ☆52Jan 6, 2026Updated 2 months ago
- The offical code implementation of paper "Interpretable Multimodal Misinformation Detection with Logic Reasoning", accepted by Finding of…☆31Feb 5, 2026Updated last month
- ☆81Mar 2, 2025Updated last year
- Writing FLUX in Triton☆42Sep 22, 2024Updated last year
- musubi-tuner modified to tune image2video/video infilling☆33Jan 30, 2025Updated last year
- [AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆383Mar 26, 2025Updated 11 months ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆183Jul 21, 2025Updated 7 months ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆85May 4, 2025Updated 10 months ago
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151☆90May 12, 2025Updated 9 months ago
- [NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL☆2,045Nov 4, 2025Updated 4 months ago
- [AAAI 2025] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting with…☆164Aug 26, 2025Updated 6 months ago
- Promptopia is an open-source AI prompting tool for modern world to discover, create, and share creative prompts☆12May 27, 2023Updated 2 years ago
- ☆10Apr 12, 2025Updated 10 months ago
- CVPR 2023: PAniC-3D, Vtubers dataset downloader☆13Apr 22, 2023Updated 2 years ago
- The official implementation of "2025ICLR Dynamic Diffusion Transformer" and "2025ArXivDyDiT++: Dynamic Diffusion Transformers for Efficie…☆47Apr 10, 2025Updated 11 months ago
- ☆17Nov 18, 2025Updated 3 months ago
- Towards Photorealistic 4D Scene Generation via Video Diffusion Models☆20Jun 12, 2024Updated last year
- ☆20Sep 5, 2025Updated 6 months ago
- Code of RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images☆93Nov 9, 2024Updated last year
- The `onnx` Python library (not `onnxruntime`, to be clear) running in the browser using Pyodide.☆12Oct 12, 2023Updated 2 years ago
- This project is based on the [LTX-Video](https://github.com/Lightricks/LTX-Video) algorithm of the diffusers and optimized and accelerate…☆13Dec 31, 2024Updated last year
- Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model☆1,281Jun 8, 2025Updated 9 months ago
- [NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving☆275Aug 4, 2025Updated 7 months ago
- ☆191Jan 14, 2025Updated last year
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation☆567Sep 16, 2024Updated last year