Yaofang-Liu / Mochi-Full-FinetunerLinks
Code for full fintuing Mochi model with FSDP (and CP)
☆31Updated 3 months ago
Alternatives and similar repositories for Mochi-Full-Finetuner
Users that are interested in Mochi-Full-Finetuner are comparing it to the libraries listed below
Sorting:
- This respository contains the code for the NeurIPS 2024 paper SF-V: Single Forward Video Generation Model.☆99Updated 10 months ago
- ☆63Updated last year
- [ECCV 2024] HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance☆50Updated last year
- [WACV 2025] MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning☆95Updated 6 months ago
- Blending Custom Photos with Video Diffusion Transformers☆48Updated 8 months ago
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆93Updated last week
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'☆33Updated last year
- ☆66Updated last year
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆62Updated 5 months ago
- [ACM MM24] Official implementation of ACM MM 2024 paper: "ZePo: Zero-Shot Portrait Stylization with Faster Sampling"☆42Updated last year
- Unofficial extension implementation of CausVid☆59Updated 5 months ago
- [ICLR 2024] Code for FreeNoise based on AnimateDiff☆106Updated last year
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Models☆129Updated 7 months ago
- ☆49Updated 2 weeks ago
- [NeurIPS'2024] Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps☆100Updated last year
- ☆86Updated last year
- [ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease".☆107Updated 5 months ago
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆84Updated last year
- [ICCV 2025] Edicho: Consistent Image Editing in the Wild☆119Updated 9 months ago
- [ECCV 2024] Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models☆86Updated last year
- Collection of scripts to build small-scale datasets for fine-tuning video generation models.☆68Updated 6 months ago
- AAAI 2025: Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation☆44Updated last year
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆157Updated 3 months ago
- Trying to implement https://arxiv.org/abs/2305.08891☆33Updated 2 years ago
- ☆32Updated 6 months ago
- [ICCV 2025] MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers☆124Updated 3 months ago
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation☆71Updated last month
- Official code of "LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer"☆68Updated 6 months ago
- [arXiv 2025] Upsample What Matters: Region-Adaptive Latent Sampling for Accelerated Diffusion Transformers☆47Updated 2 months ago
- ☆103Updated last month