MUG-V 10B: High-efficiency Training Pipeline for Large Video Generation Models
☆93Dec 8, 2025Updated 3 months ago
Alternatives and similar repositories for MUG-V
Users that are interested in MUG-V are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official training code for MUG-V 10B video generation model. Built on Megatron-LM (v0.14.0) with production-ready distributed training fo…☆19Oct 20, 2025Updated 5 months ago
- Official code base for "Long-Tailed Diffusion Models With Oriented Calibration" ICLR2024☆16Jul 11, 2024Updated last year
- HDM model loader for ComfyUI☆41Dec 14, 2025Updated 3 months ago
- Official code for CVPR 2024 paper, "Audio-Visual Segmentation via Unlabeled Frame Exploitation""☆18Jul 7, 2024Updated last year
- A simple script to see how my ideas evolve over time☆44Jun 4, 2025Updated 9 months ago
- [NOTE] I do not have enough ressources to maintain VMS, please use Ostris's AI-Tookit instead☆43Oct 3, 2025Updated 5 months ago
- ComfyUI extension for mixing model during sampling☆30Oct 5, 2025Updated 5 months ago
- [AAAI 2026 Oral] SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation☆37Nov 24, 2025Updated 4 months ago
- [CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation☆18May 2, 2025Updated 10 months ago
- This is the official code of "Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation, NeurIPS 23"☆26Dec 7, 2023Updated 2 years ago
- DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder☆183Oct 5, 2025Updated 5 months ago
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation☆23Jul 30, 2025Updated 7 months ago
- Complementary Patch for Weakly Supervised Semantic Segmentation, ICCV21 (poster)☆24Nov 8, 2021Updated 4 years ago
- [NeurIPS 2023 Spotlight] Combating Representation Learning Disparity with Geometric Harmonization☆24May 14, 2025Updated 10 months ago
- Developer project for getting basic API integrations working in under 5 minutes☆11Jan 30, 2026Updated last month
- LITEN: Learning from Inference Time Execution for VLAs☆27Oct 23, 2025Updated 5 months ago
- Nodes to level up your workflows performance and streamline specific functions.☆10Aug 19, 2025Updated 7 months ago
- ☆173Oct 27, 2025Updated 4 months ago
- ☆24May 22, 2024Updated last year
- Video Diffusion Transformers are In-Context Learners☆35Jan 6, 2025Updated last year
- [ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization☆39Jul 29, 2022Updated 3 years ago
- DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space☆359Oct 5, 2025Updated 5 months ago
- ☆34Nov 21, 2025Updated 4 months ago
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…☆37Apr 13, 2022Updated 3 years ago
- Deforum extension script for AUTOMATIC1111's Stable Diffusion webui☆23May 26, 2024Updated last year
- [AAAI-26] Are We on the Right Way for Assessing Document Retrieval-Augmented Generation?☆26Dec 14, 2025Updated 3 months ago
- Allows to sample without generating any uncond with Stable Diffusion!☆51Jul 10, 2024Updated last year
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆70May 18, 2025Updated 10 months ago
- Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback☆248Jan 24, 2026Updated 2 months ago
- Implements a minimalistic version of Stable Cascade training☆13Oct 24, 2024Updated last year
- toolkit for WakenLLM framework☆47Dec 29, 2025Updated 2 months ago
- Pytorch implementation of Self-Refining Video Sampling☆153Feb 6, 2026Updated last month
- code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…☆13Nov 17, 2024Updated last year
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026]☆76Mar 3, 2026Updated 3 weeks ago
- A small set of unique adapters meant to bridge the dual_stream_shunt trained for guiding prompt embeddings and diffusion.☆14Nov 26, 2025Updated 3 months ago
- Code for paper "SPG Sandwiched Policy Gradient for Masked Diffusion Language Models"☆52Oct 29, 2025Updated 4 months ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆42Feb 12, 2025Updated last year
- egocentric humanoid manipulation benchmark☆60Dec 4, 2025Updated 3 months ago
- ☆15Apr 28, 2023Updated 2 years ago