MUG-V 10B: High-efficiency Training Pipeline for Large Video Generation Models
☆93Dec 8, 2025Updated 2 months ago
Alternatives and similar repositories for MUG-V
Users that are interested in MUG-V are comparing it to the libraries listed below
Sorting:
- 一个强大的 多模态大语言模型(MLLM),支持 文本、图像、视频等多模态输入,具备强大的理解、推理和生成能力。☆23Mar 19, 2025Updated 11 months ago
- Official training code for MUG-V 10B video generation model. Built on Megatron-LM (v0.14.0) with production-ready distributed training fo…☆19Oct 20, 2025Updated 4 months ago
- Three.js -> TSL -> Raymarching Clouds -> Tornado☆38Nov 29, 2025Updated 3 months ago
- A simple script to see how my ideas evolve over time☆44Jun 4, 2025Updated 8 months ago
- Official code base for "Long-Tailed Diffusion Models With Oriented Calibration" ICLR2024☆15Jul 11, 2024Updated last year
- "AR-Omni: A Unified Autoregressive Model for Any-to-Any Generation"☆37Jan 27, 2026Updated last month
- [CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation☆18May 2, 2025Updated 10 months ago
- Conveniently control parts of text prompts with custom UI. Pack includes loaders from txt and csv files, dynamic text concatenation tool …☆26Sep 23, 2025Updated 5 months ago
- [ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization☆39Jul 29, 2022Updated 3 years ago
- Complementary Patch for Weakly Supervised Semantic Segmentation, ICCV21 (poster)☆24Nov 8, 2021Updated 4 years ago
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation☆23Jul 30, 2025Updated 7 months ago
- ComfyUI extension for mixing model during sampling☆30Oct 5, 2025Updated 4 months ago
- [NeurIPS 2023 Spotlight] Combating Representation Learning Disparity with Geometric Harmonization☆24May 14, 2025Updated 9 months ago
- Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback☆239Jan 24, 2026Updated last month
- Video Diffusion Transformers are In-Context Learners☆35Jan 6, 2025Updated last year
- ☆170Oct 27, 2025Updated 4 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆69May 18, 2025Updated 9 months ago
- fashionAI clothes keypoint detection☆21Jun 5, 2018Updated 7 years ago
- Minimal journaling CLI for developers. Git-backed, terminal-native, zero friction. Just type `journal` and start writing.☆72Oct 28, 2025Updated 4 months ago
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'☆35Jan 2, 2026Updated 2 months ago
- Cross Modal Retrieval with Querybank Normalisation☆57Nov 21, 2023Updated 2 years ago
- Custom nodes that bring Character.AI's Ovi video+audio generator to ComfyUI with streamlined setup, selectable precision, attention-backe…☆122Oct 16, 2025Updated 4 months ago
- DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder☆179Oct 5, 2025Updated 4 months ago
- ☆52Jan 6, 2026Updated last month
- [ACL 2025] The official pytorch implement of "MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection".☆26May 26, 2025Updated 9 months ago
- documentation used in my projects☆16Feb 24, 2026Updated last week
- ☆67Nov 27, 2025Updated 3 months ago
- ☆132Jun 24, 2025Updated 8 months ago
- [NOTE] I do not have enough ressources to maintain VMS, please use Ostris's AI-Tookit instead☆43Oct 3, 2025Updated 5 months ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆183Jul 21, 2025Updated 7 months ago
- ☆15Apr 28, 2023Updated 2 years ago
- DragonBall Online Client Development (Base: KR 0.50)☆10Jul 31, 2017Updated 8 years ago
- A ComfyUI extension for OmniGen2☆48Jul 1, 2025Updated 8 months ago
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…☆37Apr 13, 2022Updated 3 years ago
- Pytorch implementation of Self-Refining Video Sampling☆146Feb 6, 2026Updated 3 weeks ago
- MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head (ICLR 2026)☆125Feb 6, 2026Updated 3 weeks ago
- Bloom image post processing effect for ComfyUI. Soft and fast Gaussian Blur bloom, box blur for speed, star pattern support. Uses GPU and…☆66Jul 10, 2025Updated 7 months ago
- Gradient-Free Textual Inversion for Personalized Text-to-Image Generation☆44Jan 23, 2023Updated 3 years ago
- ☆72Jun 10, 2025Updated 8 months ago