guandeh17 / Self-ForcingLinks
Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)
☆2,742Updated last month
Alternatives and similar repositories for Self-Forcing
Users that are interested in Self-Forcing are comparing it to the libraries listed below
Sorting:
- MAGI-1: Autoregressive Video Generation at Scale☆3,524Updated 4 months ago
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models☆994Updated 2 months ago
- Scalable and memory-optimized training of diffusion models☆1,289Updated 4 months ago
- HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation☆2,320Updated 2 weeks ago
- OmniGen2: Exploration to Advanced Multimodal Generation.☆3,915Updated last month
- A unified inference and post-training framework for accelerated video generation.☆2,480Updated this week
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo☆1,707Updated 5 months ago
- Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.☆650Updated 2 months ago
- ☆753Updated 8 months ago
- Pusa: Thousands Timesteps Video Diffusion Model☆659Updated last month
- [ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning☆1,317Updated last month
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem…☆1,688Updated last month
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆639Updated last year
- Lumina-Image 2.0: A Unified and Efficient Image Generative Framework☆814Updated 4 months ago
- [ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing☆3,369Updated 2 weeks ago
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment☆1,443Updated last month
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆652Updated 2 weeks ago
- ☆779Updated 3 months ago
- A pipeline parallel training script for diffusion models.☆1,663Updated this week
- The official implementation of CVPR'25 Oral paper "Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped No…☆1,034Updated 2 weeks ago
- Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.☆5,855Updated last month
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer☆1,810Updated 3 months ago
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation☆1,188Updated 2 weeks ago
- Official implementation of BLIP3o-Series☆1,558Updated this week
- Enhance-A-Video: Better Generated Video for Free☆576Updated 7 months ago
- 🔥🔥 Open-sourced unified customization model☆1,165Updated last month
- ☆1,039Updated 5 months ago
- Official implementation of OneDiffusion paper (CVPR 2025)☆651Updated 10 months ago
- [CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis☆1,480Updated this week
- Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model☆2,343Updated last week