guandeh17 / Self-ForcingLinks
Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)
☆3,008Updated 3 months ago
Alternatives and similar repositories for Self-Forcing
Users that are interested in Self-Forcing are comparing it to the libraries listed below
Sorting:
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models☆1,128Updated 4 months ago
- MAGI-1: Autoregressive Video Generation at Scale☆3,620Updated 6 months ago
- A unified inference and post-training framework for accelerated video generation.☆2,878Updated this week
- HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation☆2,617Updated 2 months ago
- ☆1,728Updated last week
- Scalable and memory-optimized training of diffusion models☆1,312Updated 6 months ago
- [ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video☆1,681Updated last month
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo☆1,758Updated 7 months ago
- OmniGen2: Exploration to Advanced Multimodal Generation. https://arxiv.org/abs/2506.18871☆3,979Updated 3 weeks ago
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation☆1,196Updated 2 months ago
- Official inference repo for FLUX.2 models☆1,268Updated 3 weeks ago
- Light Video Generation Inference Framework☆1,336Updated this week
- [ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing☆3,537Updated 2 months ago
- Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.☆785Updated 4 months ago
- Native Multimodal Models are World Learners☆1,374Updated last month
- TurboDiffusion: 100–200× Acceleration for Video Diffusion Models☆1,749Updated this week
- [NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL☆1,808Updated last month
- Qwen-Image-Lightning: Speed up Qwen-Image model with distillation☆1,070Updated last week
- Lumina-Image 2.0: A Unified and Efficient Image Generative Framework☆844Updated last month
- HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning☆1,039Updated last week
- Pusa: Thousands Timesteps Video Diffusion Model☆669Updated 3 months ago
- ☆1,044Updated 7 months ago
- Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model☆2,581Updated last week
- [ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning☆1,343Updated 3 months ago
- A pipeline parallel training script for diffusion models.☆1,776Updated last week
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment☆1,469Updated 3 months ago
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆665Updated 2 months ago
- LongLive: Real-time Interactive Long Video Generation☆925Updated 3 weeks ago
- Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model☆1,213Updated 6 months ago
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling☆3,142Updated last year