guandeh17 / Self-Forcing
☆2,493 · Updated last month
Alternatives and similar repositories for Self-Forcing
Users who are interested in Self-Forcing are comparing it to the libraries listed below.
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models ☆885 · Updated 3 weeks ago
- Wan: Open and Advanced Large-Scale Video Generative Models ☆4,655 · Updated last week
- MAGI-1: Autoregressive Video Generation at Scale ☆3,459 · Updated 2 months ago
- A unified inference and post-training framework for accelerated video generation. ☆2,108 · Updated this week
- Scalable and memory-optimized training of diffusion models ☆1,273 · Updated 2 months ago
- Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model ☆1,997 · Updated last week
- Official implementations for paper: VACE: All-in-One Video Creation and Editing ☆3,179 · Updated 3 months ago
- [ICCV 2025] UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning ☆1,245 · Updated this week
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo ☆1,650 · Updated 3 months ago
- OmniGen2: Exploration to Advanced Multimodal Generation. ☆3,771 · Updated last month
- Lumina-Image 2.0: A Unified and Efficient Image Generative Framework ☆782 · Updated 2 months ago
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation ☆1,165 · Updated 2 months ago
- A pipeline parallel training script for diffusion models. ☆1,490 · Updated last week
- ☆768 · Updated last month
- ☆752 · Updated 6 months ago
- Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model ☆1,082 · Updated 2 months ago
- ☆1,025 · Updated 3 months ago
- [ICCV'25 Oral] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video ☆1,420 · Updated last month
- Pusa: Thousands Timesteps Video Diffusion Model ☆597 · Updated last week
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem… ☆1,603 · Updated last month
- The official implementation of CVPR'25 Oral paper "Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped No… ☆1,010 · Updated 3 weeks ago
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment ☆1,382 · Updated 2 months ago
- ☆2,404 · Updated last month
- Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing. ☆4,595 · Updated last week
- Official PyTorch implementation of One-Minute Video Generation with Test-Time Training ☆2,074 · Updated 2 months ago
- Open-source unified multimodal model ☆4,925 · Updated last week
- Enhance-A-Video: Better Generated Video for Free ☆567 · Updated 5 months ago
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer ☆1,755 · Updated 2 months ago
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer ☆630 · Updated 10 months ago
- MMaDA - Open-Sourced Multimodal Large Diffusion Language Models ☆1,331 · Updated 2 weeks ago